https://gitcode.net/opendilab/DI-engine/-/commit/9e6de54883c90d77d84f86fc07eebb77200ee54b
polish(pu):polish td3_vae config
2021-12-22T10:38:06+08:00
puyuan1996 <2402552459@qq.com>

https://gitcode.net/opendilab/DI-engine/-/commit/9dd84dd319720a6fdce53d4f43429aca32cae5ab
polish(pu): polish vae structure, use add not concat between the embeddings of obs and action, use tanh after sample z and after the reconstruction_action head
2021-12-23T20:36:37+08:00
puyuan1996 <2402552459@qq.com>

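The VAE change described in commit 9dd84dd3 (add rather than concatenate the obs and action embeddings, and apply tanh after sampling z and after the reconstruction_action head) can be illustrated with a small framework-free sketch. This is not the DI-engine implementation: the class name ToyActionVAE, the layer sizes, the ReLU activations, and the way the decoder reuses the obs embedding are all assumptions made for illustration.

```python
import math
import random


def make_linear(in_dim, out_dim, rng):
    """Return (weight, bias) for a dense layer with small random weights."""
    weight = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weight, bias


def linear(params, x):
    """Apply a dense layer: y = W x + b."""
    weight, bias = params
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weight, bias)]


def relu(v):
    return [max(0.0, x) for x in v]


def tanh(v):
    return [math.tanh(x) for x in v]


class ToyActionVAE:
    """Hypothetical toy model; names and sizes are illustrative only."""

    def __init__(self, obs_dim, action_dim, hidden_dim, latent_dim, seed=0):
        self.rng = random.Random(seed)
        # Both embeddings share hidden_dim so they can be added elementwise.
        self.obs_embed = make_linear(obs_dim, hidden_dim, self.rng)
        self.action_embed = make_linear(action_dim, hidden_dim, self.rng)
        self.mu_head = make_linear(hidden_dim, latent_dim, self.rng)
        self.log_var_head = make_linear(hidden_dim, latent_dim, self.rng)
        self.latent_embed = make_linear(latent_dim, hidden_dim, self.rng)
        self.recon_head = make_linear(hidden_dim, action_dim, self.rng)

    def forward(self, obs, action):
        obs_emb = relu(linear(self.obs_embed, obs))
        action_emb = relu(linear(self.action_embed, action))
        # Add, not concat: the fused vector keeps size hidden_dim.
        fused = [o + a for o, a in zip(obs_emb, action_emb)]
        mu = linear(self.mu_head, fused)
        log_var = linear(self.log_var_head, fused)
        # Reparameterized sample, then tanh so z lies in [-1, 1].
        z = [m + math.exp(0.5 * lv) * self.rng.gauss(0.0, 1.0)
             for m, lv in zip(mu, log_var)]
        z = tanh(z)
        # Assumption: the decoder also combines its inputs by addition.
        latent_emb = relu(linear(self.latent_embed, z))
        decoder_in = [l + o for l, o in zip(latent_emb, obs_emb)]
        # tanh after the reconstruction head bounds the action to [-1, 1].
        reconstruction_action = tanh(linear(self.recon_head, decoder_in))
        return {
            "fused": fused,
            "mu": mu,
            "log_var": log_var,
            "z": z,
            "reconstruction_action": reconstruction_action,
        }
```

With addition, the encoder input width stays at hidden_dim instead of doubling as it would with concatenation, and the two tanh calls keep both the latent action and the reconstructed action inside [-1, 1].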
https://gitcode.net/opendilab/DI-engine/-/commit/3f7e2130e1e0ff38cb213f92f31cb36a9aa01098
polish(pu):polish kl weight and prediction weight
2021-12-24T18:11:04+08:00
puyuan1996 <2402552459@qq.com>

https://gitcode.net/opendilab/DI-engine/-/commit/c7d85c97bd5daba2a4d232a97983017be8857767
polish(pu):polish td3_vae using the best setting
2021-12-26T15:25:57+08:00
puyuan1996 <2402552459@qq.com>

https://gitcode.net/opendilab/DI-engine/-/commit/96ea36240859d9104d69808be205b9c11b2802c4
style(pu): yapf format
2021-12-26T15:59:17+08:00
puyuan1996 <2402552459@qq.com>

https://gitcode.net/opendilab/DI-engine/-/commit/6ca776427043da8f4e8ef59c6bc98c4fe572a0fd
polish(pu):polish config
2021-12-26T16:01:43+08:00
puyuan1996 <2402552459@qq.com>

https://gitcode.net/opendilab/DI-engine/-/commit/70328aabcd696d3c7691fb8beb3394198a062420
fix(pu): fix bug when collector_env_num>1, the self._traj_buffer is not empty and will leave over the data in random collect phase
2021-12-26T19:09:44+08:00
puyuan1996 <2402552459@qq.com>