v0.2.0
API Change
- SampleCollector rename to SampleSerialCollector
- EpisodeCollector rename to EpisodeSerialCollector
- BaseSerialEvaluator rename to InteractionSerialEvaluator
- ZerglingCollector rename to ZerglingParallelCollector
- OneVsOneCollector rename to MarineParallelCollector
- AdvancedBuffer registry name from priority to advanced
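The renames above can be applied mechanically to user code and configs. The following is only a minimal sketch of such a migration: `migrate_source` and `migrate_buffer_cfg` are hypothetical helpers, not part of DI-engine, and the assumption that the buffer registry name is stored under a `type` key should be checked against your own config.

```python
import re

# Old class name -> new class name, taken from the API Change list.
RENAMED_CLASSES = {
    "SampleCollector": "SampleSerialCollector",
    "EpisodeCollector": "EpisodeSerialCollector",
    "BaseSerialEvaluator": "InteractionSerialEvaluator",
    "ZerglingCollector": "ZerglingParallelCollector",
    "OneVsOneCollector": "MarineParallelCollector",
}

# Old buffer registry name -> new buffer registry name.
RENAMED_REGISTRY_NAMES = {"priority": "advanced"}

# Whole-word match, so already-migrated names are left untouched.
_CLASS_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(name) for name in RENAMED_CLASSES) + r")\b"
)


def migrate_source(text: str) -> str:
    """Hypothetical helper: rewrite old class names in source text."""
    return _CLASS_PATTERN.sub(lambda m: RENAMED_CLASSES[m.group(1)], text)


def migrate_buffer_cfg(cfg: dict) -> dict:
    """Hypothetical helper: return a copy of a buffer config with the
    registry name updated. Assumes the name lives under the 'type' key."""
    new_cfg = dict(cfg)
    old_name = new_cfg.get("type")
    if old_name in RENAMED_REGISTRY_NAMES:
        new_cfg["type"] = RENAMED_REGISTRY_NAMES[old_name]
    return new_cfg
```

Running `migrate_source` twice gives the same result as running it once, because none of the new names contains an old name as a whole word.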
Env (dizoo)
- overcooked env (#20)
- procgen env (#26)
- modified predator env (#30)
- d4rl env (#37)
- imagenet dataset (#27)
- bsuite env (#58)
- move atari_py to ale-py
Algorithm
- SQIL algorithm (#25) (#44)
- CQL algorithm (discrete/continuous) (#37) (#68)
- MAPPO algorithm (#62)
- WQMIX algorithm (#24)
- D4PG algorithm (#76)
- update multi-discrete policy (dqn, ppo, rainbow) (#51) (#72)
Enhancement
- image classification supervised training pipeline (#27)
- add force_reproducibility option in subprocess env manager
- add/delete/restart replicas via cli for k8s
- add league metric (trueskill and elo) (#22)
- add tensorboard (tb) logging in naive buffer and modify tb logging in advanced buffer (#39)
- add k8s launcher and di-orchestrator launcher, add related unittest (#45) (#49)
- add hyper-parameter scheduler module (#38)
- add plot function (#59)
Fix
- acer weight bug and update atari result (#21)
- mappo nan bug and dict obs cannot unsqueeze bug (#54)
- r2d2 hidden state and obs pre-processing bug (#36) (#52)
- ppo bug when using dual_clip and adv > 0
- qmix double_q hidden state bug
- spawn context problem in interaction unittest (#69)
- formatted config no eval bug (#53)
- catch statements that can never succeed and system proxy bug (#71) (#79)
- lunarlander config polish
- c51 head dimension mismatch bug
- mujoco config typo bug
- ppg atari config multi buffer bug
- max use and priority update special branch bug in advanced_buffer
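For context on the ppo fix above involving dual_clip and adv > 0: in the dual-clip PPO objective, the extra bound applies only when the advantage is negative, and samples with a positive advantage use the standard clipped surrogate. A minimal single-sample sketch of that objective follows. It is not DI-engine's implementation; the function name and the default values of `clip_ratio` and `dual_clip` are assumptions chosen for illustration.

```python
def dual_clip_ppo_objective(ratio: float, adv: float,
                            clip_ratio: float = 0.2,
                            dual_clip: float = 3.0) -> float:
    """Per-sample dual-clip PPO surrogate (to be maximized).

    ratio: new_policy_prob / old_policy_prob for the taken action.
    adv: advantage estimate for that action.
    dual_clip: lower-bound coefficient, expected to be > 1.
    """
    # Standard PPO clipped surrogate.
    clipped_ratio = min(max(ratio, 1.0 - clip_ratio), 1.0 + clip_ratio)
    surrogate = min(ratio * adv, clipped_ratio * adv)
    # The dual clip bounds the objective from below, only for adv < 0.
    if adv < 0:
        surrogate = max(surrogate, dual_clip * adv)
    return surrogate
```

With a large ratio and a negative advantage, the plain clipped surrogate is unbounded below, and the dual clip caps it at `dual_clip * adv`. With a positive advantage the result is unchanged from standard PPO.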
Style
- add docker deploy in github workflow (#70) (#78) (#80)
- support PyTorch 1.9.0
- add algo/env list in README
- rename advanced_buffer registry name to advanced
New Repo
- DI-treetensor: Tree Nested PyTorch Tensor Lib
Contributors: @PaParaZz1 @YinminZhang @Will-Nie @puyuan1996 @Weiyuhong-1998 @HansBug @sailxjx @simonat2011 @konnase @RobinC94 @LikeJulia @LuciusMos @jayyoung0802 @yifan123 @davide97l @garyzhang99