v0.2.0

API Change

  1. SampleCollector rename to SampleSerialCollector
  2. EpisodeCollector rename to EpisodeSerialCollector
  3. BaseSerialEvaluator rename to InteractionSerialEvaluator
  4. ZerglingCollector rename to ZerglingParallelCollector
  5. OneVsOneCollector rename to MarineParallelCollector
  6. AdvancedBuffer registry name from priority to advanced

Env (dizoo)

  1. overcooked env (#20)
  2. procgen env (#26)
  3. modified predator env (#30)
  4. d4rl env (#37)
  5. imagenet dataset (#27)
  6. bsuite env (#58)
  7. move atari_py to ale-py

Algorithm

  1. SQIL algorithm (#25) (#44)
  2. CQL algorithm (discrete/continuous) (#37) (#68)
  3. MAPPO algorithm (#62)
  4. WQMIX algorithm (#24)
  5. D4PG algorithm (#76)
  6. update multi-discrete policy(dqn, ppo, rainbow) (#51) (#72)

Enhancement

  1. image classification supervised training pipeline (#27)
  2. add force_reproducibility option in subprocess env manager
  3. add/delete/restart replicas via cli for k8s
  4. add league metric (trueskill and elo) (#22)
  5. add tb in naive buffer and modify tb in advanced buffer (#39)
  6. add k8s launcher and di-orchestrator launcher, add related unittest (#45) (#49)
  7. add hyper-parameter scheduler module (#38)
  8. add plot function (#59)

Fix

  1. acer weight bug and update atari result (#21)
  2. mappo nan bug and dict obs cannot unsqueeze bug (#54)
  3. r2d2 hidden state and obs pre-processing bug (#36) (#52)
  4. ppo bug when use dual_clip and adv > 0
  5. qmix double_q hidden state bug
  6. spawn context problem in interaction unittest (#69)
  7. formatted config no eval bug (#53)
  8. the catch statements that will never succeed and system proxy bug (#71) (#79)
  9. lunarlander config polish
  10. c51 head dimension mismatch bug
  11. mujoco config typo bug
  12. ppg atari config multi buffer bug
  13. max use and priority update special branch bug in advanced_buffer

Style

  1. add docker deploy in github workflow (#70) (#78) (#80)
  2. support PyTorch 1.9.0
  3. add algo/env list in README
  4. rename advanced_buffer register name to advanced

New Repo

Contributors: @PaParaZz1 @YinminZhang @Will-Nie @puyuan1996 @Weiyuhong-1998 @HansBug @sailxjx @simonat2011 @konnase @RobinC94 @LikeJulia @LuciusMos @jayyoung0802 @yifan123 @davide97l @garyzhang99

项目简介

OpenDILab Decision AI Engine

🚀 Github 镜像仓库 🚀

源项目地址

https://github.com/opendilab/DI-engine

发行版本 5

v0.2.2

全部发行版

贡献者 28

全部贡献者

开发语言

  • Python 99.8 %
  • Shell 0.2 %
  • Makefile 0.0 %