v0.2.0

niuyazhe@sensetime.com

v0.2.0

API Change

SampleCollector rename to SampleSerialCollector
EpisodeCollector rename to EpisodeSerialCollector
BaseSerialEvaluator rename to InteractionSerialEvaluator
ZerglingCollector rename to ZerglingParallelCollector
OneVsOneCollector rename to MarineParallelCollector
AdvancedBuffer registry name from priority to advanced

Env (dizoo)

overcooked env (#20)
procgen env (#26)
modified predator env (#30)
d4rl env (#37)
imagenet dataset (#27)
bsuite env (#58)
move atari_py to ale-py

Algorithm

SQIL algorithm (#25) (#44)
CQL algorithm (discrete/continuous) (#37) (#68)
MAPPO algorithm (#62)
WQMIX algorithm (#24)
D4PG algorithm (#76)
update multi-discrete policy(dqn, ppo, rainbow) (#51) (#72)

Enhancement

image classification supervised training pipeline (#27)
add force_reproducibility option in subprocess env manager
add/delete/restart replicas via cli for k8s
add league metric (trueskill and elo) (#22)
add tb in naive buffer and modify tb in advanced buffer (#39)
add k8s launcher and di-orchestrator launcher, add related unittest (#45) (#49)
add hyper-parameter scheduler module (#38)
add plot function (#59)

Fix

acer weight bug and update atari result (#21)
mappo nan bug and dict obs cannot unsqueeze bug (#54)
r2d2 hidden state and obs pre-processing bug (#36) (#52)
ppo bug when use dual_clip and adv > 0
qmix double_q hidden state bug
spawn context problem in interaction unittest (#69)
formatted config no eval bug (#53)
the catch statements that will never succeed and system proxy bug (#71) (#79)
lunarlander config polish
c51 head dimension mismatch bug
mujoco config typo bug
ppg atari config multi buffer bug
max use and priority update special branch bug in advanced_buffer

Style

add docker deploy in github workflow (#70) (#78) (#80)
support PyTorch 1.9.0
add algo/env list in README
rename advanced_buffer register name to advanced

New Repo

DI-treetensor: Tree Nested PyTorch Tensor Lib

Contributors: @PaParaZz1 @YinminZhang @Will-Nie @puyuan1996 @Weiyuhong-1998 @HansBug @sailxjx @simonat2011 @konnase @RobinC94 @LikeJulia @LuciusMos @jayyoung0802 @yifan123 @davide97l @garyzhang99

项目简介

OpenDILab Decision AI Engine

🚀 Github 镜像仓库 🚀

源项目地址 ⬇ ⬇ ⬇

https://github.com/opendilab/DI-engine

Apache License 2.0
文件大小 135.7 MB
仓库大小 135.8 MB

发行版本 5

v0.2.2

12月 04, 2021

全部发行版

贡献者 28

全部贡献者

开发语言

Python 99.8 %
Shell 0.2 %
Makefile 0.0 %

OpenDILab开源决策智能平台 / DI-engine 上一次同步 2 年多