Commits · 993bb0e5be5c49ccb3a0082fba4a41ee5327acf9 · OpenDILab Open-Source Decision Intelligence Platform / DI-engine

15 Oct, 2021 3 commits
- fix(wyh): add test for rl_utils ppo and td (#89) · 993bb0e5
  Committed by Weiyuhong-1998 on Oct 15, 2021
```
* fix(wyh):test rl_utils code

* fix(wyh):modify rl utils bug ppo adv batch B,A

* fix(wyh):style

* fix(wyh):fix bug
```
- fix(nyz): fix pyyaml higher version compatibility bug · f849386c
  Committed by niuyazhe on Oct 15, 2021
- polish(nyz): remove torch in env and correct dizoo yapf format · 4b7e50c4
  Committed by niuyazhe on Oct 15, 2021
12 Oct, 2021 2 commits
- polish(nyz): polish sac and cql policy · f537adf0
  Committed by niuyazhe on Oct 12, 2021
- feature(nyz): add gym-hybrid hybrid action space env (#86) · 292f0246
  Committed by Swain on Oct 12, 2021
```
* feature(nyz): add gym-hybrid hybrid action space env

* style(nyz): update readme for gym_hybrid env
```
09 Oct, 2021 1 commit
- fix(nyz): fix gym version>0.20.0 pendulum-v0 bug(enable docker, smac docker) · fac84bcf
  Committed by niuyazhe on Oct 09, 2021
08 Oct, 2021 1 commit

feature(zlx): add vs bot training and self-play training with slime volley env (#23) · dbf432cd

Committed by LuciusMos on Oct 08, 2021

* slime volley env in dizoo, first commit

* fix bug in slime volley env

* modify volley env to satisfy ding 1v1 requirements; add naive self-play and league training pipeline(evaluator is not finished, now use a very naive one)

* adopt volley builtin ai as default eval opponent

* polish(nyz): polish slime_volley_env and its test

* feature(nyz): add slime_volley vs bot ppo demo

* feature(nyz): add battle_sample_serial_collector and adapt abnormal check in subprocess env manager

* feature(nyz): add slime volley self-play demo

* style(nyz): add slime_volleyball env gif and split MARL and selfplay label

* feature(nyz): add save replay function in slime volleyball env

Co-authored-by: zlx-sensetime <zhaoliangxuan@sensetime.com>
Co-authored-by: niuyazhe <niuyazhe@sensetime.com>

02 Oct, 2021 2 commits
- fix(nyz): fix test discrete cql config mismatch bug(enable docker, smac docker) · c500a2e5
  Committed by niuyazhe on Oct 02, 2021
- style(nyz): remove old container and old dqn config · 5da9e9fd
  Committed by niuyazhe on Oct 02, 2021
01 Oct, 2021 3 commits
- fix(nyz): fix test discrete cql unittest bug · e173e663
  Committed by niuyazhe on Oct 01, 2021
- test(nyz): polish unittest and fix remove ckpt dir bug · d6a1eaca
  Committed by niuyazhe on Oct 01, 2021
- style(nyz): fix typo and release multi python version bug(enable docker, smac docker) · 13450e65
  Committed by niuyazhe on Oct 01, 2021
30 Sep, 2021 7 commits
- v0.2.0 · 769401cc
  Committed by niuyazhe on Sep 30, 2021
- feature(davide): Implementation of D4PG (#76) · 16a89c35
  Committed by Davide Liu on Sep 30, 2021
```
* added experience replay and n-step

* implementing distributional q value

* added distributional q-value

* added overview in qac_dist and d4pg

* derived D4PG from DDPG

* fixed a bug when action shape >1

* benchmark D4PG mujoco + minor fixs

-entry for DDPG mujoco
-entry for D4PG mujoco
-config for D4PG mujoco
-fixed style D4PG code
-unittests for QAC distributional

* formatted code

* minor updates (read description)

-added d4pg seria_entry test
-updated comments in QACDIST
-added d4pg in commander register
-added q_value in d4pg return dict
-added priority update in d4pg entry
-added assertion in QACDIST
```
- feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for offlineRL. (#68) · 206186f1
  Committed by Yinmin.Zhang on Sep 30, 2021
```
* feature(zym): add offlineRL algo Discrete CQL.

* feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for offlineRL.
```
- style(nyz): add pypi release workflow(enable docker, smac docker) · 3f33b9d7
  Committed by niuyazhe on Sep 30, 2021
- fix(nyz): fix pytorch1.9.0 compatibility bug and change naive buffer log freq · 6d62a71f
  Committed by niuyazhe on Sep 30, 2021
- fix(nyz): disable pytorch1.9.0 in default setting · 21767320
  Committed by niuyazhe on Sep 30, 2021
- fix(nyz): fix il test unstable bug, update torch to 1.9.0, and polish readme · 231120f2
  Committed by niuyazhe on Sep 30, 2021
29 Sep, 2021 4 commits

fix(nyz): fix ppg atari config bug, and ppg atari entry, and update default eval_freq · 02279cdb
Committed by niuyazhe on Sep 28, 2021

feature(nyz): add smac docker (#80) · 13c3c9c2

Committed by Swain on Sep 29, 2021

* style(nyz): add ctools.pysc2 import in smac env

* feature(nyz): add smac docker build(enable docker, smac docker)

* fix(nyz): fix if condition syntax in deploy(enable docker, smac docker)

* fix(nyz): fix if condition syntax in deploy(enable docker, smac docker)

* fix(nyz): remove cache layer in smac docker(enable docker, smac docker)

* feature(nyz): use self-hosted runner in docker smac deploy(enable docker, smac docker)

* feature(nyz): build smac docker manually(enable docker, smac docker)

* feature(nyz): use docker buildx as default tool in smac and add SC2Map in setup(enable docker, smac docker)

* feature(nyz): add __init__.py in smac env maps(enable docker, smac docker)

fix(xjx): test failed when using system proxy (#79) · 3e88650e
Committed by Xu Jingxin on Sep 29, 2021

feature(nyz): add mujoco docker (#78) · 34479156

Committed by Swain on Sep 29, 2021

* feature(nyz): add docker_mujoco build and upgrada numpy version to 1.20.0(enable docker)

* fix(nyz): fix numpy version compatibility bug and add -y option in apt-get(enable docker)

* fix(nyz): add libosmesa6-dev in Dockerfile.env(enable docker)

* fix(nyz): add permanent env variable about mujoco(enable docker)

* fix(nyz): change sh source to .(enable docker)

* fix(nyz): set env variable in bashrc(enable docker)

* fix(nyz): fix pip typo(enable docker)

* fix(nyz): add env in dockerfile(enable docker)

28 Sep, 2021 2 commits

feature(pu): add WQMIX algorithm (#24) · 63feb629

Committed by 蒲源 on Sep 28, 2021

* add wqmix

* update annotation

* reformate

* update annotation

* update config

* fix annotation

* update as review

* fix as review

* add 5m6m MMM MMM2 config

* reformate

* fix(pu): fix rnn reset bug and add unittest

* fix(pu): fix rnn reset bug in centrally-weighted wqmix

* style(pu): yapf format and let WQMIXPolicy extend QMIXPolicy

* fix(pu): fix wqmix policy extend bug

* test(pu): add unittest test_wqmix

* fix(pu): fix mixer key bug in particle config

* feature(pu): add cooperative_navigation_wqmix_config

* style(pu): yapf format

* test(pu): change nn.Identity() to nn.Sequential()

* fix(pu): fix unittest bug in test_wqmix

feature(nyz): move atari_py to ale-py; split base and env docker build (#77) · fb90757d

Committed by Swain on Sep 28, 2021

* feature(nyz): move atari_py to ale-py and polish standard docker build(enable docker)

* fix(nyz): fix atari env import bug(enable docker)

* feature(nyz): add autorom install in docker(enable docker)

* feature(nyz): split base and env docker build(enable docker)

* fix(nyz): fix docker env source image bug(enable docker)

26 Sep, 2021 5 commits

feature(crb): add multi-discrete ppo and off policy ppo (#72) · f284a9ea
Committed by Robin Chen on Sep 26, 2021
```
* add md ppo

* add doc string
```
fix(xjx): fix the catch statments that will never succeed in test networks;... · f7bbf5d6
Committed by Xu Jingxin on Sep 26, 2021
```
fix(xjx): fix the catch statments that will never succeed in test networks; fix silence method (#71)
```

style(nyz): add docker deploy in github workflow (#70) · 13fa3b20

Committed by Swain on Sep 26, 2021

* style(nyz): add docker deploy workflow(enable docker)

* style(nyz): fix docker push info(enable docker)

* style(nyz): modify org name and image default name rule(enable docker)

* style(nyz): change default version to date(enable docker)

fix(hansbug): fix spawn context problem in interaction unittest (#69) · 91b84263

Committed by Swain on Sep 26, 2021

* fix(hansbug): try support spawn backend

* fix(hansbug): try fix the xxxxing problem in interaction spawn support && reformat the code style

* fix(nyz): disable silence decorator for spawn context interaction test

Co-authored-by: HansBug <killog@126.com>

fix(nyz): fix qmix double_q hidden state bug · 835e3c4c
Committed by niuyazhe on Sep 09, 2021

24 Sep, 2021 2 commits
- style(nyz): create codecov.yml for project coverage status · e22e5e43
  Committed by Swain on Sep 24, 2021
- style(nyz): fix DRL typo in README · 89e4a5de
  Committed by Swain on Sep 24, 2021
23 Sep, 2021 2 commits
- style(nyz): update algorithm paper link · 9ea60112
  Committed by Swain on Sep 23, 2021
- hotfix(nyz): fix import Tensor problem for torch incompatibility · 1e27817f
  Committed by niuyazhe on Sep 23, 2021
22 Sep, 2021 3 commits

feature(wyh):mappo and ippo win rate and time (#62) · 32acf1fa

Committed by Weiyuhong-1998 on Sep 22, 2021

* feature(wyh):mappo and ippo win rate and time

* feature(wyh):mappo and ippo epymarl win rate and time

* feature(wyh):smac epymarl commit id

Co-authored-by: 卫昱宏 <SENSETIME\weiyuhong@cn0214000504l.domain.sensetime.com>

fix(wyh): add plot function (#59) · 48d6c826

Committed by Weiyuhong-1998 on Sep 22, 2021

* fix(wyh): plot function

* fix(wyh): plot function pytest

* fix(wyh):plot function modify comments

* feature(wyh):plot style

Co-authored-by: weiyuhong <weiyuhong@sensetime.com>

hotfix(nyz): fix circle import bug in ding/utils · 07ceba40
Committed by niuyazhe on Sep 22, 2021

17 Sep, 2021 3 commits

feature(davide): add BSuite environment wrapper (#58) · 8050a9bb

Committed by Davide Liu on Sep 17, 2021

* start implementing bsuite env

* add bsuite env

* Implemented

* removed unused file

* added cartpole_swing environment

* Update test_bsuite_env.py

* added env in readme and in setup.py

* Create bsuite.png

feature(crb): update multi discrete policy(dqn, ppo, rainbow) (#51) · 332995e8

Committed by Robin Chen on Sep 17, 2021

* update md_dqn

* update offpolicy ppo

* add rainbow md policy

* format code

* del ppo; leave to future updates

* add doc string; fix rainbow returns

fix(pu): fix r2d2 done slice bug and LSTM hidden state reset bug (#52) · 2ffff07e

Committed by 蒲源 on Sep 17, 2021

* test rnd

* fix mz config

* fix config

* fix(pu): fix r2d2

* fix(puyuan): fix r2d2

* feature(puyuan): add minigrid r2d2 config

* polish minigrid config

* modified as review

* fix(pu): fix bugffor compatibility

* polish(pu): add annotations and polish slice operation

* style(pu): run format.sh

* style(pu): correct yapf format

* fix(pu): fix config

* fix(pu): fix done slice bug and lstm reset bug

* style(pu): format config

* polish(pu): polish config params for cartpole, lunarlander and minigrid

* polish(pu): polish minigrid config params

* Update r2d2.py

* polish(pu): polish rnn reset problem

* fix(pu): fix merge error

* polish(pu): polish cartpole config

* polish(nyz): polish cartpole r2d2 config for faster convergence

* test(nyz): enable r2d2 algotest

Co-authored-by: niuyazhe <niuyazhe@sensetime.com>

OpenDILab Open-Source Decision Intelligence Platform / DI-engine · last synced nearly 3 years ago
