Commits · b1e9b4ea7027284ce3e1c5c6983f25b2f8a88888 · OpenDILab Open-Source Decision Intelligence Platform / DI-engine

29 Oct, 2021 · 4 commits

feature(lcm): add MBPO algorithm (#113) · b1e9b4ea

Committed by Swain on Oct 29, 2021

* feature(lcm): add MBPO algorithm (#87)

* add model-based rl

* fix yazhe's comments

* format

* pass flake8 test

* polish(nyz): polish mbpo import, name and test
Co-authored-by: lichuming <lichuming@lichumingdeMacBook-Pro.local>

b1e9b4ea

style(nyz): restrict deploy trigger branch · edb14698
Committed by niuyazhe on Oct 29, 2021

edb14698

style(nyz): modify doc and deploy trigger and update mujoco license download link(smac docker) · c77ebf78
Committed by niuyazhe on Oct 29, 2021

c77ebf78

feature(nyz): add PADDPG for hybrid action space as baseline (#109) · d2f79536

Committed by Swain on Oct 29, 2021

* fix(nyz): fix gym_hybrid env not scale action bug

* feature(nyz): add PADDPG basic implementation for hybrid action space

* fix(nyz): fix td3/d4pg comatibility bug with new modifications

* fix(nyz): fix hybrid ddpg action type grad bug and update config

* feature(nyz): add eps greedy + multinomial wrapper and gym_hybrid ddpg convergence config

* style(nyz): update PADDPG in README

* test_model_hybrid_qac

* fix_typo_in_README

* test_policy_hybrid_qac

* polish(nyz): polish hybrid action space to dict structure and polish unittest

* fix(nyz): fix td3bc compatibility bug
Co-authored-by: 李可 <like2@CN0014008466M.local>

d2f79536

28 Oct, 2021 · 1 commit

feature(nyz): add gobigger baseline (#95) · a8fec8bb

Committed by Swain on Oct 28, 2021

* feature(nyz): add gobigger baseline

* style(nyz): add gobigger env infor

* feature(nyz): add ignore prefix in default collate

* feautre(nyz): add vsbot training baseline

* fix(nyz): fix to_tensor empty list bug and polish gobigger baseline

* style(nyz): split gobigger baseline code

a8fec8bb

26 Oct, 2021 · 2 commits
- polish(nyz): polish collector benchmark test(enable docker, smac docker) · fed80b44
  Committed by niuyazhe on Oct 26, 2021
  
  fed80b44
- test(yzj): add unittest for dataset, metric_serial_evaluator and learner (#107) · 0414eda5
  Committed by jayyoung0802 on Oct 26, 2021
```
* add 4 pytest dataset.py learner_aggregator.py learner_hook.py metric_serial_evaluator.py

* fix yapf and flake8 And remove invalid self._env

* fix fake_cls_config.py flake8
```
  0414eda5
25 Oct, 2021 · 1 commit

test(wyh): add more unittest for ppo and sac policy (#104) · c5af1cf2

Committed by Weiyuhong-1998 on Oct 25, 2021

* fix(wyh):reward model test

* fix(wyh):sac ppo test

* fix(wyh):ppo_continuous test

* fix(wyh):style

* fix(wyh):ppo test
Co-authored-by: Swain <niuyazhe314@outlook.com>

c5af1cf2

22 Oct, 2021 · 4 commits

feature(zym): add offlineRL algo td3_bc and polish policy comments(#88) · 7c1b5e95

Committed by Yinmin.Zhang on Oct 22, 2021

* feature(zym): add offlineRL algo td3_bc.

* feature(zym): add offlineRL algo td3_bc.

* feature(zym): add offlineRL algo td3_bc.

* polish(zym): polish some annotations in td3/ddpg/sac/ppo; polish `_forward_collect` and `_foward_eval`.

* fix(lj): fix dimension bug in cql for continuous env.

* fix(zym): fix dimension bug in cql for continuous env.

* fix(zym): fix dimension bug in cql for continuous env.

* polish(zym): update README.md.

7c1b5e95

polish(nyz): fix ppo bugs and update atari ppo offpolicy config (#108) · 2d5ec7c3

Committed by Swain on Oct 22, 2021

* fix(nyz): fix ppo cuda bug and random collect bug

* config(nyz): add pong ppo off policy better config

* fix(nyz): fix ppo device bug in get_train_sample and update ppo offpolicy config

* style(nyz): correct yapf format

2d5ec7c3

fix(nyz): fix base policy model state_dict overlap bug · c2b14d48
Committed by niuyazhe on Oct 22, 2021

c2b14d48

style(nyz): restrict ale-py version to 0.7.0(enable docker, smac docker) · f82f369d
Committed by niuyazhe on Oct 22, 2021

f82f369d

21 Oct, 2021 · 3 commits

feature(xjx): test in pure docker environment (#103) · eee5c207
Committed by Xu Jingxin on Oct 21, 2021
```
* Test in docker

* Add docker test entry

* Trap exit

* Test in docker
```
eee5c207

feature(lk): add gym-soccer (HFO) env (#94) · 8f47f4cb

Committed by Ke Li on Oct 21, 2021

* add_soccer_env

* add_info

* close

* format

* test_gym_soccer

* rm_torch

* replay_log

* format_style

* add_gym_soccer_to_readme

* separate render_func

* add_gif_file

* scale_action

* flake_style_format

* resolve_review_comments

* add branch info for gym hybrid

8f47f4cb

polish(nyz): modify dizoo test mark to envtest(enable docker, smac docker) · f04b9eb7
Committed by niuyazhe on Oct 21, 2021

f04b9eb7

20 Oct, 2021 · 1 commit
- fix(xjx): replace distutil pyyaml with pip package (#99) · 0fcfdf26
  Committed by Xu Jingxin on Oct 20, 2021
  
  0fcfdf26
19 Oct, 2021 · 1 commit
- polish(nyp): polish dqfd policy, entry and config(#98) · aa8508bb
  Committed by Will-Nie on Oct 19, 2021
  
  aa8508bb
17 Oct, 2021 · 1 commit
- test(nyz): add test for ding/utils and remove DistributionImage · ad394fc5
  Committed by niuyazhe on Oct 17, 2021
  
  ad394fc5
16 Oct, 2021 · 3 commits

fix(wyh): add model test and policy/entry test and remove unused qacd(#92) · 1568e53d
Committed by Weiyuhong-1998 on Oct 16, 2021
```
* fix(wyh):model test and policy/entry test

* fix(wyh):delect qacd

* fix(wyh):test serial entry onpolicy
```
1568e53d

feature(nyp): add DQfD algorithm (#48) · e2ca8738

Committed by Will-Nie on Oct 16, 2021

* add_dqfd

* Is_expert to is_expert

* modify according to the last commnets

* value_gamma; done; marginloss; sqil compatibility

* finally shorten the code, revise config

* revise config, style

* add_readme/two_more_config

* correct format
Co-authored-by: niuyazhe <niuyazhe@sensetime.com>

e2ca8738

fix(nyz): fix test ppo continuous input range bug(enable docker, smac docker) · 8efee984
Committed by niuyazhe on Oct 16, 2021

8efee984

15 Oct, 2021 · 3 commits
- fix(wyh): add test for rl_utils ppo and td (#89) · 993bb0e5
  Committed by Weiyuhong-1998 on Oct 15, 2021
```
* fix(wyh):test rl_utils code

* fix(wyh):modify rl utils bug ppo adv batch B,A

* fix(wyh):style

* fix(wyh):fix bug
```
  993bb0e5
- fix(nyz): fix pyyaml higher version compatibility bug · f849386c
  Committed by niuyazhe on Oct 15, 2021
  
  f849386c
- polish(nyz): remove torch in env and correct dizoo yapf format · 4b7e50c4
  Committed by niuyazhe on Oct 15, 2021
  
  4b7e50c4
12 Oct, 2021 · 2 commits
- polish(nyz): polish sac and cql policy · f537adf0
  Committed by niuyazhe on Oct 12, 2021
  
  f537adf0
- feature(nyz): add gym-hybrid hybrid action space env (#86) · 292f0246
  Committed by Swain on Oct 12, 2021
```
* feature(nyz): add gym-hybrid hybrid action space env

* style(nyz): update readme for gym_hybrid env
```
  292f0246
09 Oct, 2021 · 1 commit
- fix(nyz): fix gym version>0.20.0 pendulum-v0 bug(enable docker, smac docker) · fac84bcf
  Committed by niuyazhe on Oct 09, 2021
  
  fac84bcf
08 Oct, 2021 · 1 commit

feature(zlx): add vs bot training and self-play training with slime volley env (#23) · dbf432cd

Committed by LuciusMos on Oct 08, 2021

* slime volley env in dizoo, first commit

* fix bug in slime volley env

* modify volley env to satisfy ding 1v1 requirements; add naive self-play and league training pipeline(evaluator is not finished, now use a very naive one)

* adopt volley builtin ai as default eval opponent

* polish(nyz): polish slime_volley_env and its test

* feature(nyz): add slime_volley vs bot ppo demo

* feature(nyz): add battle_sample_serial_collector and adapt abnormal check in subprocess env manager

* feature(nyz): add slime volley self-play demo

* style(nyz): add slime_volleyball env gif and split MARL and selfplay label

* feature(nyz): add save replay function in slime volleyball env
Co-authored-by: zlx-sensetime <zhaoliangxuan@sensetime.com>
Co-authored-by: niuyazhe <niuyazhe@sensetime.com>

dbf432cd

02 Oct, 2021 · 2 commits
- fix(nyz): fix test discrete cql config mismatch bug(enable docker, smac docker) · c500a2e5
  Committed by niuyazhe on Oct 02, 2021
  
  c500a2e5
- style(nyz): remove old container and old dqn config · 5da9e9fd
  Committed by niuyazhe on Oct 02, 2021
  
  5da9e9fd
01 Oct, 2021 · 3 commits
- fix(nyz): fix test discrete cql unittest bug · e173e663
  Committed by niuyazhe on Oct 01, 2021
  
  e173e663
- test(nyz): polish unittest and fix remove ckpt dir bug · d6a1eaca
  Committed by niuyazhe on Oct 01, 2021
  
  d6a1eaca
- style(nyz): fix typo and release multi python version bug(enable docker, smac docker) · 13450e65
  Committed by niuyazhe on Oct 01, 2021
  
  13450e65
30 Sep, 2021 · 7 commits
- v0.2.0 · 769401cc
  Committed by niuyazhe on Sep 30, 2021
  
  769401cc
- feature(davide): Implementation of D4PG (#76) · 16a89c35
  Committed by Davide Liu on Sep 30, 2021
```
* added experience replay and n-step

* implementing distributional q value

* added distributional q-value

* added overview in qac_dist and d4pg

* derived D4PG from DDPG

* fixed a bug when action shape >1

* benchmark D4PG mujoco + minor fixs

-entry for DDPG mujoco
-entry for D4PG mujoco
-config for D4PG mujoco
-fixed style D4PG code
-unittests for QAC distributional

* formatted code

* minor updates (read description)

-added d4pg seria_entry test
-updated comments in QACDIST
-added d4pg in commander register
-added q_value in d4pg return dict
-added priority update in d4pg entry
-added assertion in QACDIST
```
  16a89c35
- feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for offlineRL. (#68) · 206186f1
  Committed by Yinmin.Zhang on Sep 30, 2021
```
* feature(zym): add offlineRL algo Discrete CQL.

* feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for offlineRL.
```
  206186f1
- style(nyz): add pypi release workflow(enable docker, smac docker) · 3f33b9d7
  Committed by niuyazhe on Sep 30, 2021
  
  3f33b9d7
- fix(nyz): fix pytorch1.9.0 compatibility bug and change naive buffer log freq · 6d62a71f
  Committed by niuyazhe on Sep 30, 2021
  
  6d62a71f
- fix(nyz): disable pytorch1.9.0 in default setting · 21767320
  Committed by niuyazhe on Sep 30, 2021
  
  21767320
- fix(nyz): fix il test unstable bug, update torch to 1.9.0, and polish readme · 231120f2
  Committed by niuyazhe on Sep 30, 2021
  
  231120f2

OpenDILab Open-Source Decision Intelligence Platform / DI-engine · last synced more than 2 years ago