Commits · dd4472e4e04259c563c2b1bca4426d39f18d25fb · OpenDILab Open-Source Decision Intelligence Platform / DI-engine

02 Sep, 2021 · 4 commits

Merge branch 'main' of https://github.com/opendilab/DI-engine · dd4472e4
Committed by niuyazhe on Sep 02, 2021

dd4472e4

hotfix(nyz): fix cartpole ppg value buffer sample typo · da19fdbd
Committed by niuyazhe on Sep 02, 2021

da19fdbd

Committed by Swain on Sep 02, 2021

* feature(nyz): add trueskill as league metric, naive elo calculator, fix game_env info bug

* fix(nyz): fix league player mutate bug

* fix(nyz): fix league unittest bug

* feature(nyz): add elo ranking in league metric env

* polish(nyz): modify fixed eval policy and trueskill init

* feature(nyz): add init main player in evaluation and fix stop_value bug

* style(nyz): rename test_league_metric to avoid pyc cache bug

d24f1f3d

hotfix(nyz): fix max use and priority update special branch bug · 19020398
Committed by niuyazhe on Sep 02, 2021

19020398

31 Aug, 2021 · 1 commit
- feature(nyz): add sample_range arg in replay buffer · 7e9e4e88
  Committed by niuyazhe on Aug 31, 2021
  
  7e9e4e88
27 Aug, 2021 · 4 commits
- Merge branch 'main' of https://github.com/opendilab/DI-engine · 608fee41
  Committed by niuyazhe on Aug 27, 2021
  
  608fee41
- hotfix(nyz): fix random policy typo in serial entry and base policy model device problem · 60a6867b
  Committed by niuyazhe on Aug 27, 2021
  
  60a6867b
- fix(zym): modify default setting in mujoco (#35) · 6326529c
  Committed by Yinmin.Zhang on Aug 27, 2021
  
  6326529c
- polish(nyz): polish cartpole dqn visualize demo and add solo eval demo · 020eba28
  Committed by niuyazhe on Aug 27, 2021
  
  020eba28
26 Aug, 2021 · 1 commit
- style(nyz): add roadmap link in readme and correct format · 2c03244a
  Committed by niuyazhe on Aug 26, 2021
  
  2c03244a
25 Aug, 2021 · 2 commits
- style(nyz): rename advanced_buffer register name to advanced · 84583d44
  Committed by niuyazhe on Aug 25, 2021
  
  84583d44
- style(nyz): update coveragerc with new entry cli code and fix config inf replace · 6f500a5d
  Committed by niuyazhe on Aug 25, 2021
  
  6f500a5d
24 Aug, 2021 · 3 commits
- test(nyz): add sqil unittest and algotest, remove adder comment in policy, polish sqil config · 42e31ea2
  Committed by niuyazhe on Aug 24, 2021
  
  42e31ea2
- hotfix(nyz): fix mujoco benchmark config typos · f84d76e2
  Committed by niuyazhe on Aug 24, 2021
  
  f84d76e2
- feature(ljw): add/delete/restart replicas via cli for k8s · 2ef3ad6c
  Committed by lijianwen on Aug 24, 2021
  
  2ef3ad6c
23 Aug, 2021 · 3 commits
- Merge branch 'main' of https://github.com/opendilab/DI-engine · cffb5b27
  Committed by niuyazhe on Aug 23, 2021
  
  cffb5b27
- hotfix(nyz): fix c51 head dimension mismatch bug and ppo config mismatch bug · 0453f9cc
  Committed by niuyazhe on Aug 23, 2021
  
  0453f9cc
- Dev modified predator prey (#30) · 98e2c133
  Committed by Jie Liu on Aug 23, 2021
```
* add modified predator_prey env

* add collision_ratio

* add readme and cfg for modified_predator_prey env

* add readme imgs for modified_predator_prey

* check format

* fix format
```
  98e2c133
20 Aug, 2021 · 1 commit

SQIL (#25) · 9929dc37

Committed by Will-Nie on Aug 20, 2021

* add sqil

* conceal all the personal info

* revise according to the comments

* correct_format

* add_comment to hardcodes part

* pass flake8

* add force_reproducibility = True; device, ex_model

* check format

9929dc37

19 Aug, 2021 · 2 commits
- add procgen env demo(#26) · 9bc39314
  Committed by Weiyuhong-1998 on Aug 19, 2021
  
  9bc39314
- style(nyz): add scipy requirement · 78c1dee3
  Committed by niuyazhe on Aug 19, 2021
  
  78c1dee3
13 Aug, 2021 · 1 commit

fix ACER's bug. update Qbert and space invader's config and result (#21) · e51fd711

Committed by simonat2011 on Aug 13, 2021

* fix weight bug

* update acer qbert result

* fix flake8 format problem

* update space qbert config

* update as review
Co-authored-by: shenziju <simonshen2011@foxmail.com>

e51fd711

11 Aug, 2021 · 2 commits
- hotfix(nyz): fix lunarlander dqn config and get formatted config · 47315983
  Committed by niuyazhe on Aug 11, 2021
  
  47315983
- style(nyz): add bipedalwalker env graph · fd908cdc
  Committed by niuyazhe on Aug 11, 2021
  
  fd908cdc
10 Aug, 2021 · 1 commit

add overcooked environment (#20) · c1d22458

Committed by garyzhang99 on Aug 10, 2021

* init runnable ppo

* init overcooked env

* overcooked ppo in place

* runnable ppo with shaped rewards

* modified config

* feature(nyz): modify win rate calculation with draws

* remove redundant code, modified baseline model

* Update __init__.py

* Update config.py

* modify temp_config_file.close() position in config.py to work in windows os

* remove redundant comments and rename files

* fix name bug and use namedlist

* add simple readme and remove redundant comments from copies

* resolve threads

* remove debug comments
Co-authored-by: niuyazhe <niuyazhe314@outlook.com>

c1d22458

06 Aug, 2021 · 1 commit
- feature(nyz): add force_reproducibility option in subprocess env manager · a9749d28
  Committed by niuyazhe on Aug 06, 2021
  
  a9749d28
03 Aug, 2021 · 7 commits

hotfix(nyz): fix return bug when adv_norm=True and remove unused normalize_advantage field · cf382d72
Committed by niuyazhe on Aug 03, 2021

cf382d72

hotfix(nyz): fix qtran unittest import bug and qtran hidden size list bug · e2794fcb
Committed by niuyazhe on Aug 03, 2021

e2794fcb

feature(lj): add new smac benchmark add qtran algo, polish env readme,fix double_q bug · 1797568a
Committed by niuyazhe on Aug 01, 2021

1797568a

hotfix(nyz): fix dataloader deadlock bug when interrupted · f320fa12
Committed by niuyazhe on Aug 03, 2021

f320fa12

hotfix(nyz): fix parallel algotest config deepcopy bug · 2f0d4a12
Committed by niuyazhe on Aug 03, 2021

2f0d4a12

v0.1.1 · b7ec2d6c
Committed by niuyazhe on Aug 03, 2021

b7ec2d6c

serial training league demo (#12) · 73295c22

Committed by Swain on Aug 03, 2021

* feature(nyz): add naive 1v1 two player demo

* feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation

* feature(nyz): modify game env and adjust hyper-param

* feature(nyz): add naive league training multi player demo

* feature(nyz): enable force snapshot to support init historical league player; finish league demo basic code

* feature(nyz): modify selfplay demo and add two type game env

* style(nyz): correct format style

* polish(nyz): correct format style and adapt league demo main

* feature(nyz): add league payoff viz and enable payoff update in league demo

* feature(nyz): modify win rate calculation with draws

* test(nyz): fix one vs one league test compatibility bug

* test(nyz): add selfplay and league demo into unittest and algotest

* style(nyz): correct format

* hotfix(nyz): fix ppo continuous compatibility bug

73295c22

01 Aug, 2021 · 2 commits

add ACER algorithm(szj) (#14) · dd4de1a0

Committed by simonat2011 on Aug 01, 2021

* add enduro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config.

* modified as reviewer mentions

* add qacd network

* fix bugs

* fix bugs

* update acer algorithm

* update ACER code

* update acer config

* fix bug

* update pong acer's config

* edit commit

* update code as mention

* fix the comment table and trust region

* fix format

* fix typing lint

* fix format,flake8

* fix format

* fix whitespace problem

* test(nyz): add acer unittest and algotest

* style(nyz): correct flake8 style
Co-authored-by: shenziju <simonshen2011@foxmail.com>
Co-authored-by: Swain <niuyazhe314@outlook.com>

dd4de1a0

add pybullet env (#16) · dc161ea5

Committed by Yinmin.Zhang on Aug 01, 2021

* add pybullet envs.

* add td3/ddpg/sac/ppo configs for pybullet.

* update td3/ddpg/sac/ppo configs for pybullet.

* update td3 configs; remove td3 model.

dc161ea5

29 Jul, 2021 · 4 commits
- hotfix(nyz): fix qacd model style · 097ee4db
  Committed by niuyazhe on Jul 29, 2021
  
  097ee4db
- add enduro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config. (#11) · 6e8a746d
  Committed by simonat2011 on Jul 29, 2021
```
* add enduro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config.

* modified as reviewer mentions

* add qacd network

* fix bugs

* update dizoo readme

* add README.md about max reward result

* update dqn config and update README
Co-authored-by: shenziju <simonshen2011@foxmail.com>
Co-authored-by: simon shen <simon@simondeMacBook-Air.local>
Co-authored-by: Swain <niuyazhe314@outlook.com>
```
  6e8a746d
- polish(nyz): polish cartpole ppo demo and related unittest · 4e833da2
  Committed by niuyazhe on Jul 29, 2021
  
  4e833da2
- Merge pull request #9 from YinminZhang/dev-on-policy · 3243c92d
  Committed by Swain on Jul 29, 2021
```
on policy ppo (#9)
```
  3243c92d
23 Jul, 2021 · 1 commit
- Merge branch 'main' of https://github.com/opendilab/DI-engine into dev-on-policy · c96194f8
  Committed by zhangyinmin on Jul 23, 2021
```
# Conflicts:
#	ding/policy/common_utils.py
#	ding/policy/ppo.py
```
  c96194f8

OpenDILab Open-Source Decision Intelligence Platform / DI-engine · last synced nearly 3 years ago
