- 15 Oct, 2021 3 commits
-
-
Committed by Weiyuhong-1998
* fix(wyh): test rl_utils code
* fix(wyh): modify rl utils bug ppo adv batch B,A
* fix(wyh): style
* fix(wyh): fix bug
-
Committed by niuyazhe
-
Committed by niuyazhe
-
- 12 Oct, 2021 2 commits
- 09 Oct, 2021 1 commit
-
-
Committed by niuyazhe
-
- 08 Oct, 2021 1 commit
-
-
Committed by LuciusMos
* slime volley env in dizoo, first commit
* fix bug in slime volley env
* modify volley env to satisfy ding 1v1 requirements; add naive self-play and league training pipeline (evaluator is not finished, now use a very naive one)
* adopt volley builtin ai as default eval opponent
* polish(nyz): polish slime_volley_env and its test
* feature(nyz): add slime_volley vs bot ppo demo
* feature(nyz): add battle_sample_serial_collector and adapt abnormal check in subprocess env manager
* feature(nyz): add slime volley self-play demo
* style(nyz): add slime_volleyball env gif and split MARL and selfplay label
* feature(nyz): add save replay function in slime volleyball env

Co-authored-by: zlx-sensetime <zhaoliangxuan@sensetime.com>
Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
-
- 02 Oct, 2021 2 commits
- 01 Oct, 2021 3 commits
- 30 Sep, 2021 7 commits
-
-
Committed by niuyazhe
-
Committed by Davide Liu
* added experience replay and n-step
* implementing distributional q value
* added distributional q-value
* added overview in qac_dist and d4pg
* derived D4PG from DDPG
* fixed a bug when action shape >1
* benchmark D4PG mujoco + minor fixes
  - entry for DDPG mujoco
  - entry for D4PG mujoco
  - config for D4PG mujoco
  - fixed style D4PG code
  - unittests for QAC distributional
* formatted code
* minor updates (read description)
  - added d4pg seria_entry test
  - updated comments in QACDIST
  - added d4pg in commander register
  - added q_value in d4pg return dict
  - added priority update in d4pg entry
  - added assertion in QACDIST
-
Committed by Yinmin.Zhang
* feature(zym): add offlineRL algo Discrete CQL.
* feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for offlineRL.
-
Committed by niuyazhe
-
Committed by niuyazhe
-
Committed by niuyazhe
-
Committed by niuyazhe
-
- 29 Sep, 2021 4 commits
-
-
Committed by niuyazhe
-
Committed by Swain
* style(nyz): add ctools.pysc2 import in smac env
* feature(nyz): add smac docker build (enable docker, smac docker)
* fix(nyz): fix if condition syntax in deploy (enable docker, smac docker)
* fix(nyz): fix if condition syntax in deploy (enable docker, smac docker)
* fix(nyz): remove cache layer in smac docker (enable docker, smac docker)
* feature(nyz): use self-hosted runner in docker smac deploy (enable docker, smac docker)
* feature(nyz): build smac docker manually (enable docker, smac docker)
* feature(nyz): use docker buildx as default tool in smac and add SC2Map in setup (enable docker, smac docker)
* feature(nyz): add __init__.py in smac env maps (enable docker, smac docker)
-
Committed by Xu Jingxin
-
Committed by Swain
* feature(nyz): add docker_mujoco build and upgrade numpy version to 1.20.0 (enable docker)
* fix(nyz): fix numpy version compatibility bug and add -y option in apt-get (enable docker)
* fix(nyz): add libosmesa6-dev in Dockerfile.env (enable docker)
* fix(nyz): add permanent env variable about mujoco (enable docker)
* fix(nyz): change sh source to . (enable docker)
* fix(nyz): set env variable in bashrc (enable docker)
* fix(nyz): fix pip typo (enable docker)
* fix(nyz): add env in dockerfile (enable docker)
-
- 28 Sep, 2021 2 commits
-
-
Committed by 蒲源
* add wqmix
* update annotation
* reformat
* update annotation
* update config
* fix annotation
* update as review
* fix as review
* add 5m6m MMM MMM2 config
* reformat
* fix(pu): fix rnn reset bug and add unittest
* fix(pu): fix rnn reset bug in centrally-weighted wqmix
* style(pu): yapf format and let WQMIXPolicy extend QMIXPolicy
* fix(pu): fix wqmix policy extend bug
* test(pu): add unittest test_wqmix
* fix(pu): fix mixer key bug in particle config
* feature(pu): add cooperative_navigation_wqmix_config
* style(pu): yapf format
* test(pu): change nn.Identity() to nn.Sequential()
* fix(pu): fix unittest bug in test_wqmix
-
Committed by Swain
* feature(nyz): move atari_py to ale-py and polish standard docker build (enable docker)
* fix(nyz): fix atari env import bug (enable docker)
* feature(nyz): add autorom install in docker (enable docker)
* feature(nyz): split base and env docker build (enable docker)
* fix(nyz): fix docker env source image bug (enable docker)
-
- 26 Sep, 2021 5 commits
-
-
Committed by Robin Chen
* add md ppo
* add doc string
-
Committed by Xu Jingxin
fix(xjx): fix the catch statements that will never succeed in test networks; fix silence method (#71)
-
Committed by Swain
* style(nyz): add docker deploy workflow (enable docker)
* style(nyz): fix docker push info (enable docker)
* style(nyz): modify org name and image default name rule (enable docker)
* style(nyz): change default version to date (enable docker)
-
Committed by Swain
* fix(hansbug): try support spawn backend
* fix(hansbug): try fix the xxxxing problem in interaction spawn support && reformat the code style
* fix(nyz): disable silence decorator for spawn context interaction test

Co-authored-by: HansBug <killog@126.com>
-
Committed by niuyazhe
-
- 24 Sep, 2021 2 commits
- 23 Sep, 2021 2 commits
- 22 Sep, 2021 3 commits
-
-
Committed by Weiyuhong-1998
* feature(wyh): mappo and ippo win rate and time
* feature(wyh): mappo and ippo epymarl win rate and time
* feature(wyh): smac epymarl commit id

Co-authored-by: 卫昱宏 <SENSETIME\weiyuhong@cn0214000504l.domain.sensetime.com>
-
Committed by Weiyuhong-1998
* fix(wyh): plot function
* fix(wyh): plot function pytest
* fix(wyh): plot function modify comments
* feature(wyh): plot style

Co-authored-by: weiyuhong <weiyuhong@sensetime.com>
-
Committed by niuyazhe
-
- 17 Sep, 2021 3 commits
-
-
Committed by Davide Liu
* start implementing bsuite env
* add bsuite env
* Implemented
* removed unused file
* added cartpole_swing environment
* Update test_bsuite_env.py
* added env in readme and in setup.py
* Create bsuite.png
-
Committed by Robin Chen
* update md_dqn
* update offpolicy ppo
* add rainbow md policy
* format code
* del ppo; leave to future updates
* add doc string; fix rainbow returns
-
Committed by 蒲源
* test rnd
* fix mz config
* fix config
* fix(pu): fix r2d2
* fix(puyuan): fix r2d2
* feature(puyuan): add minigrid r2d2 config
* polish minigrid config
* modified as review
* fix(pu): fix bug for compatibility
* polish(pu): add annotations and polish slice operation
* style(pu): run format.sh
* style(pu): correct yapf format
* fix(pu): fix config
* fix(pu): fix done slice bug and lstm reset bug
* style(pu): format config
* polish(pu): polish config params for cartpole, lunarlander and minigrid
* polish(pu): polish minigrid config params
* Update r2d2.py
* polish(pu): polish rnn reset problem
* fix(pu): fix merge error
* polish(pu): polish cartpole config
* polish(nyz): polish cartpole r2d2 config for faster convergence
* test(nyz): enable r2d2 algotest

Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
-