1. 23 Sep 2021, 2 commits
  2. 22 Sep 2021, 3 commits
  3. 17 Sep 2021, 4 commits
    • feature(davide): add BSuite environment wrapper (#58) · 8050a9bb
      Davide Liu committed
      * start implementing bsuite env
      
      * add bsuite env
      
      * Implemented
      
      * removed unused file
      
      * added cartpole_swing environment
      
      * Update test_bsuite_env.py
      
      * added env in readme and in setup.py
      
      * Create bsuite.png
    • feature(crb): update multi discrete policy(dqn, ppo, rainbow) (#51) · 332995e8
      Robin Chen committed
      * update md_dqn
      
      * update offpolicy ppo
      
      * add rainbow md policy
      
      * format code
      
      * del ppo; leave to future updates
      
      * add doc string; fix rainbow returns
    • fix(pu): fix r2d2 done slice bug and LSTM hidden state reset bug (#52) · 2ffff07e
      蒲源 committed
      * test rnd
      
      * fix mz config
      
      * fix config
      
      * fix(pu): fix r2d2
      
      * fix(puyuan): fix r2d2
      
      * feature(puyuan): add minigrid r2d2 config
      
      * polish minigrid config
      
      * modified as review
      
      * fix(pu): fix bug for compatibility
      
      * polish(pu): add annotations and polish slice operation
      
      * style(pu): run format.sh
      
      * style(pu): correct yapf format
      
      * fix(pu): fix config
      
      * fix(pu): fix done slice bug and lstm reset bug
      
      * style(pu): format config
      
      * polish(pu): polish config params for cartpole, lunarlander and minigrid
      
      * polish(pu): polish minigrid config params
      
      * Update r2d2.py
      
      * polish(pu): polish rnn reset problem
      
      * fix(pu): fix merge error
      
      * polish(pu): polish cartpole config
      
      * polish(nyz): polish cartpole r2d2 config for faster convergence
      
      * test(nyz): enable r2d2 algotest
      Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
    • style(nyz): add d4rl env link and fix cql demo cmd · 3e5d6a6c
      niuyazhe committed
  4. 14 Sep 2021, 1 commit
  5. 13 Sep 2021, 4 commits
  6. 11 Sep 2021, 2 commits
  7. 09 Sep 2021, 2 commits
    • feature(lqp): add k8s launcher and di-orchestrator launcher (#45) · 54eb8aa2
      Konnase Lee committed
      * feat: add k8s launcher
      
      * feat: install kubectl when install k3d
      
      * feat: add orchestrator launcher and a test case
      
      * ci: install kubernetes related package and cli
      
      * style: format code
      
      * style: flake check code
      
      * test k8s launcher
      
      * ci: change back to unit test
      
      * feat: delete cert manager when delete orchestrator
      
      * style: flake8 check
      
      * feat: merge k8s-launcher with k8s-helper
      
      1. merge k8s-launcher with k8s-helper
      2. move kubernetes package import to where it will be used
      3. hack/install-k8s-tools.sh -> ding/scripts/install-k8s-tools.sh
    • feature(nyz): add bebold experiment env · e646add9
      niuyazhe committed
  8. 08 Sep 2021, 4 commits
    • feature(nyz): add supervised learning image classification training demo (#27) · 11cc97e8
      Swain committed
      * feature(nyz): add resnet for cv sl task
      
      * feature(nyz): add imagenet classification dataset and adapt compile config for sl
      
      * feature(nyz): add naive image training entry demo
      
      * style(nyz): polish image cls train log
      
      * polish(nyz): polish multi gpu training setting
      
      * feature(nyz): add nn training bp and update async execution
      
      * feature(nyz): add distributed sampler for different dist backend
      
      * fix(nyz): fix compile config collector and buffer compatibility problem
      
      * style(nyz): correct yapf format
      
      * fix(nyz): fix env manager compile config compatibility bug
      
      * refactor(nyz): abstract ISerialEvaluator and rename serial evaluation implementation
      
      * refactor(nyz): refactor collector name
      
      * feature(nyz): add metric evaluator and image cls acc metric eval demo
      
      * fix(nyz): fix cuda and multi gpu bug in image cls demo
    • style(nyz): update sparse reward badge in env table · 5e52c1a0
      Swain committed
    • style(nyz): polish env table in README · 1439e22f
      niuyazhe committed
    • style(wyh): add env information in readme (#46) · fa453ef0
      Weiyuhong-1998 committed
      * env-list
      
      * env-list-fix-grammar
      
      * env-only-test
      
      * modify-gif
      
      * modify-gif-pendulum
      
      * modify-gif-delete-maze
  9. 07 Sep 2021, 2 commits
  10. 06 Sep 2021, 5 commits
    • feature(zym): add offlineRL algo CQL; add offlineRL env D4RL (#37) · 69828ed5
      Yinmin.Zhang committed
      * feature(zym): add pybullet env info; add entropy type in sac.
      
      * feature(zym): add cql; add serial entry for offlineRL.
      
      * feature/polish(zym): add generation entry in mujoco env for offlineRL; polish cql/serial entry for offlineRL.
      
      * feature(lj): add d4rl env for offlineRL.
      
      * polish(zym): polish cql.
      
      * feature/polish(zym): add dataset registry; polish offlineRL pipeline.
      
      * fix(zym): fix bug in d4rl/mujoco config; fix bug in dataset for offlineRL.
      
      * style(zym): add pybulletgym and d4rl requirements in setup.
      
      * fix/polish(zym): support str in NaiveRLDataset; polish cql.
      
      * polish(zym): polish command policy.
      
      * feature(zym): add cql in pendulum env; add unittest/algotest for cql.
      
      * fix(zym): fix cql bug in unittest/algotest for cql.
    • style(nyz): add algorithm list in README · 110d4063
      niuyazhe committed
    • style(nyz): add algorithm list in README · 12a727cd
      niuyazhe committed
    • enable user to use any expert model for sqil (#44) · 5fbc9453
      Will-Nie committed
      * enable user to use any model generated here
      
      * delete irrelevant package
      
      * add test
      
      * bash format.sh to reformat style
    • fix(pu): fix r2d2 bug (#36) · c8dac674
      蒲源 committed
      * test rnd
      
      * fix mz config
      
      * fix config
      
      * fix(pu): fix r2d2
      
      * feature(puyuan): add minigrid r2d2 config
      
      * polish minigrid config
      
      * modified as review
      
      * fix(pu): fix bug for compatibility
      
      * polish(pu): add annotations and polish slice operation
      
      * style(pu): run format.sh
      
      * style(pu): correct yapf format
  11. 03 Sep 2021, 3 commits
  12. 02 Sep 2021, 4 commits
  13. 31 Aug 2021, 1 commit
  14. 27 Aug 2021, 3 commits