DI-engine v0.2.2

Env (dizoo)

  1. apple key to door treasure env (#128)
  2. bsuite memory benchmark (#138)
  3. polish atari impala config

Algorithm

  1. Guided Cost IRL algorithm (#57)
  2. ICM exploration algorithm (#41)
  3. MP-DQN hybrid action space algorithm (#131)
  4. add loss statistics and polish r2d3 pong config (#126)

Enhancement

  1. add renew env mechanism in env manager and update timeout mechanism (#127) (#134)

Fix

  1. async subprocess env manager reset bug (#137)
  2. keepdims name bug in model wrapper
  3. on-policy ppo value norm bug
  4. GAE and RND unittest bug
  5. hidden state wrapper h tensor compatibility
  6. naive buffer auto config create bug

Style

  1. add supporters list

New Repo Feature

  1. treevalue speed benchmark

Contributors: @PaParaZz1 @puyuan1996 @RobinC94 @LikeJulia @Will-Nie @Weiyuhong-1998 @timothijoe @davide97l @lichuminglcm @YinminZhang

项目简介

OpenDILab Decision AI Engine

🚀 Github 镜像仓库 🚀

源项目地址

https://github.com/opendilab/DI-engine

发行版本 5

v0.2.2

全部发行版

贡献者 28

全部贡献者

开发语言

  • Python 99.8 %
  • Shell 0.2 %
  • Makefile 0.0 %