DI-engine
User Guide
Installation
Quick Start
Key Concept
Introduction to RL
RL Environments Tutorial
Hands on RL
RL Environments Tutorial
Best Practice
API Doc
Config
Env
Policy
Model
Reward Model
League
Learner
Collector
Buffer
Coordinator
RL Utils
Torch Utils
Utils
Interaction
FAQ
Feature
Developer Guide
Developer Guide
Tutorial-Developer
Architecture Design
DI-engine
Docs
»
API Doc
View page source
API Doc
ΒΆ
Config
config.config
config
Env
envs.env
base_env
envs.env_wrappers
env.env_wrappers
envs.env_manager
base_env_manager
subprocess_env_manager
Policy
DQN
DQNPolicy
Rainbow
RainbowDQNPolicy
R2D2
R2D2Policy
A2C
A2CPolicy
DDPG
DDPGPolicy
QMIX
QMIXPolicy
COMA
COMAPolicy
CollaQ
CollaQPolicy
PPG
PPGPolicy
Model
Common
common.encoder
common.head
common.utils
Template
template.q_learning
template.QAC
template.VAC
template.qmix
template.coma
Reward Model
Base Model
base_reward_estimate
Pdeil
pdeil_irl_model
Pwil
pwil_irl_model
Red
red_irl_model
Gail
gail_irl_model
League
league.league
base_league
league.one_vs_one_league
one_vs_one_league
league.player
player
league.payoff
shared_payoff
Learner
worker.learner
learner_hook
base_learner
worker.learner.comm
learner_comm
Collector
worker.collector.comm
base_comm_collector
flask_fs_collector
utils
worker.collector.base_serial_collector
base_serial_collector
worker.collector.base_serial_evaluator
base_serial_evaluator
worker.collector.sample_serial_collector
sample_serial_collector
worker.collector.episode_serial_collector
episode_serial_collector
Buffer
worker.replay_buffer
replay buffer
utils
Coordinator
worker.coordinator.coordinator
TaskState
Coordinator
worker.coordinator.commander
base_serial_commander
base_parallel_commander
solo_parallel_commander
worker.coordinator.comm_coordinator
CommCoordinator
worker.coordinator.resource_manager
NaiveResourceManager
RL Utils
rl_tuils.td
Temporal Differnece
rl_utils.gae
gae
rl_utils.ppo
ppo
rl_utils.adder
adder
rl_utils.exploration
exploration
rl_utils.a2c
a2c
rl_utils.isw
isw
rl_tuils.vtrace
vtrace
rl_tuils.value_rescale
value_rescale
rl_utils.coma
coma
rl_tuils.upgo
UPGO
Torch Utils
checkpoint_helper
build_checkpoint_helper
CheckpointHelper
CountVar
auto_checkpoint
data_helper
to_device
to_dtype
to_tensor
to_ndarray
to_list
tensor_to_list
same_shape
build_log_buffer
CudaFetcher
get_tensor_data
distribution
Pd
CategoricalPd
CategoricalPdPytorch
metric
levenshtein_distance
hamming_distance
nn_test_helper
is_differentiable
optimizer_helper
Adam
RMSprop
loss.cross_entropy_loss
LabelSmoothCELoss
SoftFocalLoss
build_ce_criterion
loss.multi_logits_loss
MultiLogitsLoss
network.activation
GLU
build_activation
network.nn_module
weight_init
sequential_pack
conv1d_block
conv2d_block
deconv2d_block
fc_block
MLP
one_hot
binary_encode
noise_block
ChannelShuffle
NearestUpsample
BilinearUpsample
NoiseLinearLayer
network.normalization
build_normalization
network.res_block
ResBlock
ResFCBlock
network.rnn
LSTMForwardWrapper
LSTM
PytorchLSTM
get_lstm
network.scatter_connection
ScatterConnection
network.soft_argmax
SoftArgmax
network.transformer
Attention
TransformerLayer
Transformer
Utils
utils.time_helper
time_helper
utils.collection_helper
collection_helper
utils.compression_helper
compression_helper
utils.default_helper
default_helper
utils.design_helper
design_helper
utils.pytorch_ddp_dist_helper
pytorch_ddp_dist_helper
utils.file_helper
file_helper
utils.import_helper
import_helper
utils.k8s_helper
k8s_helper
utils.lock_helper
lock_helper
utils.log_helper
log_helper
utils.system_helper
system_helper
utils.segment_tree
SegmentTree
utils.data
data.collate_fn
data.dataloader
cache
Interaction
interaction.base
interaction.base.app
interaction.base.common
interaction.base.network
interaction.base.threading
interaction.master
interaction.master.master
interaction.master.connection
interaction.master.task
interaction.slave
interaction.slave.slave
interaction.slave.action
interaction.exception
interaction.exception.base
interaction.exception.master
interaction.exception.slave