feature(nyp): add DQfD algorithm (#48)
* add_dqfd
* Is_expert to is_expert
* modify according to the last commnets
* value_gamma; done; marginloss; sqil compatibility
* finally shorten the code, revise config
* revise config, style
* add_readme/two_more_config
* correct format
Co-authored-by: Nniuyazhe <niuyazhe@sensetime.com>
Showing
ding/entry/serial_entry_dqfd.py
0 → 100644
此差异已折叠。
ding/policy/dqfd.py
0 → 100644
想要评论请 注册 或 登录