-
由 Ke Li 提交于
* add_pdqn_model * modify_model_structure * initial_version_PDQN * bug_free_PDQN_no_test_convergence * update_pdqn_config * add_noise_to_continuous_args * polish(nyz): polish code style and add noise in pdqn * seperate_dis_and_cont_model * fix_bug_for_separation * fix(pu): current q value use the data action, fix cont loss detach bug, 1 encoder, dist and cont learning rate * polish(pu): actor delay update * fix(pu): fix disc cont update frequency * polish(pu): polish pdqn config * polish(lk): add comments and typelint for pdqn and dqn * feature(lk): add test file for pdqn model and policy * polish(lk): code style * polish(lk): rm the modify of unrelated files * polish(lk): rm useless commentes code in pdqn Co-authored-by: Nniuyazhe <niuyazhe@sensetime.com> Co-authored-by: Npuyuan1996 <2402552459@qq.com>
39a7cfe3