• K
    feature(lk): add PDQN algorithm for hybrid action spaces (#118) · 39a7cfe3
    Ke Li 提交于
    * add_pdqn_model
    
    * modify_model_structure
    
    * initial_version_PDQN
    
    * bug_free_PDQN_no_test_convergence
    
    * update_pdqn_config
    
    * add_noise_to_continuous_args
    
    * polish(nyz): polish code style and add noise in pdqn
    
    * seperate_dis_and_cont_model
    
    * fix_bug_for_separation
    
    * fix(pu): current q value use the data action, fix cont loss detach bug, 1 encoder, dist and cont learning rate
    
    * polish(pu): actor delay update
    
    * fix(pu): fix disc cont update frequency
    
    * polish(pu): polish pdqn config
    
    * polish(lk): add comments and typelint for pdqn and dqn
    
    * feature(lk): add test file for pdqn model and policy
    
    * polish(lk): code style
    
    * polish(lk): rm the modify of unrelated files
    
    * polish(lk): rm useless commentes code in pdqn
    Co-authored-by: Nniuyazhe <niuyazhe@sensetime.com>
    Co-authored-by: Npuyuan1996 <2402552459@qq.com>
    39a7cfe3
model_wrappers.py 24.5 KB