1. 24 12月, 2021 1 次提交
    • S
      feature(nyz): add H-PPO hybrid action space algorithm (#140) · 0b71fc4e
      Swain 提交于
      * feature(nyz): add hybrid ppo, unify action_space field and use dict type mu sigma
      
      * polish(nyz): polish ppo config continous field, move to action_space field
      
      * fix(nyz): fix ppo action_space field compatibility bug
      
      * fix(nyz): fix ppg/sac/cql action_space field compatibility bug
      
      * demo(nyz): update gym hybrid hppo config
      
      * polish(pu): polish hppo hyper-para, use tanh and fixed sigma 0.3 in actor_action_args, use clamp [0,1] and [-1,1] for acceleration_value and rotation_value correspondingly after sample from the pi distri. in collect phase
      
      * polish(pu):polish as review
      
      * polish(pu): polish hppo config
      
      * polish(pu): entropy weight=0.03 performs best empirically
      
      * fix(nyz): fix unittest compatibility bugs
      
      * polish(nyz): remove atari env unused print(ci skip)
      Co-authored-by: Npuyuan1996 <2402552459@qq.com>
      0b71fc4e
  2. 25 11月, 2021 1 次提交
    • T
      feature(zt): add curiosity icm algorithm (#41) · b50e8aea
      timothijoe 提交于
      * curisity_icm_v1
      
      * modified version1
      
      * modified v2
      
      * one_hot function change
      
      * add paper information
      
      * format minigrid ppo curiosity
      
      * flake8 ding checked
      
      * 6th-Oct-gpu-modified
      
      * reset configs in minigrid files
      
      * minigird-env-doorkey88-100-300
      
      * use modulelist instead of list in icm module
      
      * change icm reward model
      
      * delete origin curiosit_reward model and add icm_reward model
      
      * modified icm reward model
      
      * polish icm model by zt, (1) polish ding/reward_model/icm_reward_model.py and related __init__.py (2) add config files for pong:dizoo/atari/config/serial/pong/pong_ppo_offpolicy_icm.py and minigrid env: dizoo/minigrid/config/doorkey8_icm_config.py,fourroom_icm_config.py,minigrid_icm_config.py  (3) add element icm in README
      
      * remove some useless config files in minigrid
      
      * remove redundant part in ppo.py, add cartpole_ppo_icm_config.py, changed test_icm.py and Readme
      b50e8aea
  3. 31 10月, 2021 1 次提交
  4. 29 7月, 2021 1 次提交
  5. 16 7月, 2021 1 次提交
  6. 08 7月, 2021 1 次提交