1. 14 12月, 2021 1 次提交
    • W
      polish(nyp): fix unittest for trex training and collecting (#144) · f089d02a
      Will-Nie 提交于
      * add trex algorithm for pong
      
      * sort style
      
      * add atari, ll,cp; fix device, collision; add_ppo
      
      * add accuracy evaluation
      
      * correct style
      
      * add seed to make sure results are replicable
      
      * remove useless part in cum return  of model part
      
      * add mujoco onppo training pipeline; ppo config
      
      * improve style
      
      * add sac training config for mujoco
      
      * add log, add save data; polish config
      
      * logger; hyperparameter;walker
      
      * correct style
      
      * modify else condition
      
      * change rnd to trex
      
      * revise according to comments, add eposode collect
      
      * new collect mode for trex, fix all bugs, commnets
      
      * final change
      
      * polish after the final comment
      
      * add readme/test
      
      * add test for serial entry of trex/gcl
      
      * sort style
      
      * change mujoco to cartpole for test for trex_onppo
      
      * remove files generated by testing
      
      * revise tests for entry
      
      * sort style
      
      * revise tests
      
      * modify pytest
      
      * fix(nyz): speed up ppg/ppo and marl algo unittest
      
      * polish(nyz): speed up trex unittest and fix trex entry default config bug
      
      * fix(nyz): fix same name bug
      
      * fix(nyz): fix remove conflict bug(ci skip)
      Co-authored-by: Nniuyazhe <niuyazhe@sensetime.com>
      f089d02a
  2. 08 12月, 2021 2 次提交
    • N
      fix(nyz): fix trex unittest bugs · 234de26b
      niuyazhe 提交于
      234de26b
    • W
      feature(nyp): add Trex algorithm (#119) · 63105fef
      Will-Nie 提交于
      * add trex algorithm for pong
      
      * sort style
      
      * add atari, ll,cp; fix device, collision; add_ppo
      
      * add accuracy evaluation
      
      * correct style
      
      * add seed to make sure results are replicable
      
      * remove useless part in cum return  of model part
      
      * add mujoco onppo training pipeline; ppo config
      
      * improve style
      
      * add sac training config for mujoco
      
      * add log, add save data; polish config
      
      * logger; hyperparameter;walker
      
      * correct style
      
      * modify else condition
      
      * change rnd to trex
      
      * revise according to comments, add eposode collect
      
      * new collect mode for trex, fix all bugs, commnets
      
      * final change
      
      * polish after the final comment
      
      * add readme/test
      
      * add test for serial entry of trex/gcl
      
      * sort style
      63105fef