由 Will-Nie 提交于 12月 08, 2021

* add trex algorithm for pong

* sort style

* add atari, ll,cp; fix device, collision; add_ppo

* add accuracy evaluation

* correct style

* add seed to make sure results are replicable

* remove useless part in cum return  of model part

* add mujoco onppo training pipeline; ppo config

* improve style

* add sac training config for mujoco

* add log, add save data; polish config

* logger; hyperparameter;walker

* correct style

* modify else condition

* change rnd to trex

* revise according to comments, add eposode collect

* new collect mode for trex, fix all bugs, commnets

* final change

* polish after the final comment

* add readme/test

* add test for serial entry of trex/gcl

* sort style

63105fef

test_serial_entry_guided_cost.py 1.1 KB

OpenDILab开源决策智能平台 / DI-engine 上一次同步 2 年多

Replace test_serial_entry_guided_cost.py

OpenDILab开源决策智能平台 / DI-engine
上一次同步 2 年多