- 16 3月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
* update comments for ES * check dependence on paddle or torch * update readme * update readme#2 * users can still use parl.remote when no DL framework was found * yapf
-
- 09 3月, 2020 1 次提交
-
-
由 rical730 提交于
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments * update parl.maddpg without import gym * update NeurlIPS2018.gif to NeurlIPS2019.gif * update readme and comments
-
- 06 3月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
* fix paddle version bug * add gym dependence (introduced by MADDPG) * recall
-
- 08 2月, 2020 1 次提交
-
-
由 rical730 提交于
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments
-
- 30 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add sac
-
- 13 8月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* add learning curve for ES * add learning curve for ES * support new APIs of the cluster * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * rename learner.py * Update README.md * Update README.md * Update README.cn.md * Update README.md * Update README.cn.md * Update README.md
-
- 26 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* fix the compatibility issue * fix the comment issue * support paddle 1.5.1 and replace PE with compiler * yapf©right * yapf * fix the teamcity problem * fix the teamcity problem * fix comment * only support paddle 1.5.1 * Cmake * fix comment
-
- 05 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* Update README.cn.md * Update README.md * Update README.md * Update README.cn.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md
-
- 19 4月, 2019 2 次提交
-
-
由 Hongsheng Zeng 提交于
* add A2C benchmark; add more information in PyPI homepage * filter picture in PyPI homepage
-
由 Bo Zhou 提交于
* Update README.md * Update Dockerfile * Update build.sh
-
- 18 4月, 2019 2 次提交
-
-
由 Hongsheng Zeng 提交于
* fix typo * Update README.md * Update README.md * Update README.md * soft depend on fluid; add module to monitor client status * improve performance of IMPALA example * fix bug of some client cannot exit normally * refine comment * .
-
由 Bo Zhou 提交于
* Update README.md * Create README.cn.md * Update README.cn.md * Update README.cn.md * Update README.cn.md * Update README.cn.md * Update README.cn.md * Update README.cn.md * Update README.cn.md * Update README.cn.md * Update README.cn.md * Update README.cn.md * Update README.md * Update README.md * Update README.md * Update README.cn.md
-
- 17 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md * add a3c algorithm, A2C example and rl_utils * require training in single gpu/cpu * only check cpu/gpu num in learner * refine Readme * update impala benchmark picture; update Readme * add benchmark result of A2C * move get_params/set_params in agent_base * add GA3C example * Update README.md * Update README.md * Update README.md * Update README.md * refine Readme * add benchmark * add default safe eps in numpy logp calculation * refine document; make unittest stable
-
- 15 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md * add a3c algorithm, A2C example and rl_utils * require training in single gpu/cpu * only check cpu/gpu num in learner * refine Readme * update impala benchmark picture; update Readme * add benchmark result of A2C * move get_params/set_params in agent_base * fix shell script cannot run in ubuntu * refine comment and document * Update README.md * Update README.md
-
- 13 4月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* Update remote_decorator.py * Update README.md * add an figure for the demonstration about parallelization * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * add a link to IMPALA
-
- 08 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md
-
- 11 3月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* Update README.md * Update train.py * Update README.md * Update agent_base.py * Update train.py * Update train.py * Update train.py
-
- 01 3月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* Update model_base.py * Update README.md * Update README.md
-
- 27 2月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* first version of network communication * fix code styple problems * add a script to get machine's information * code styple problems#2 * fix unit test problems * update dockfile to fix the installation issue of cmake * thread-saftey ensurance & copright * resolve comments
-
- 14 2月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* fix PPO bug; add more benchmark result * refine code * update benchmark of PPO, after fix bug * refine code
-
- 18 1月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* remove not used files, add benchmark for DQN and DDPG, add Parameters management Readme * Update README.md * Update README.md * add parl dependence in examples, use np shuffle instead of sklean * fix codestyle * refine readme of nips example * fix bug * fix code style * Update README.md * Update README.md * Update README.md * refine document and remove outdated design doc * Update README.md * Update README.md * refine comment * release version 1.0 * gif of examples * Update README.md * update Readme
-
- 15 1月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* NeurIPS2018-AI-for-Prosthetics-Challenge training code * remove model_zoo, provide download link * remove model_zoo, provide download link * add restore_from_one_head api, refine README, fix logger bug * fix test bug * fix rpm bug, refine ddpg train script * fix rpm bug, refine Readme
-
- 04 1月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add PPO example * Update Readme * Update Readme * fix codestyle * Update Readme * refine action mapping * add more unitest case * remove unnecessary params initialize, add more comments, add benchmark result * rename * remove PARL dependence in readme of examples
-
- 15 12月, 2018 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add DDPG example, fix some tiny bug * add license * unify code structure * unify code structure * refine gputils, fix seed in QuickStart * use white noise in DDPG * fix codestyle
-
- 07 12月, 2018 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add QuickStart example, refine DQN example * add examples link * refine the naming, and add quick start training result
-
- 04 12月, 2018 1 次提交
-
-
由 Bo Zhou 提交于
a more detailed example for DQN model.
-
- 29 11月, 2018 1 次提交
-
-
由 Bo Zhou 提交于
* Update README.md * Update README.md * add diagram/logo * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md
-
- 17 5月, 2018 2 次提交
- 26 4月, 2018 1 次提交
-
-
由 emailweixu 提交于
-