提交 · fa420300e2b10dd5304328c15d8eabd3d5e33454 · PaddlePaddle / PARL

16 3月, 2020 1 次提交

update comments for ES (#211) · fa420300

由 Bo Zhou 提交于 3月 16, 2020

* update comments for ES

* check dependence on paddle or torch

* update readme

* update readme#2

* users can still use parl.remote when no DL framework was found

* yapf

fa420300

09 3月, 2020 1 次提交

update parl.maddpg without import gym (#208) · 7f2abd56

由 rical730 提交于 3月 09, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

* update parl.maddpg without import gym

* update NeurlIPS2018.gif to NeurlIPS2019.gif

* update readme and comments

7f2abd56

06 3月, 2020 1 次提交
- B
  fix paddle version bug (#207) · 450a4a34
  由 Bo Zhou 提交于 3月 06, 2020
```
* fix paddle version bug

* add gym dependence (introduced by MADDPG)

* recall
```
  450a4a34
08 2月, 2020 1 次提交

add maddpg example (#200) · 9216d941

由 rical730 提交于 2月 08, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

9216d941

30 12月, 2019 1 次提交
- L
  add sac (#188) · c070db83
  由 LI Yunxiang 提交于 12月 30, 2019
```
* add sac
```
  c070db83
13 8月, 2019 1 次提交

Zhoubo01 es (#127) · 5612ecde

由 Bo Zhou 提交于 8月 13, 2019

* add learning curve for ES

* add learning curve for ES

* support new APIs of the cluster

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* rename learner.py

* Update README.md

* Update README.md

* Update README.cn.md

* Update README.md

* Update README.cn.md

* Update README.md

5612ecde

26 7月, 2019 1 次提交

replace PE with compiler(new feature in paddle151). (#99) · d33f3002

由 Bo Zhou 提交于 7月 26, 2019

* fix the compatibility issue

* fix the comment issue

* support paddle 1.5.1 and replace PE with compiler

* yapf&copyright

* yapf

* fix the teamcity problem

* fix the teamcity problem

* fix comment

* only support paddle 1.5.1

* Cmake

* fix comment

d33f3002

05 7月, 2019 1 次提交

Documents cn (#85) · 96c58265

由 Bo Zhou 提交于 7月 05, 2019

* Update README.cn.md

* Update README.md

* Update README.md

* Update README.cn.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

96c58265

19 4月, 2019 2 次提交
- H
  add A2C benchmark; add more information in PyPI homepage (#70) · 3b97394e
  由 Hongsheng Zeng 提交于 4月 19, 2019
```
* add A2C benchmark; add more information in PyPI homepage

* filter picture in PyPI homepage
```
  3b97394e
- B
  Update README.md (#68) · f12b790f
  由 Bo Zhou 提交于 4月 19, 2019
```
* Update README.md

* Update Dockerfile

* Update build.sh
```
  f12b790f
18 4月, 2019 2 次提交

Refine (#67) · 3556c786

由 Hongsheng Zeng 提交于 4月 18, 2019

* fix typo

* Update README.md

* Update README.md

* Update README.md

* soft depend on fluid; add module to monitor client status

* improve performance of IMPALA example

* fix bug of some client cannot exit normally

* refine comment

* .

3556c786

Add a Chinese documentation (#65) · 432d75b7

由 Bo Zhou 提交于 4月 18, 2019

* Update README.md

* Create README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.md

* Update README.md

* Update README.md

* Update README.cn.md

432d75b7

17 4月, 2019 1 次提交

GA3C example (#63) · 3c511e8f

由 Hongsheng Zeng 提交于 4月 17, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* add GA3C example

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* refine Readme

* add benchmark

* add default safe eps in numpy logp calculation

* refine document; make unittest stable

3c511e8f

15 4月, 2019 1 次提交

A2C example (#62) · 39846831

由 Hongsheng Zeng 提交于 4月 15, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* fix shell script cannot run in ubuntu

* refine comment and document

* Update README.md

* Update README.md

39846831

13 4月, 2019 1 次提交

add some introduction for our parallelization feature (#61) · 452050a0

由 Bo Zhou 提交于 4月 13, 2019

* Update remote_decorator.py

* Update README.md

* add an figure for the demonstration about parallelization

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* add a link to IMPALA

452050a0

08 4月, 2019 1 次提交

implement of IMPALA with the newest parallel design (#60) · b28289ac

由 Hongsheng Zeng 提交于 4月 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac

11 3月, 2019 1 次提交

update documents (#58) · d8449b74

由 Bo Zhou 提交于 3月 11, 2019

* Update README.md

* Update train.py

* Update README.md

* Update agent_base.py

* Update train.py

* Update train.py

* Update train.py

d8449b74

01 3月, 2019 1 次提交
- B
  Update some docs. (#51) · 46188cd4
  由 Bo Zhou 提交于 3月 01, 2019
```
* Update model_base.py

* Update README.md

* Update README.md
```
  46188cd4
27 2月, 2019 1 次提交

first version of network communication (#49) · bbde58fb

由 Bo Zhou 提交于 2月 27, 2019

* first version of network communication

* fix code styple problems

* add a script to get machine's information

* code styple problems#2

* fix unit test problems

* update dockfile to fix the installation issue of cmake

* thread-saftey ensurance & copright

* resolve comments

bbde58fb

14 2月, 2019 1 次提交

fix PPO bug; add more benchmark result (#47) · 65ad2a4e

由 Hongsheng Zeng 提交于 2月 14, 2019

* fix PPO bug; add more benchmark result

* refine code

* update benchmark of PPO, after fix bug

* refine code

65ad2a4e

18 1月, 2019 1 次提交

Refine documents of PARL (#43) · 7a7583ab

由 Hongsheng Zeng 提交于 1月 18, 2019

* remove not used files, add benchmark for DQN and DDPG, add Parameters management Readme

* Update README.md

* Update README.md

* add parl dependence in examples, use np shuffle instead of sklean

* fix codestyle

* refine readme of nips example

* fix bug

* fix code style

* Update README.md

* Update README.md

* Update README.md

* refine document and remove outdated design doc

* Update README.md

* Update README.md

* refine comment

* release version 1.0

* gif of examples

* Update README.md

* update Readme

7a7583ab

15 1月, 2019 1 次提交

NeurIPS2018-AI-for-Prosthetics-Challenge training code (#40) · cdb50056

由 Hongsheng Zeng 提交于 1月 15, 2019

* NeurIPS2018-AI-for-Prosthetics-Challenge training code

* remove model_zoo, provide download link

* remove model_zoo, provide download link

* add restore_from_one_head api, refine README, fix logger bug

* fix test bug

* fix rpm bug, refine ddpg train script

* fix rpm bug, refine Readme

cdb50056

04 1月, 2019 1 次提交

add PPO example (#39) · f8de849b

由 Hongsheng Zeng 提交于 1月 04, 2019

* add PPO example

* Update Readme

* Update Readme

* fix codestyle

* Update Readme

* refine action mapping

* add more unitest case

* remove unnecessary params initialize, add more comments, add benchmark result

* rename

* remove PARL dependence in readme of examples

f8de849b

15 12月, 2018 1 次提交

Add DDPG example (#36) · 53c94787

由 Hongsheng Zeng 提交于 12月 15, 2018

* add DDPG example, fix some tiny bug

* add license

* unify code structure

* unify code structure

* refine gputils, fix seed in QuickStart

* use white noise in DDPG

* fix codestyle

53c94787

07 12月, 2018 1 次提交

Add QuickStart example (#35) · cdd4622a

由 Hongsheng Zeng 提交于 12月 06, 2018

* add QuickStart example, refine DQN example

* add examples link

* refine the naming, and add quick start training result

cdd4622a

04 12月, 2018 1 次提交
- B
  Update README.md (#34) · 5be4ca00
  由 Bo Zhou 提交于 12月 04, 2018
```
a more detailed example for DQN model.
```
  5be4ca00
29 11月, 2018 1 次提交

add introduction about abstractions and features in README and logo (#31) · ec005b50

由 Bo Zhou 提交于 11月 29, 2018

* Update README.md

* Update README.md

* add diagram/logo

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

ec005b50

17 5月, 2018 2 次提交
- H
  
  revisions · ad049bca
  由 haonanyu 提交于 5月 16, 2018
  
  ad049bca
- H
  
  parameter sharing in fluid with simple test cases · 1e32a717
  由 haonanyu 提交于 5月 14, 2018
  
  1e32a717
26 4月, 2018 1 次提交
- E
  
  Initial commit · f355bc64
  由 emailweixu 提交于 4月 25, 2018
  
  f355bc64