1. 21 8月, 2020 1 次提交
    • R
      add torch coma (#216) · 1cbcfb15
      rical730 提交于
      * add torch coma
      
      * add Apache License comment
      
      * update readme
      
      * update readme for installing sc2 on windows
      
      * update readme
      
      * add new line at the end of shell file
      
      * update readme
      
      * update readme of coma
      
      * fix model_path
      
      * self.algorithm to self.alg
      Co-authored-by: NBo Zhou <2466956298@qq.com>
      1cbcfb15
  2. 11 6月, 2020 1 次提交
  3. 03 3月, 2020 1 次提交
  4. 27 11月, 2019 1 次提交
  5. 06 11月, 2019 1 次提交
    • L
      add pytorch a2c (#167) · 4abc0534
      LI Yunxiang 提交于
      * add pytorch a2c
      
      * add set/get_weights test & copyright
      
      * yapf....
      
      * Update model_base_test_torch.py
      
      * update
      
      * Delete banma.py
      
      * Update model_base_test_torch.py
      
      * update
      
      * Update model.py
      
      * update torch tests
      
      * Update model_base_test_torch.py
      4abc0534
  6. 25 9月, 2019 1 次提交
    • F
      torchdqn (#150) · 757cc391
      fuyw 提交于
      * git commit -m torchdqn
      
      * yapf
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * yapf
      
      * remove fstring format
      
      * torch_test yapf
      
      * yapf
      
      * Add torch in unittest.requirements
      
      * update torch_unittest
      
      * Torch and FLUID conflict problem in __init__.py
      
      * Unittest fail for torch when both torch and fluid exists.
      
      * cluster_test fail in the unittest, add timeout seconds.
      
      * Torch backend for PARL
      
      * add sleep time for unit test send_job_test.py
      
      * Unit test for send_job_test.py
      
      * use multiple try for unit test
      
      * Fix compatibility for python2.7.
      
      * fix send_job_test.py bugs
      
      * check file exist before send_job_test.py
      
      * Modify send_job_test.py
      757cc391
  7. 08 4月, 2019 1 次提交
    • H
      implement of IMPALA with the newest parallel design (#60) · b28289ac
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      b28289ac
  8. 26 11月, 2018 1 次提交
    • H
      sync paras in program, fix deepcopy bug, python3 compatibility (#28) · e11b40c5
      Hongsheng Zeng 提交于
      * sync paras in program, fix deepcopy bug, python3 compatibility
      
      * refactor code, add plutil directory, clean import order
      
      * remove old comment
      
      * refine comment
      
      * fix codestyle
      
      * cache sync program, add gputils module, refine model_base unittest
      
      * fix codestyle
      
      * refine sync params cache
      
      * add fetch_value module
      e11b40c5
  9. 20 11月, 2018 1 次提交
    • B
      redesign basic class in PARL (#26) · 1a1e1f03
      Bo Zhou 提交于
      * redesign basic class in PARL
      
      * code style fixed
      
      * update yaml's version
      
      * update yaml's version & update code to fix style problem
      
      * add debug message for  function
      
      * delete test code
      
      * rename function: has_fun -> has_func
      1a1e1f03
  10. 06 6月, 2018 1 次提交
    • H
      preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824
      Haonan 提交于
      * prelimary implementations of ComputationTask, Algorithm and Model classes
      
      * remove "model_func" from the args of an algorithm
      
      * a clean clone() function for Algorithm and Model
      
      * add use_next_value as a input to learn()
      
      * further re-structure
      
      * added Feedforward and RLAlgorithm classes
      
      * maxid -> argmax
      
      * discrete_distribution -> category_distribution
      
      * category -> categorical
      
      * revisions
      4b4b5824
  11. 17 5月, 2018 2 次提交