Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
OpenDILab开源决策智能平台
DI-engine
提交
12bc041d
D
DI-engine
项目概览
OpenDILab开源决策智能平台
/
DI-engine
上一次同步 2 年多
通知
56
Star
321
Fork
0
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
0
列表
看板
标记
里程碑
合并请求
0
DevOps
流水线
流水线任务
计划
Wiki
1
Wiki
分析
仓库
DevOps
项目成员
Pages
D
DI-engine
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
0
Issue
0
列表
看板
标记
里程碑
合并请求
0
合并请求
0
Pages
DevOps
DevOps
流水线
流水线任务
计划
分析
分析
仓库分析
DevOps
Wiki
1
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
流水线任务
提交
Issue看板
前往新版Gitcode,体验更适合开发者的 AI 搜索 >>
提交
12bc041d
编写于
11月 15, 2021
作者:
N
niuyazhe
1
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
style(nyz): add mbrl badge and env doc link
上级
3a91c429
变更
1
隐藏空白更改
内联
并排
Showing
1 changed file
with
6 addition
and
4 deletion
+6
-4
README.md
README.md
+6
-4
未找到文件。
README.md
浏览文件 @
12bc041d
...
...
@@ -126,7 +126,7 @@ ding -m serial -e cartpole -p dqn -s 0
| 26 |
[
RND
](
https://arxiv.org/abs/1810.12894
)
| !
[
exp
](
https://img.shields.io/badge/-exploration-orange
)
|
[
reward_model/rnd
](
https://github.com/opendilab/DI-engine/blob/main/ding/reward_model/rnd_reward_model.py
)
| python3 -u cartpole_ppo_rnd_main.py |
| 27 |
[
CQL
](
https://arxiv.org/pdf/2006.04779.pdf
)
| !
[
offline
](
https://img.shields.io/badge/-offlineRL-darkblue
)
|
[
policy/cql
](
https://github.com/opendilab/DI-engine/blob/main/ding/policy/cql.py
)
| python3 -u d4rl_cql_main.py |
| 28 |
[
TD3BC
](
https://arxiv.org/pdf/2106.06860.pdf
)
| !
[
offline
](
https://img.shields.io/badge/-offlineRL-darkblue
)
|
[
policy/td3_bc
](
https://github.com/opendilab/DI-engine/blob/main/ding/policy/td3_bc.py
)
| python3 -u mujoco_td3_bc_main.py |
| 29 |
[
MBPO
](
https://arxiv.org/pdf/1906.08253.pdf
)
| !
[
continuous
](
https://img.shields.io/badge/-continous-green
)
|
[
model/template/model_based/mbpo
](
https://github.com/opendilab/DI-engine/blob/main/ding/model/template/model_based/mbpo.py
)
| python3 -u sac_halfcheetah_mopo_default_config.py |
| 29 |
[
MBPO
](
https://arxiv.org/pdf/1906.08253.pdf
)
| !
[
mbrl
](
https://img.shields.io/badge/-ModelBasedRL-lightblue
)
|
[
model/template/model_based/mbpo
](
https://github.com/opendilab/DI-engine/blob/main/ding/model/template/model_based/mbpo.py
)
| python3 -u sac_halfcheetah_mopo_default_config.py |
| 30 |
[
PER
](
https://arxiv.org/pdf/1511.05952.pdf
)
| !
[
other
](
https://img.shields.io/badge/-other-lightgrey
)
|
[
worker/replay_buffer
](
https://github.com/opendilab/DI-engine/blob/main/ding/worker/replay_buffer/advanced_buffer.py
)
|
`rainbow demo`
|
| 31 |
[
GAE
](
https://arxiv.org/pdf/1506.02438.pdf
)
| !
[
other
](
https://img.shields.io/badge/-other-lightgrey
)
|
[
rl_utils/gae
](
https://github.com/opendilab/DI-engine/blob/main/ding/rl_utils/gae.py
)
|
`ppo demo`
|
...
...
@@ -146,19 +146,21 @@ ding -m serial -e cartpole -p dqn -s 0
![
offline
](
https://img.shields.io/badge/-offlineRL-darkblue
)
means offline RL algorithm
![
mbrl
](
https://img.shields.io/badge/-ModelBasedRL-lightblue
)
means model-based RL algorithm
![
other
](
https://img.shields.io/badge/-other-lightgrey
)
means other sub-direction algorithm, usually as plugin-in in the whole pipeline
P.S: The
`.py`
file in
`Runnable Demo`
can be found in
`dizoo`
### Environment Versatility
| No | Environment | Label | Visualization |
dizoo link
|
| No | Environment | Label | Visualization |
Code and Doc Links
|
| :--: | :--------------------------------------: | :---------------------------------: | :--------------------------------:|:---------------------------------------------------------: |
| 1 |
[
atari
](
https://github.com/openai/gym/tree/master/gym/envs/atari
)
| !
[
discrete
](
https://img.shields.io/badge/-discrete-brightgreen
)
| !
[
original
](
./dizoo/atari/atari.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo/atari/envs
)
|
| 1 |
[
atari
](
https://github.com/openai/gym/tree/master/gym/envs/atari
)
| !
[
discrete
](
https://img.shields.io/badge/-discrete-brightgreen
)
| !
[
original
](
./dizoo/atari/atari.gif
)
|
[
code link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo/atari/envs
)
[
env tutorial
]
(https://di-engine-docs.readthedocs.io/en/latest/env_tutorial/atari.html)
[
环境指南
](
https://di-engine-docs.readthedocs.io/en/main-zh/env_tutorial/atari_zh.html
)
|
| 2 |
[
box2d/bipedalwalker
](
https://github.com/openai/gym/tree/master/gym/envs/box2d
)
| !
[
continuous
](
https://img.shields.io/badge/-continous-green
)
| !
[
original
](
./dizoo/box2d/bipedalwalker/original.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo/box2d/bipedalwalker/envs
)
|
| 3 |
[
box2d/lunarlander
](
https://github.com/openai/gym/tree/master/gym/envs/box2d
)
| !
[
discrete
](
https://img.shields.io/badge/-discrete-brightgreen
)
| !
[
original
](
./dizoo/box2d/lunarlander/lunarlander.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo/box2d/lunarlander/envs
)
|
| 4 |
[
classic_control/cartpole
](
https://github.com/openai/gym/tree/master/gym/envs/classic_control
)
| !
[
discrete
](
https://img.shields.io/badge/-discrete-brightgreen
)
| !
[
original
](
./dizoo/classic_control/cartpole/cartpole.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo/classic_control/cartpole/envs
)
|
| 5 |
[
classic_control/pendulum
](
https://github.com/openai/gym/tree/master/gym/envs/classic_control
)
| !
[
discrete
](
https://img.shields.io/badge/-discrete-brightgreen
)
| !
[
original
](
./dizoo/classic_control/pendulum/pendulum.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo/classic_control/pendulum/envs
)
|
| 5 |
[
classic_control/pendulum
](
https://github.com/openai/gym/tree/master/gym/envs/classic_control
)
| !
[
continuous
](
https://img.shields.io/badge/-continous-green
)
| !
[
original
](
./dizoo/classic_control/pendulum/pendulum.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo/classic_control/pendulum/envs
)
|
| 6 |
[
competitive_rl
](
https://github.com/cuhkrlcourse/competitive-rl
)
| !
[
discrete
](
https://img.shields.io/badge/-discrete-brightgreen
)
![
selfplay
](
https://img.shields.io/badge/-selfplay-blue
)
| !
[
original
](
./dizoo/competitive_rl/competitive_rl.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo.classic_control
)
|
| 7 |
[
gfootball
](
https://github.com/google-research/football
)
| !
[
discrete
](
https://img.shields.io/badge/-discrete-brightgreen
)![
sparse
](
https://img.shields.io/badge/-sparse%20reward-orange
)![
selfplay
](
https://img.shields.io/badge/-selfplay-blue
)
| !
[
original
](
./dizoo/gfootball/gfootball.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo.gfootball/envs
)
|
| 8 |
[
minigrid
](
https://github.com/maximecb/gym-minigrid
)
| !
[
discrete
](
https://img.shields.io/badge/-discrete-brightgreen
)![
sparse
](
https://img.shields.io/badge/-sparse%20reward-orange
)
| !
[
original
](
./dizoo/minigrid/minigrid.gif
)
|
[
dizoo link
](
https://github.com/opendilab/DI-engine/tree/main/dizoo/minigrid/envs
)
|
...
...
OpenDILab开源决策智能平台
@m0_55289267
mentioned in commit
daba0e7f
·
11月 16, 2021
mentioned in commit
daba0e7f
mentioned in commit daba0e7faa1736f1eb374453e454c93d0dc85286
开关提交列表
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录