OpenI/PARL: PARL is a high-performance, flexible reinforcement learning framework - PARL - OpenI




Reproduce ES with PARL

Based on PARL, we have implemented the Evolution Strategies (ES) algorithm and evaluated it in MuJoCo environments. Its performance matches the results reported in the paper.

Paper: ES in Evolution Strategies as a Scalable Alternative to Reinforcement Learning
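For readers new to ES, the core update can be sketched in a few lines of NumPy. This is a minimal illustration on a toy objective, not the code in this directory: `es_step`, `toy_fitness`, `TARGET`, and all hyperparameter values are hypothetical names and numbers chosen for the example, and subtracting the mean return is a simple variance-reduction choice made here rather than a description of what the example scripts do.

```python
import numpy as np

# Hypothetical optimum of the toy objective; stands in for "good policy weights".
TARGET = np.array([0.5, -0.3, 0.1])


def toy_fitness(params):
    """Toy stand-in for an episode return: higher is better, maximal at TARGET."""
    return -np.sum((params - TARGET) ** 2)


def es_step(theta, fitness_fn, rng, pop_size=50, sigma=0.1, lr=0.05):
    """One ES update: perturb theta with Gaussian noise, score each
    perturbation, and move theta along the return-weighted noise."""
    noise = rng.standard_normal((pop_size, theta.size))
    returns = np.array([fitness_fn(theta + sigma * eps) for eps in noise])
    # Subtract the mean return as a baseline to reduce estimator variance.
    centered = returns - returns.mean()
    # Monte Carlo estimate of the gradient of the Gaussian-smoothed objective.
    grad_estimate = noise.T @ centered / (pop_size * sigma)
    return theta + lr * grad_estimate


def run(steps=200, seed=0):
    """Run ES from the origin on the toy objective and return the final theta."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(TARGET.size)
    for _ in range(steps):
        theta = es_step(theta, toy_fitness, rng)
    return theta


if __name__ == "__main__":
    print("final theta:", run())
```

Each perturbation is evaluated independently of the others, which is why the method parallelizes well across many CPUs.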

Mujoco games introduction

See here to learn more about the MuJoCo games.

Benchmark result

[Figure: benchmark result]

How to use

Dependencies

Python3.7+
paddlepaddle>=2.0.0
parl>=2.1.1
gym>=0.26.0
mujoco>=2.2.2
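A quick way to confirm that an environment satisfies the minimum versions listed above is sketched below. It is an illustrative helper, not part of this repository: the function names are hypothetical, the version parsing is deliberately simple (numeric components only), and it assumes Python 3.8 or newer because it relies on `importlib.metadata`.

```python
import re
from importlib import metadata  # standard library from Python 3.8 onward

# Minimum versions copied from the dependency list above.
MINIMUMS = {
    "paddlepaddle": "2.0.0",
    "parl": "2.1.1",
    "gym": "0.26.0",
    "mujoco": "2.2.2",
}


def version_tuple(version):
    """Turn '2.1.1' or '2.2.2.post1' into a tuple of ints, ignoring suffixes."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare two version strings after padding them to the same length."""
    a, b = version_tuple(installed), version_tuple(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_installed(minimums=MINIMUMS):
    """Map each package name to (installed_version_or_None, ok_flag)."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = (None, False)
            continue
        report[name] = (installed, meets_minimum(installed, minimum))
    return report


if __name__ == "__main__":
    for name, (installed, ok) in check_installed().items():
        status = "ok" if ok else "needs attention"
        print(f"{name}: installed={installed}, required>={MINIMUMS[name]} -> {status}")
```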

Distributed training

To replicate the performance reported above, we encourage you to train with 24 or 48 CPUs.
If you have not started a cluster yet, run the following command to start one. For more information about clusters, please refer to our documentation.

xparl start --port 8837 --cpu_num 24
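Before launching training, it can be useful to confirm that something is listening on the port passed to `xparl start`. The sketch below uses only the Python standard library; `port_is_open` is a hypothetical helper, and a successful TCP connection only shows that the port accepts connections, not that the process behind it is a healthy cluster.

```python
import socket


def port_is_open(host, port, timeout=1.0):
    """Return True if a TCP connection to (host, port) succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # 8837 is the port used in the xparl command above.
    if port_is_open("localhost", 8837):
        print("Port 8837 is accepting connections.")
    else:
        print("Nothing is listening on port 8837; start the cluster first.")
```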

Then we can start the distributed training by running:

python train.py

Training results, including the training curve, are saved in train_log.

Reference
