OpenI/PARL: PARL 是一个高性能、灵活的强化学习框架 - PARL - OpenI

关于GCU、沐曦GPGPU、MLU、0卡V100资源4月7日恢复上架的公告>>> 关于共建具身智能开源数据集的倡议>>> 关于云脑任务中统一路径访问方式的公告>>> 关于将启智集群GPU资源迁移至智算集群的公告>>>

Bo Zhou 4970be2bc2 Update README.md (#1063 ) * Update README.md * Update README.md		1 year ago
..
.result	modify readme and image of paddle a2c (#652)	2 years ago

README.md	Update README.md (#1063)	1 year ago

a2c_config.py	[WIP] starting a single heartbeat server at client side (#1044)	1 year ago

actor.py	dev 2.1.1 (#1009)	1 year ago

atari_agent.py	import paddle by default (#619)	2 years ago

atari_model.py	import paddle by default (#619)	2 years ago

requirements.txt	modify parl version (#1001)	1 year ago

train.py	dev 2.1.1 (#1009)	1 year ago

README.md

Reproduce A2C with PARL
- Atari game introduction
- Benchmark result
How to use

Reproduce A2C with PARL

Based on PARL, the A2C algorithm of deep reinforcement learning has been reproduced, reaching the same level of indicators as the paper in Atari benchmarks.

Atari game introduction

Please see here to know more about Atari games.

Benchmark result

Performance of A2C on some envrionments in training process after 10 million sample steps.

result

How to use

Dependencies

paddlepaddle>=2.0.0
parl>=1.4.3
gym==0.12.1
atari-py==0.1.7
opencv-python

Distributed Training

At first, we can start a local cluster with 5 CPUs:

xparl start --port 8110 --cpu_num 5

Note that if you have started a master before, you don't have to run the above
command. For more information about the cluster, please refer to our
documentation

Then we can start the distributed training by running:

python train.py

Reference

PARL 是一个高性能、灵活的强化学习框架

https://parl.readthedocs.io

ai开发工具

Python C++ JavaScript Shell Markdown other

2466956298@qq.com zenghongsheng@baidu.com likejiao@baidu.com 39279048+Banmahhhh@users.noreply.github.com lsb19@tsinghua.org.cn 68997378+swag1ong@users.noreply.github.com zhoubo01@baidu.com 76139596+ShuaibinLi@users.noreply.github.com 52879090+YuechengLiu@users.noreply.github.com wangzelong0663@gmail.com royxroy@163.com zenghsh3@gmail.com tan_ze@outlook.com 52879090+liuyuecheng-github@users.noreply.github.com 915647399@qq.com haonanyu@baidu.com cclauss@me.com yu239@users.noreply.github.com tangzhiyi11@users.noreply.github.com 50344320+ZiyuanMa@users.noreply.github.com 115619013+Aidilele@users.noreply.github.com 49400846+Jiukaishi@users.noreply.github.com 58016616+ljy2222@users.noreply.github.com bestwanglei@gmail.com skylian@users.noreply.github.com

How to access data resources in code