Based on PARL, we reproduce the IMPALA deep reinforcement learning algorithm and match the level of performance reported in the paper on classic Atari games.
Paper: IMPALA in IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Please see here to know more about Atari games.
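The core of IMPALA is the V-trace off-policy correction, which lets the learner train on trajectories generated by actors running slightly stale policies. Below is a minimal NumPy sketch of the V-trace value targets from the paper, not PARL's actual implementation; all function and argument names are illustrative:

```python
import numpy as np

def vtrace_targets(behavior_logp, target_logp, rewards, values,
                   bootstrap_value, gamma=0.99, rho_bar=1.0, c_bar=1.0):
    """V-trace value targets v_s (Espeholt et al., 2018), backward recursion."""
    behavior_logp, target_logp, rewards, values = map(
        np.asarray, (behavior_logp, target_logp, rewards, values))
    ratios = np.exp(target_logp - behavior_logp)   # pi(a|x) / mu(a|x)
    rhos = np.minimum(ratios, rho_bar)             # clipped importance weights
    cs = np.minimum(ratios, c_bar)                 # clipped "trace" weights
    values_tp1 = np.append(values[1:], bootstrap_value)
    deltas = rhos * (rewards + gamma * values_tp1 - values)
    acc = 0.0
    vs_minus_v = np.zeros_like(values, dtype=float)
    for t in reversed(range(len(rewards))):        # accumulate backwards in time
        acc = deltas[t] + gamma * cs[t] * acc
        vs_minus_v[t] = acc
    return values + vs_minus_v
```

When the behavior and target policies coincide, the clipped weights are all 1 and the targets reduce to ordinary n-step returns.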
Learning curve with one learner (on a P40 GPU) and 32 actors (on 32 CPUs).
PongNoFrameskip-v4: mean_episode_rewards reaches a score of 18-19 in about 10 minutes.
Learning curves (mean_episode_rewards) of other games after one hour of training.
First, we start a local cluster with 32 CPUs:
```shell
xparl start --port 8010 --cpu_num 32
```
Note that you do not need to run this command before every training run: an xparl cluster started earlier can be reused for distributed training. See the xparl documentation for details.
Then we can start the distributed training by running:
```shell
python train.py
```
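Under the hood, train.py plays the learner role while actor.py instances run on the xparl cluster and stream trajectories back. A framework-free sketch of that decoupled actor-learner loop, using threads and a queue in place of remote actors (all names below are illustrative, not the ones used in this repo):

```python
import queue
import threading

def actor(actor_id, traj_queue, num_trajs):
    """Each actor pushes trajectories; the real one rolls out an Atari env."""
    for step in range(num_trajs):
        traj_queue.put({'actor': actor_id, 'step': step})

def learner(traj_queue, total):
    """The learner consumes batches; the real one applies V-trace updates."""
    consumed = 0
    while consumed < total:
        traj = traj_queue.get()  # blocks until some actor produces data
        consumed += 1
    return consumed

q = queue.Queue()
threads = [threading.Thread(target=actor, args=(i, q, 4)) for i in range(8)]
for t in threads:
    t.start()
n = learner(q, 8 * 4)           # 8 actors x 4 trajectories each
for t in threads:
    t.join()
```

Because acting and learning are decoupled through the queue, slow actors never stall each other, which is what lets IMPALA scale to dozens of CPUs feeding a single GPU learner.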