Are you sure you want to delete this task? Once this task is deleted, it cannot be recovered.
Sharpiless 78e76ede28 | 3 years ago | |
---|---|---|
Fruit.py | 3 years ago | |
Game.py | 3 years ago | |
LICENSE | 3 years ago | |
Main.py | 3 years ago | |
README.md | 3 years ago | |
State.py | 3 years ago | |
requirements.txt | 3 years ago | |
res.zip | 3 years ago | |
resnet.py | 3 years ago | |
train_keras.py | 3 years ago | |
train_paddle.py | 3 years ago | |
train_torch.py | 3 years ago |
代码地址:https://github.com/Sharpiless/play-daxigua-using-Reinforcement-Learning
用强化学习DQN算法,训练AI模型来玩合成大西瓜游戏,提供Keras版本、PARL(paddle)版本和pytorch版本。
AI Studio:https://aistudio.baidu.com/aistudio/personalcenter/thirdview/67156
这里使用pygame重写了大西瓜游戏,并封装为适合RL环境的代码。
解压图片素材:
unzip res.zip
运行:
python Main.py
即可开始游戏:
RL算法采用DQN算法,其中Keras版本使用了简单的卷积神经网络来计算Q值,PRAL版本使用ResNet。
运行:
python train_keras.py
或者
python train_paddle.py
或者
python train_torch.py
开始训练:
感兴趣的同学关注我的公众号——可达鸭的深度学习教程:
Dear OpenI User
Thank you for your continuous support to the Openl Qizhi Community AI Collaboration Platform. In order to protect your usage rights and ensure network security, we updated the Openl Qizhi Community AI Collaboration Platform Usage Agreement in January 2024. The updated agreement specifies that users are prohibited from using intranet penetration tools. After you click "Agree and continue", you can continue to use our services. Thank you for your cooperation and understanding.
For more agreement content, please refer to the《Openl Qizhi Community AI Collaboration Platform Usage Agreement》