#2435 智算网络NPU多卡训练任务支持

Closed
created 1 year ago by lewis · 5 comments
lewis commented 1 year ago
目前对于智算网络的NPU训练任务并不支持多卡训练,缺少rank_table等信息,需要根据自定义镜像中的run_train.sh改写启动命令。
lewis added the
enhancement
label 1 year ago
tanglj added this to the V20220718 milestone 1 year ago
tanglj modified the milestone from V20220718 to V20220801 1 year ago
lewis was assigned by tanglj 1 year ago
liuzx was assigned by wangj 1 year ago
wangj commented 1 year ago
Owner
@liuzx 能提供一下样例代码吗
liuzx commented 1 year ago
Collaborator
参考https://git.openi.org.cn/OpenIOSSG/MNIST_Example/src/branch/master/train_for_c2net_dataparallel.py 的代码
lewis commented 1 year ago
Owner
grampus_multi_parallel分支,可测试。
lewis added the
test
label 1 year ago
lewis commented 1 year ago
Owner
![image](/attachments/3313f8fb-8b70-45bd-8315-08887c8eb2f8) 选择这个镜像去做单卡以及多卡的测试。
wangj commented 1 year ago
Owner
已随V20220801.patch上线。
wangj closed this issue 1 year ago
Sign in to join this conversation.
No Milestone
No Assignees
3 Participants
Notifications
Due Date

No due date set.

Loading…
There is no content yet.