#1204 启智集群mindspore2.2镜像NPU不可用,提供CANN较新版本的镜像

Closed
created 4 months ago by Yuxiang · 1 comments
Yuxiang commented 4 months ago
<!-- 为了更有效地识别与解决您的问题,请尽可能的补充如下信息 --> ### 问题描述 启智集群 NPU 镜像 mindspore2.2_cann7_xianxing,运行mindspore测试报异常,报错:Segmentation fault (core dumped),npu-smi info无法正确识别NPU。 ### 相关环境(GPU/NPU) 环境:启智集群 NPU 镜像:mindspore2.2_cann7_xianxing ### 相关集群(启智/智算) 启智 ### 任务类型(调试/训练/推理) 调试 ### 任务名 yuxia202312281548983 ### 日志说明或问题截图 (MindSpore) [ma-user Ascend]$python -c "import mindspore;mindspore.set_context(device_target='Ascend');mindspore.run_check()" /home/ma-user/anaconda3/envs/MindSpore/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero. setattr(self, word, getattr(machar, word).flat[0]) /home/ma-user/anaconda3/envs/MindSpore/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero. setattr(self, word, getattr(machar, word).flat[0]) MindSpore version: 2.2.0 The result of multiplication calculation is correct, MindSpore has been installed on platform [Ascend] successfully! Segmentation fault (core dumped) (MindSpore) [ma-user Ascend]$npu-smi info npu-smi: symbol lookup error: npu-smi: undefined symbol: dsmi_subscribe_fault_event ### 期望的解决方案或建议 期望提供可用的mindspore2.2或以上版本镜像,或CANN 7.0.RC1.beta1或以上版本镜像 目前平台上mindspore和CANN镜像版本较低,无法调试2023年度更新的代码
Yuxiang commented 4 months ago
Poster
mindspore2.2_cann7_xianxing 2024-01-05日测试,已经能够正常使用,感谢! ``` EulerOS 2.0 (SP8), CANN-7.0.RC1 [V100R001C29],[V100R001C30],[V100R001C13],[V100R003C10],[V100R003C11] Tips: 1) Navigate to the target conda environment. For details, see /home/ma-user/README. 2) Copy (Ctrl+C) and paste (Ctrl+V) on the jupyter terminal. 3) Store your data in /home/ma-user/work, to which a persistent volume is mounted. (MindSpore) [ma-user work]$python -c "import mindspore;mindspore.set_context(device_target='Ascend');mindspore.run_check()" /home/ma-user/anaconda3/envs/MindSpore/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero. setattr(self, word, getattr(machar, word).flat[0]) /home/ma-user/anaconda3/envs/MindSpore/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero. setattr(self, word, getattr(machar, word).flat[0]) MindSpore version: 2.2.0 The result of multiplication calculation is correct, MindSpore has been installed on platform [Ascend] successfully! /home/ma-user/anaconda3/envs/MindSpore/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero. setattr(self, word, getattr(machar, word).flat[0]) /home/ma-user/anaconda3/envs/MindSpore/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero. setattr(self, word, getattr(machar, word).flat[0]) ```
Yuxiang closed this issue 4 months ago
Sign in to join this conversation.
No Milestone
No Assignees
1 Participants
Notifications
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.