Deleting a branch is permanent. It CANNOT be undone. Continue?
Dear OpenI User
Thank you for your continuous support to the Openl Qizhi Community AI Collaboration Platform. In order to protect your usage rights and ensure network security, we updated the Openl Qizhi Community AI Collaboration Platform Usage Agreement in January 2024. The updated agreement specifies that users are prohibited from using intranet penetration tools. After you click "Agree and continue", you can continue to use our services. Thank you for your cooperation and understanding.
For more agreement content, please refer to the《Openl Qizhi Community AI Collaboration Platform Usage Agreement》
问题描述
昇腾重庆智算,npu训练任务,云脑端任务名gaoya202306202375867,gaoya202307011494499,训练失败,这两个任务的日志,没有显示相关训练失败原因,需要定位下是什么问题
相关环境(GPU/NPU)
NPU
相关集群(启智/智算)
智算
任务类型(调试/训练/推理)
训练
任务名
任务名gaoya202306202375867,gaoya202307011494499
日志说明或问题截图
期望的解决方案或建议
定位到问题
我也遇到过这种情况,训练失败,日志里没有提示信息
最近训练也出现过训练失败,但是日志不显示失败原因,很麻烦,要一点点看代码
gaoya202307011494499这个任务是由于7月12号昇腾重庆智算整体下线,导致任务被强制停止
gaoya202307011494499这个任务失败,分中心回复是由于分中心升级导致的。
刚遇到新建调试任务后,点击调试,在lab中显示Directory not found错误