Deleting a branch is permanent. It CANNOT be undone. Continue?
Deleting a branch is permanent. It CANNOT be undone. Continue?
Dear OpenI User
Thank you for your continuous support to the Openl Qizhi Community AI Collaboration Platform. In order to protect your usage rights and ensure network security, we updated the Openl Qizhi Community AI Collaboration Platform Usage Agreement in January 2024. The updated agreement specifies that users are prohibited from using intranet penetration tools. After you click "Agree and continue", you can continue to use our services. Thank you for your cooperation and understanding.
For more agreement content, please refer to the《Openl Qizhi Community AI Collaboration Platform Usage Agreement》
问题描述
创建训练任务可以,创建调试任务则会报错
https://openi.pcl.ac.cn/dingleilei/study/cloudbrain/210676
相关环境(GPU/NPU)
GPU
相关集群(启智/智算)
启智
任务类型(调试/训练/推理)
调试
任务名
dingl202309102151918
日志说明或问题截图
[Scheduled]
Successfully assigned ed9ed5aa2938366557d3851691b38794/ba5f00f004fdc011ee08fce0c73e6100b9fd-task1-0 to t4-32
[Pulled]
Container image "dockerhub.pcl.ac.cn:5000/user-images/openi:lora-scripts" already present on machine
[Created]
Created container task1-container
[Failed]
Error: failed to start container "task1-container": Error response from daemon: error while creating mount source path '/mnt/opendata/minio/opendata/jobs/dingl2023091021t185894150/pretrainmodel/v1-5-pruned-emaonly.ckpt': mkdir /mnt/opendata/minio/opendata: no such file or directory
[Scheduled]
Successfully assigned ed9ed5aa2938366557d3851691b38794/ba5f00f004fdc011ee08fce0c73e6100b9fd-task1-0 to t4-32
[Pulled]
Container image "dockerhub.pcl.ac.cn:5000/user-images/openi:lora-scripts" already present on machine
[Created]
Created container task1-container
[Failed]
Error: failed to start container "task1-container": Error response from daemon: error while creating mount source path '/mnt/opendata/minio/opendata/jobs/dingl2023091021t185894150/code': mkdir /mnt/opendata/minio/opendata: no such file or directory
[Error]
Error on reading termination message from logs: failed to try resolving symlinks in path "/var/log/pods/ed9ed5aa2938366557d3851691b38794_ba5f00f004fdc011ee08fce0c73e6100b9fd-task1-0_428867e1-e8e6-40b9-a10a-6447fba69395/task1-container/0.log": lstat /var/log/pods/ed9ed5aa2938366557d3851691b38794_ba5f00f004fdc011ee08fce0c73e6100b9fd-task1-0_428867e1-e8e6-40b9-a10a-6447fba69395/task1-container/0.log: no such file or directory
期望的解决方案或建议
可以创建