#1228 nvidia这边ubuntu22的gpu镜像与cuda drvier11.4不兼容,建议清理下架

Closed
created 3 months ago by xiaoxiong · 2 comments
cuda drvier11.4是安装在nvidia物理机上的驱动,最高支持的是ubuntu18和20,与22并不兼容;另外默认安装的cuda11.8高于cuda drvier11.4,也不兼容;这两个都会导致训练直接报错。且集群镜像通过容器化技术将cuda drvier11.4直接导入进来,用户也无法卸载升级该驱动来修复版本不兼容的问题(除非官方直接对物理机上的cuda drvier进行升级)。
liuzx commented 2 months ago
Collaborator
cuda版本跟ubuntu没有关系,跟底层驱动版本有关系。你的报错是什么报错?可以贴出具体的任务名,或截图看看
liuzx commented 2 months ago
Collaborator
gpu镜像已清理下架,并上新的镜像
liuzx closed this issue 2 months ago
Sign in to join this conversation.
No Milestone
No Assignees
2 Participants
Notifications
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.