#393 启智训练集群偶现lr为负数的情况

Closed
created 1 year ago by lixiangyi · 1 comments
lixiangyi commented 1 year ago
<!-- 为了更有效地识别与解决您的问题,请尽可能的补充如下信息 --> ### 问题描述 在单节点8卡训练及多节点8卡训练均出现过lr为负数的情况,无法训练网络 ### 相关环境(GPU/NPU) NPU ### 相关集群(启智/智算) 启智 ### 任务类型(调试/训练/推理) 训练 ### 任务名 lixia202212061510312 - v24和v23 ### 日志说明或问题截图 ![image](/attachments/69c46938-decf-4ed4-841f-8095bf7125d0) ### 期望的解决方案或建议 修复lr为负数的情况
liuzx commented 1 month ago
Collaborator
重试下看是否还有此问题
liuzx closed this issue 1 month ago
Sign in to join this conversation.
No Milestone
No Assignees
2 Participants
Notifications
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.