name | about | labels |
---|---|---|
Bug Report | Use this template for reporting a bug | kind/bug |
resnet50图像学习在910A上graph模式训练精度达不到官网的0.74
Ascend
/GPU
/CPU
) / 硬件环境:Please delete the backend not involved / 请删除不涉及的后端:
/device ascend
Software Environment / 软件环境 (Mandatory / 必填):
-- MindSpore version (e.g., 1.7.0.Bxxx) :
-- Python version (e.g., Python 3.7.5) :
-- OS platform and distribution (e.g., Linux Ubuntu 16.04):
-- GCC/Compiler version (if compiled from source):
Excute Mode / 执行模式 (Mandatory / 必填)(PyNative
/Graph
):
Please delete the mode not involved / 请删除不涉及的模式:
/mode graph
用例路径:solution_test/cases/03subject_test/06document/02network_cases
关联用例:test_ms_tutorial_cv_resnet50_image_classification_0001.py
网络训练成功,精度达到0.74
Epoch: [ 1/ 5], Steps: [ 1/196], Train Loss: [2.312]
Epoch: [ 1/ 5], Steps: [101/196], Train Loss: [1.825]
Epoch: [ 1/ 5], Steps: [196/196], Train Loss: [1.425]
--------------------------------------------------
Epoch: [ 1/ 5], Average Train Loss: [1.873], Accuracy: [0.472]
--------------------------------------------------
Epoch: [ 2/ 5], Steps: [ 1/196], Train Loss: [1.543]
Epoch: [ 2/ 5], Steps: [101/196], Train Loss: [1.393]
Epoch: [ 2/ 5], Steps: [196/196], Train Loss: [1.366]
--------------------------------------------------
Epoch: [ 2/ 5], Average Train Loss: [1.348], Accuracy: [0.564]
--------------------------------------------------
Epoch: [ 3/ 5], Steps: [ 1/196], Train Loss: [1.150]
Epoch: [ 3/ 5], Steps: [101/196], Train Loss: [1.208]
Epoch: [ 3/ 5], Steps: [196/196], Train Loss: [1.011]
--------------------------------------------------
Epoch: [ 3/ 5], Average Train Loss: [1.175], Accuracy: [0.600]
--------------------------------------------------
Epoch: [ 4/ 5], Steps: [ 1/196], Train Loss: [1.045]
Epoch: [ 4/ 5], Steps: [101/196], Train Loss: [1.151]
Epoch: [ 4/ 5], Steps: [196/196], Train Loss: [0.923]
--------------------------------------------------
Epoch: [ 4/ 5], Average Train Loss: [1.100], Accuracy: [0.617]
--------------------------------------------------
Epoch: [ 5/ 5], Steps: [ 1/196], Train Loss: [0.969]
Epoch: [ 5/ 5], Steps: [101/196], Train Loss: [0.941]
Epoch: [ 5/ 5], Steps: [196/196], Train Loss: [1.063]
--------------------------------------------------
Epoch: [ 5/ 5], Average Train Loss: [1.069], Accuracy: [0.623]
走给吕昱峰
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。
感谢您的提问,您可以评论//mindspore-assistant更快获取帮助:
训练到11个epoch才达到官网的指标
CCB结论:
遗留原因:910A从VM切换到GE后架构发生变动
风险:外部用户使用本教程时设为GRAPH_MODE时会碰到精度收敛慢的问题
规避措施:本版本遗留,630版本继续解决
二分法定位到commitID为0140d5d8eea76c016454fac1d779492dd84ca818导致出现的问题,相关PRhttps://gitee.com/mindspore/mindspore/pulls/67037
登录 后才可以发表评论