name | about | labels |
---|---|---|
Bug Report | Use this template for reporting a bug | kind/bug |
[ST][MS][MF][r2.3][qwen_14b_8K长序列][微调][910B3 8P]网络微调性能劣化,880 < 901
模型仓地址:https://gitee.com/mindspore/mindformers/blob/dev/research/qwen/qwen.md
Ascend
/GPU
/CPU
) / 硬件环境:Please delete the backend not involved / 请删除不涉及的后端:
/device ascend/
CANN版本:MILAN-Florence-ASL/ABL V100R001C17SPC001B240 Alpha
Mindspore版本:MindSpore_r2.3_d51c17c7(MindSporeDaily)
MindFormers版本:MindFormers_dev_a4fc9e6d(MindFormersDaily)
PyNative
/Graph
):Please delete the mode not involved / 请删除不涉及的模式:
/mode graph
用例仓地址:MindFormers_Test/cases/qwen/14b/train/
用例:
不涉及
网络训推理成功,编译时间达标,性能达标
8192 * 0.1075 = 880.64 < 901
2024-04-24 21:58:00,225 - mindformers[mindformers/core/callback/callback.py:327] - INFO - 0.4% | | 0.10743 samples/s/p 3 days, 11:42:54 }
2024-04-24 21:58:18,837 - mindformers[mindformers/core/callback/callback.py:256] - WARNING - micro_batch_interleave_num: %s > 1, multiple copies in parallel is open.
2024-04-24 21:58:18,838 - mindformers[mindformers/core/callback/callback.py:319] - INFO - { Epoch:[ 1/ 5], step:[ 126/ 6500], loss: 0.927, per_step_time: 9302ms, lr: 1e-05, overflow cond: False, loss_scale: 64.0
2024-04-24 21:58:18,838 - mindformers[mindformers/core/callback/callback.py:327] - INFO - 0.4% | | 0.10750 samples/s/p 3 days, 11:39:07 }
2024-04-24 21:58:37,455 - mindformers[mindformers/core/callback/callback.py:256] - WARNING - micro_batch_interleave_num: %s > 1, multiple copies in parallel is open.
2024-04-24 21:58:37,455 - mindformers[mindformers/core/callback/callback.py:319] - INFO - { Epoch:[ 1/ 5], step:[ 128/ 6500], loss: 0.760, per_step_time: 9304ms, lr: 1e-05, overflow cond: False, loss_scale: 64.0
2024-04-24 21:58:37,456 - mindformers[mindformers/core/callback/callback.py:327] - INFO - 0.4% | | 0.10748 samples/s/p 3 days, 11:39:55 }
2024-04-24 21:58:56,080 - mindformers[mindformers/core/callback/callback.py:256] - WARNING - micro_batch_interleave_num: %s > 1, multiple copies in parallel is open.
2024-04-24 21:58:56,081 - mindformers[mindformers/core/callback/callback.py:319] - INFO - { Epoch:[ 1/ 5], step:[ 130/ 6500], loss: 1.167, per_step_time: 9307ms, lr: 1e-05, overflow cond: False, loss_scale: 64.0
2024-04-24 21:58:56,081 - mindformers[mindformers/core/callback/callback.py:327] - INFO - 0.4% | | 0.10744 samples/s/p 3 days, 11:41:23 }
走给杨贵龙
Please assign maintainer to check this issue.
请为此issue分配处理人。
@sunjiawei999
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。
感谢您的提问,您可以评论//mindspore-assistant更快获取帮助:
已发送邮件调整基线,根据邮件中基线重新测试
调整基线需要CCB
新模型第一次验收,无需ccb,邮件记录基线变更,按照最新版本的性能数据作为基准
按最新转测邮件性能880tokens/s基线看护,问题单关闭
Sign in to comment