Skip to content

Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

与LLaVA官方代码训练结果性能相差较大 pending This problem is yet to be addressed
#5890 opened Nov 1, 2024 by zhipeixu
1 task done
二次预训练阶段全参微调,损失曲线是否正常,如何优化 pending This problem is yet to be addressed
#5888 opened Nov 1, 2024 by Shame-fight
1 task done
How to mask out specific chunks for loss calculation pending This problem is yet to be addressed
#5886 opened Oct 31, 2024 by Hanzhang-lang
1 task done
RuntimeError: a Tensor with 4 elements cannot be converted to Scalar pending This problem is yet to be addressed
#5885 opened Oct 31, 2024 by shedding-ash
1 task done
求助! pending This problem is yet to be addressed
#5884 opened Oct 31, 2024 by lucky0223
训练qwen2-vl-7b-instruct pending This problem is yet to be addressed
#5882 opened Oct 31, 2024 by lxb0425
1 task done
如何离线eval自己的数据集? pending This problem is yet to be addressed
#5881 opened Oct 31, 2024 by GasolSun36
1 task done
LLaMA-3.1-8B, Zero3/FSDP and liger_kernel with embed_tokens/lm)head: pending This problem is yet to be addressed
#5879 opened Oct 30, 2024 by thusinh1969
1 task done
显存充足,无法调用,显示只使用一点显存 pending This problem is yet to be addressed
#5878 opened Oct 30, 2024 by Lgugeng
1 task done
微调qwen2.5 3B模型报“UnicodeDecodeError”错误,请作者帮忙看看,谢谢! pending This problem is yet to be addressed
#5875 opened Oct 30, 2024 by yangdy11111
1 task done
对qwen2.5-14B增量预训练后推理时,部分重复一段话 pending This problem is yet to be addressed
#5872 opened Oct 30, 2024 by Ayanami07
1 task done
视频使用mkv文件报错 pending This problem is yet to be addressed
#5870 opened Oct 30, 2024 by HelloWorld506
1 task done
请问一下 什么时候支持openbmb/MiniCPM-V-2_6 这个多模态的微调 谢谢 pending This problem is yet to be addressed
#5869 opened Oct 30, 2024 by ML-GCN
1 task done
template formatter能否支持一定程度上的逻辑判断? pending This problem is yet to be addressed
#5868 opened Oct 30, 2024 by Ricardo-L-C
1 task done
Question regarding Function Calling in ShareGPT format pending This problem is yet to be addressed
#5866 opened Oct 30, 2024 by emrecanacikgoz
1 task done
多机多卡SFT微调运行报错
#5864 opened Oct 30, 2024 by rocket2q19
1 task done
When exporting, drop unused parameters instead of erroring pending This problem is yet to be addressed
#5853 opened Oct 29, 2024 by inflatebot
1 task done
How to continue training LoRA made without llama factory? pending This problem is yet to be addressed
#5848 opened Oct 28, 2024 by Sehyo
1 task done
Support ferretui model pending This problem is yet to be addressed
#5847 opened Oct 28, 2024 by dushwe
model.generate与llamafactory-cli train do_predict给出的结果不一致 pending This problem is yet to be addressed
#5845 opened Oct 28, 2024 by mzc2113391
1 task done
ProTip! What’s not been updated in a month: updated:<2024-09-30.