Issues: hiyouga/LLaMA-Factory
Issues list
Large performance gap compared with training results from the official LLaVA code
pending
This problem is yet to be addressed
#5890
opened Nov 1, 2024 by
zhipeixu
1 task done
Full-parameter fine-tuning in the continued pre-training stage: is the loss curve normal, and how can it be optimized?
pending
This problem is yet to be addressed
#5888
opened Nov 1, 2024 by
Shame-fight
1 task done
[trainer_utils.py] Why does the layerwise GaLore optimizer not support gradient accumulation? Are there any underlying reasons?
pending
This problem is yet to be addressed
#5887
opened Nov 1, 2024 by
oncleJules
1 task done
How to mask out specific chunks for loss calculation
pending
This problem is yet to be addressed
#5886
opened Oct 31, 2024 by
Hanzhang-lang
1 task done
RuntimeError: a Tensor with 4 elements cannot be converted to Scalar
pending
This problem is yet to be addressed
#5885
opened Oct 31, 2024 by
shedding-ash
1 task done
Training with the dataset provided by the LLaVA paper raises the error “The number of images does not match the number of image tokens”
pending
This problem is yet to be addressed
#5883
opened Oct 31, 2024 by
wwwbq
1 task done
Training qwen2-vl-7b-instruct
pending
This problem is yet to be addressed
#5882
opened Oct 31, 2024 by
lxb0425
1 task done
How can I evaluate my own dataset offline?
pending
This problem is yet to be addressed
#5881
opened Oct 31, 2024 by
GasolSun36
1 task done
LLaMA-3.1-8B, Zero3/FSDP and liger_kernel with embed_tokens/lm_head:
pending
This problem is yet to be addressed
#5879
opened Oct 30, 2024 by
thusinh1969
1 task done
GPU memory is sufficient but cannot be utilized; only a small amount of GPU memory is shown as in use
pending
This problem is yet to be addressed
#5878
opened Oct 30, 2024 by
Lgugeng
1 task done
openapi.json has no file-upload endpoints; how can file analysis be done through the API? This is needed to debug multimodal models
pending
This problem is yet to be addressed
#5876
opened Oct 30, 2024 by
a67793581
1 task done
Fine-tuning the qwen2.5 3B model raises a “UnicodeDecodeError”; could the author please take a look? Thanks!
pending
This problem is yet to be addressed
#5875
opened Oct 30, 2024 by
yangdy11111
1 task done
After incremental pre-training of qwen2.5-14B, inference sometimes repeats a passage
pending
This problem is yet to be addressed
#5872
opened Oct 30, 2024 by
Ayanami07
1 task done
Error when using mkv files for video
pending
This problem is yet to be addressed
#5870
opened Oct 30, 2024 by
HelloWorld506
1 task done
When will fine-tuning of the multimodal model openbmb/MiniCPM-V-2_6 be supported? Thanks
pending
This problem is yet to be addressed
#5869
opened Oct 30, 2024 by
ML-GCN
1 task done
Can the template formatter support some degree of conditional logic?
pending
This problem is yet to be addressed
#5868
opened Oct 30, 2024 by
Ricardo-L-C
1 task done
Question regarding Function Calling in ShareGPT format
pending
This problem is yet to be addressed
#5866
opened Oct 30, 2024 by
emrecanacikgoz
1 task done
When exporting, drop unused parameters instead of erroring
pending
This problem is yet to be addressed
#5853
opened Oct 29, 2024 by
inflatebot
1 task done
Newcomer for help: If the same training corpus is used, is there a way to save the pre-tokenized data and load it directly next time?
pending
This problem is yet to be addressed
#5851
opened Oct 29, 2024 by
Wiselnn570
1 task done
QLoRA fine-tuning and export of llama-3.1-70B-instruction succeeded on an A40 with 96G of GPU memory; trying to run it on a CPU-only server reports: You are trying to offload the whole model to the disk. Please use the disk_offload function instead
pending
This problem is yet to be addressed
#5849
opened Oct 28, 2024 by
gannyee
1 task done
How to continue training LoRA made without llama factory?
pending
This problem is yet to be addressed
#5848
opened Oct 28, 2024 by
Sehyo
1 task done
Support ferretui model
pending
This problem is yet to be addressed
#5847
opened Oct 28, 2024 by
dushwe
model.generate and llamafactory-cli train do_predict give inconsistent results
pending
This problem is yet to be addressed
#5845
opened Oct 28, 2024 by
mzc2113391
1 task done