Skip to content

Issues: Lightning-AI/litgpt

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

use initial_checkpoint_dir for continue-pretraining but can't load model correctly documentation Improvements or additions to documentation enhancement New feature or request question Further information is requested
#1729 opened Sep 18, 2024 by wodelt
Question about tie_embeddings question Further information is requested
#1727 opened Sep 14, 2024 by twaka
Cannot attend to 9904, block size is only 4096 question Further information is requested
#1717 opened Sep 11, 2024 by starjob42
Data Loading bug in pretrain on resume over multiple epochs bug Something isn't working
#1712 opened Sep 7, 2024 by fdalvi
Qwen series question Further information is requested
#1709 opened Sep 3, 2024 by Godlikemandyy
[BUG] LLaMA 3.1 RoPE question Further information is requested
#1699 opened Aug 28, 2024 by zzhhjjj
Training not working with default script bug Something isn't working
#1698 opened Aug 28, 2024 by ByteBrigand
Microsoft Phi 3.5 MoE enhancement New feature or request
#1686 opened Aug 21, 2024 by rasbt
attention mask is incorrect when generate with softcapping bug Something isn't working
#1672 opened Aug 13, 2024 by twaka
Disable KV cache option enhancement New feature or request
#1671 opened Aug 12, 2024 by rasbt
Gemma 2B weights seem to have changed bug Something isn't working
#1665 opened Aug 8, 2024 by rasbt
Tensor parallelism generates non-sensical outputs bug Something isn't working
#1663 opened Aug 8, 2024 by rasbt
Use FlexAttention enhancement New feature or request performance
#1662 opened Aug 8, 2024 by rasbt
TPU Pod Training question Further information is requested
#1643 opened Jul 30, 2024 by opooladz
access hidden layer(s) from a model question Further information is requested
#1642 opened Jul 30, 2024 by Byungsooo
Implement prompt caching to speed up inference enhancement New feature or request
#1638 opened Jul 27, 2024 by rasbt
Skip safetensors->bin file conversion enhancement New feature or request
#1625 opened Jul 24, 2024 by rasbt
Support downloading and using quantized weights (GGUF) enhancement New feature or request
#1616 opened Jul 23, 2024 by rasbt
ProTip! Follow long discussions with comments:>50.