Skip to content

Issues: pytorch/torchtitan

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

DDP (replicate) + TP? question Further information is requested
#577 opened Sep 13, 2024 by yzs981130
Pipeline Parallelism + FSDP question Further information is requested
#562 opened Aug 29, 2024 by jeromeku
Adjust MFU to account for FP8
#560 opened Aug 23, 2024 by lessw2020
2D whole model compile fails at embedding layer bug Something isn't working
#534 opened Aug 20, 2024 by tianyu-l
train llama3 error
#502 opened Aug 5, 2024 by starstream
Only half of parameters are saved when applied PP bug Something isn't working
#474 opened Jul 22, 2024 by dmammfl
[FP8 options] Float8Linear vs TransformerEngine question Further information is requested
#462 opened Jul 16, 2024 by yundai424
Question about custom cuda operators for tensor parallelism question Further information is requested
#434 opened Jun 28, 2024 by vermouth1992
Question about Pipeline parallelism question Further information is requested
#431 opened Jun 27, 2024 by vermouth1992
DataLoader state is empty for different ranks ? question Further information is requested
#409 opened Jun 17, 2024 by ahatamiz
Some testing from me
#407 opened Jun 17, 2024 by ad8e
How to use nsys? enhancement New feature or request
#399 opened Jun 13, 2024 by vedantroy
benchmark perf numbers on H100 GPUs and update performance.md documentation Improvements or additions to documentation
#394 opened Jun 12, 2024 by tianyu-l torchtitan release 1.0
Add torchdata to requirements after release better_engineering Repo code quality improvements
#351 opened May 21, 2024 by gokulavasan
freqs_cis in llama model should be a non-persistent buffer bug Something isn't working
#316 opened May 8, 2024 by tianyu-l
ProTip! Updated in the last three days: updated:>2024-09-17.