Skip to content

Actions: huggingface/nanotron

Code Quality

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
510 workflow runs
510 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Ring attention
Code Quality #505: Pull request #181 synchronize by zzhhjjj
July 21, 2024 14:17 17s zzhhjjj:ring_attention
July 21, 2024 14:17 17s
Ring attention
Code Quality #504: Pull request #181 synchronize by zzhhjjj
July 19, 2024 17:44 15s zzhhjjj:ring_attention
July 19, 2024 17:44 15s
Ring attention
Code Quality #503: Pull request #181 synchronize by zzhhjjj
July 19, 2024 17:25 19s zzhhjjj:ring_attention
July 19, 2024 17:25 19s
Memory optimization in async tp-linear
Code Quality #502: Pull request #208 opened by AleHD
July 18, 2024 16:49 17s AleHD:mem_fix_async
July 18, 2024 16:49 17s
Ring attention
Code Quality #501: Pull request #181 synchronize by zzhhjjj
July 18, 2024 16:45 16s zzhhjjj:ring_attention
July 18, 2024 16:45 16s
Ring attention
Code Quality #500: Pull request #181 synchronize by zzhhjjj
July 18, 2024 15:58 16s zzhhjjj:ring_attention
July 18, 2024 15:58 16s
Fix tp mem cache
Code Quality #499: Pull request #203 synchronize by AleHD
July 17, 2024 15:21 17s AleHD:fix_tp_mem_cache
July 17, 2024 15:21 17s
Fix tp mem cache
Code Quality #491: Pull request #203 reopened by 3outeille
July 15, 2024 10:22 16s AleHD:fix_tp_mem_cache
July 15, 2024 10:22 16s
Merge pull request #207 from C-TC/recompute
Code Quality #490: Commit 4c23ed0 pushed by 3outeille
July 14, 2024 11:59 17s main
July 14, 2024 11:59 17s
[FP8 Training] End-to-end FP8 Training
Code Quality #489: Pull request #70 synchronize by xrsrke
July 12, 2024 09:35 15s xrsrke/fp8-end-to-end
July 12, 2024 09:35 15s
[FP8 Training] End-to-end FP8 Training
Code Quality #488: Pull request #70 synchronize by xrsrke
July 10, 2024 13:19 22s xrsrke/fp8-end-to-end
July 10, 2024 13:19 22s
[FP8 Training] End-to-end FP8 Training
Code Quality #487: Pull request #70 synchronize by xrsrke
July 10, 2024 10:39 19s xrsrke/fp8-end-to-end
July 10, 2024 10:39 19s
[Feature] Monitor model states during training
Code Quality #486: Pull request #183 synchronize by xrsrke
July 10, 2024 03:38 15s xrsrke/monitor_nn
July 10, 2024 03:38 15s
[FP8 Training] End-to-end FP8 Training
Code Quality #485: Pull request #70 synchronize by xrsrke
July 9, 2024 08:21 16s xrsrke/fp8-end-to-end
July 9, 2024 08:21 16s
Add layer-wise activation recomputation to llama model
Code Quality #484: Pull request #207 opened by C-TC
July 8, 2024 11:56 22s C-TC:recompute
July 8, 2024 11:56 22s
[FP8 Training] End-to-end FP8 Training
Code Quality #483: Pull request #70 synchronize by xrsrke
July 8, 2024 11:35 16s xrsrke/fp8-end-to-end
July 8, 2024 11:35 16s
[FP8 Training] End-to-end FP8 Training
Code Quality #482: Pull request #70 synchronize by xrsrke
July 8, 2024 08:45 20s xrsrke/fp8-end-to-end
July 8, 2024 08:45 20s
[FP8 Training] End-to-end FP8 Training
Code Quality #481: Pull request #70 synchronize by xrsrke
July 5, 2024 12:53 15s xrsrke/fp8-end-to-end
July 5, 2024 12:53 15s
[FP8 Training] End-to-end FP8 Training
Code Quality #480: Pull request #70 synchronize by xrsrke
July 5, 2024 12:36 22s xrsrke/fp8-end-to-end
July 5, 2024 12:36 22s
Move MoE Implementation into src/, add Load Balancing Losses
Code Quality #479: Pull request #192 synchronize by haeggee
July 3, 2024 13:45 17s swiss-ai:moe
July 3, 2024 13:45 17s
Llama3 conversion scripts 🦙
Code Quality #476: Pull request #174 synchronize by ischlag
July 2, 2024 15:04 20s TJ-Solergibert:llama3_converter
July 2, 2024 15:04 20s
Ring attention
Code Quality #475: Pull request #181 synchronize by zzhhjjj
July 2, 2024 14:32 16s zzhhjjj:ring_attention
July 2, 2024 14:32 16s
Ring attention
Code Quality #474: Pull request #181 synchronize by zzhhjjj
July 2, 2024 14:20 19s zzhhjjj:ring_attention
July 2, 2024 14:20 19s
Ring attention
Code Quality #472: Pull request #181 synchronize by zzhhjjj
July 2, 2024 13:33 17s zzhhjjj:ring_attention
July 2, 2024 13:33 17s