Pull requests: NVIDIA/Megatron-LM

Pull requests list

ci: Override N_REPEAT
#3051 opened Jan 23, 2026 by ko3n1g
6 tasks
Core 0.16
added vllm fakequant export support
#3050 opened Jan 22, 2026 by kinjalpatel27
6 tasks
Bug fix with --no-use-tokenizer-from-checkpoint-args
Labels: Final Review
#3049 opened Jan 22, 2026 by jon-barker
6 tasks
Ko3n1g/f deca yed/deyuf/dev pull main 260122
#3046 opened Jan 22, 2026 by ko3n1g
6 tasks
Core 0.16
[dev] pull main 260122
#3045 opened Jan 22, 2026 by FDecaYed
6 tasks
Core 0.16
Add absorbed-mla
#3044 opened Jan 22, 2026 by kunlunl
6 tasks
[fix] Bug fix for offloading in evaluate()
Labels: Expert Review
#3043 opened Jan 22, 2026 by lhb8125
6 tasks
Core 0.16
Fuse MLA DOWN projection GEMMs
Labels: community-request, complexity: medium, dev2main: mbridge, Expert Review
#3039 opened Jan 22, 2026 by cjld
6 tasks
Core 0.16
Sync GitHub and Slack teams
#3037 opened Jan 22, 2026 by Phlip79 Queued
6 tasks
Core 0.16
Logging cleanup (only log on rank 0 if possible)
Labels: complexity: medium, Expert Review
#3036 opened Jan 22, 2026 by deepakn94
Core 0.16
Skip empty sequences and chunks in MTP tensor roll
#3035 opened Jan 22, 2026 by BestJuly
6 tasks
Core 0.16
Record moe routing decisions during inference.
#3034 opened Jan 21, 2026 by sidsingh-nvidia
6 tasks
Add ability to save wgrads and dgrads
Labels: complexity: medium, enhancement, Final Review, module: distributed
#3032 opened Jan 21, 2026 by deepakn94
Core 0.16
Add end-to-end tests for M-FSDP and ND-Parallel
Labels: Final Review, module: megatron-fsdp
#3031 opened Jan 21, 2026 by shjwudp
6 tasks
Track off-policyness across RL steps
#3030 opened Jan 21, 2026 by tdene
6 tasks
Core 0.16
Move tensor offload/onload out of RL code
#3029 opened Jan 21, 2026 by tdene
6 tasks
Core 0.16
debug: Revert logging memory as optional
#3028 opened Jan 21, 2026 by chtruong814
6 tasks
Core 0.16
Fix several bugs of experimental attention variant
#3026 opened Jan 21, 2026 by yuzhongw-nvidia
6 tasks
fix ep weight gradnorm/num_zero calculation error for muon
Labels: Final Review, Run functional tests
#3024 opened Jan 21, 2026 by FDecaYed
6 tasks
Core 0.16
spec implementation
#3021 opened Jan 20, 2026 by shanmugamr1992
6 tasks
Minimize README contents
Labels: community-request, docs-only, needs-follow-up
#3020 opened Jan 20, 2026 by megnvidia
6 tasks