Pull requests: NVIDIA/TransformerEngine

Pull requests list

Add sigmoid GLU
#2656 opened Feb 6, 2026 by singleheart
6 of 8 tasks
[C] NVFP4 quantization for GroupedTensor MoE
#2655 opened Feb 6, 2026 by ksivaman Draft
3 of 13 tasks
[PyTorch] Python GroupedTensor MoE
#2654 opened Feb 6, 2026 by ksivaman
11 of 15 tasks
[Common] Bucket batch size with higher granularity for THD 2.13.0
#2653 opened Feb 6, 2026 by cyanguwa
8 of 13 tasks
[PyTorch Debug] Skip logging stats if unsupported
#2652 opened Feb 5, 2026 by pggPL
8 of 13 tasks
[JAX] Debugging inspect utility
#2651 opened Feb 4, 2026 by jberchtold-nvidia Draft
13 tasks
[Pytorch] Make test script generate checkpoints if they don't exist
#2650 opened Feb 4, 2026 by kainzhong
5 of 13 tasks
[JAX] Fix FSDP when FSDP+EP is active
#2649 opened Feb 3, 2026 by jberchtold-nvidia
8 of 13 tasks
[PyTorch Debug] Support tensor dump
#2645 opened Feb 3, 2026 by pggPL Draft
13 tasks
Add NVTE_KEEP_BACKWARD_UNQUANTIZED
#2644 opened Feb 3, 2026 by zianglih
4 of 13 tasks
ci(fix): NGC build
#2643 opened Feb 2, 2026 by ko3n1g
13 tasks
Add examples for MoE models - Mixtral in TE
#2642 opened Feb 2, 2026 by faradawn
2 of 13 tasks
Fix Broken Quickstart Links
#2641 opened Feb 2, 2026 by faradawn
6 of 13 tasks
docs(readme): update convergence table, latest news, and outdated links
#2638 opened Feb 1, 2026 by sbhavani
5 of 13 tasks
Fix FP8 block scaling with sequence parallel
#2637 opened Jan 31, 2026 by cuichenx
1 of 13 tasks
Fix Github workflows issues
#2636 opened Jan 30, 2026 by pggPL
8 of 13 tasks
Add 2d quant for mxfp8
#2634 opened Jan 29, 2026 by kunlunl
13 tasks
[Common] Fuse pre-swizzling into grouped MXFP8 quantization kernel
#2630 opened Jan 28, 2026 by Oleg-Goncharov
7 of 13 tasks
[PyTorch] Pad V when Q/V head dims differ (MLA) for THD
#2629 opened Jan 27, 2026 by HollowMan6
8 of 13 tasks
[PyTorch] SonicMoE Fused Softmax-TopK Integration
#2627 opened Jan 27, 2026 by denera Draft
4 of 13 tasks
Fix incorrect MNNVL fabric check
#2626 opened Jan 27, 2026 by nvcastet
13 tasks
docs: update cuDNN sliding window attention support
#2624 opened Jan 26, 2026 by sbhavani
7 of 13 tasks