Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[JAX] Load modules during initialize for Norm and Act primitives
#2219 opened Sep 30, 2025 by jberchtold-nvidia Loading…
8 of 13 tasks
[JAX] Rework amax reduction over TPSP
#2218 opened Sep 30, 2025 by phu0ngng Loading…
7 of 13 tasks
[JAX] Fix rng_state shape in fused attention
#2217 opened Sep 30, 2025 by phu0ngng Loading…
7 of 13 tasks
[PyTorch Debug] Custom feature tutorial.
#2216 opened Sep 30, 2025 by pggPL Loading…
8 of 13 tasks
[Draft][PyTorch][MOE] Support NVFP4 Grouped Linear
#2215 opened Sep 30, 2025 by zhongbozhu Loading…
1 of 17 tasks
UBNEXT with optional add-rms fuse
#2212 opened Sep 29, 2025 by nv-akorzh Loading…
[PyTorch] fix int32 overflow in permute kernels
#2196 opened Sep 23, 2025 by hxbai Loading…
1 of 13 tasks
[PyTorch] Add max_score support for MuonClip 2.9.0
#2195 opened Sep 22, 2025 by cyanguwa Loading…
8 of 13 tasks
[JAX] Clamped Swiglu Integration
#2194 opened Sep 22, 2025 by vthumbe1503 Draft
13 tasks
FSDP grad fusion support
#2191 opened Sep 21, 2025 by sanandaraj5597 Loading…
[Feat] Draft: support offloading activation
#2187 opened Sep 18, 2025 by lhb8125 Loading…
13 tasks
[PyTorch Debug] Add nvdlfw-inspect to dependencies
#2173 opened Sep 15, 2025 by pggPL Draft
7 tasks done
[Pytorch] Support for Swiglu Activation used in GPT OSS
#2161 opened Sep 8, 2025 by vthumbe1503 Loading…
8 of 13 tasks
[Common][PyTorch][Rework] PDL for Quantization
#2150 opened Sep 4, 2025 by yaox12 Loading…
1 of 13 tasks
[PyTorch] CPU Overhead Micro-optimizations
#2146 opened Sep 2, 2025 by zhongbozhu Loading…
13 tasks
[main][feature][under updating]adapt for offload activation
#2145 opened Sep 2, 2025 by GeYuhong Loading…
1 of 13 tasks
ProTip! no:milestone will show everything without a milestone.