Skip to content

Pull requests: ROCm/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Current scaling: two-stage Triton amax kernel
#385 opened Nov 26, 2025 by matthiasdiener Loading…
6 of 13 tasks
Enable AOTriton BWD V3 API
#382 opened Nov 25, 2025 by Micky774 Loading…
13 tasks
Old FP8 support code cleanup
#379 opened Nov 24, 2025 by ipanfilo Loading…
1 of 13 tasks
Re-enable supported GEMM configs
#378 opened Nov 24, 2025 by ipanfilo Loading…
13 tasks
Layernorm forward optimization
#377 opened Nov 24, 2025 by eliotwang Loading…
13 tasks
IFU dev v2.6
#374 opened Nov 19, 2025 by wangye805 Loading…
9 of 13 tasks
Userbuffer epic
#367 opened Nov 11, 2025 by alextmagro Draft
JAX FA Benchmarking Script
#351 opened Oct 24, 2025 by Micky774 Loading…
13 tasks
Triton norms dispatch refactor
#305 opened Sep 5, 2025 by Micky774 Loading…
13 tasks
heyi's layernorm optimization
#225 opened Jul 3, 2025 by eliotwang Loading…
8 of 13 tasks
Added Dockerfile for CI images
#195 opened May 28, 2025 by VeeraRajasekhar Loading…
7 of 13 tasks
[ROCm] support triton-based flash-attn in TE
#177 opened May 1, 2025 by wangye805 Loading…
8 of 13 tasks
Update attention example attention.ipynb
#152 opened Mar 19, 2025 by anhminhnguyenhoang Loading…
5 of 13 tasks
Honor the NVTE_FUSED_ATTN_<backend> in test_fused_attn.py
#123 opened Feb 11, 2025 by wangye805 Loading…
13 tasks
ProTip! no:milestone will show everything without a milestone.