feat & fix: support GQA/MQA and decode-phase attention via IAttentionLayer; add comprehensive HLO-level tests; fix bugs #4246

Merged
zewenli98 merged 5 commits into main from evanli/hlo-attention-tests on May 13, 2026

Commits

Commits on May 12, 2026