Skip to content

Pull requests: quic/efficient-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Updated example file for ai200 runs
#1127 opened Jun 25, 2026 by asmigosw Contributor Loading…
Revert PR #1105
#1124 opened Jun 25, 2026 by ochougul Contributor Loading…
added head pruning
#1123 opened Jun 25, 2026 by rtambare-pixel Loading…
Reduce MoE export RAM by aliasing fused expert weights as views
#1121 opened Jun 24, 2026 by quic-rishinr Contributor Loading…
cleaned up layerwise API and added CustomLoader (#1105)
#1120 opened Jun 24, 2026 by ochougul Contributor Loading…
qwen3_5 fixes + Tests
#1115 opened Jun 24, 2026 by mohiso22 Contributor Loading…
RepeatKV Transform changes with comments addressed
#1114 opened Jun 24, 2026 by quic-dhirajku Contributor Loading…
Updated supported models list in validate.md
#1112 opened Jun 23, 2026 by quic-vishali Contributor Loading…
Reduce CI model test matrix runtime
#1111 opened Jun 23, 2026 by ochougul Contributor Draft
Add Support for Kimi K2.5 Vision
#1108 opened Jun 22, 2026 by quic-mamta Contributor Draft
Added fix to layerwise implementation to incorporate CB
#1106 opened Jun 22, 2026 by abhishek-singh591 Contributor Loading…
Subfunction fix w/o Layerwise 1.22 Release 1.22 candidate bugfix
#1104 opened Jun 19, 2026 by mohiso22 Contributor Loading…
[Nightly-CI-Summary]: Adding nightly summary report
#1103 opened Jun 19, 2026 by abukhoy Contributor Draft
Fix(0618)(layerwise): dedup merged ONNX graph and keep key prefill passess enabled. 1.22 Release 1.22 candidate bugfix
#1099 opened Jun 18, 2026 by vbaddi Contributor Loading…
Gemma4 CI tests
#1097 opened Jun 18, 2026 by tchawada Contributor Loading…
Feature/add minimax m3 vl
#1096 opened Jun 18, 2026 by shagsood Draft
[CI Exp]: CI Revamp
#1094 opened Jun 17, 2026 by abukhoy Contributor Draft
Fix: Enable Qwen MoE prefill blocking without chunking enhancement New feature or request
#1091 opened Jun 16, 2026 by vbaddi Contributor Draft
KV handoff with buffer slicing APIs to avoid KV I/O copies enhancement New feature or request
#1087 opened Jun 16, 2026 by quic-akuruvil Contributor Loading…
Added MDP generation to QEff Compile
#1086 opened Jun 16, 2026 by quic-mohmeh Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.