    Repositories list

    • paroquant

      Public
      [ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
      Python
      MIT License
      2226790 Updated May 7, 2026
    • dflash

      Public
      DFlash: Block Diffusion for Flash Speculative Decoding
      Python
      MIT License
      2714k459 Updated May 6, 2026
    • [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
      Python
      MIT License
      47520 Updated Mar 10, 2026
    • Fast, memory-efficient attention column reduction (e.g., sum, mean, max)
      Python
      MIT License
      14600 Updated Feb 10, 2026