Skip to content
@FasterDecoding

FasterDecoding

Think deeper, decode faster

Pinned Loading

  1. Medusa Medusa Public

    Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

    Jupyter Notebook 2.7k 190

Repositories

Showing 5 of 5 repositories
  • REST Public

    REST: Retrieval-Based Speculative Decoding, NAACL 2024

    FasterDecoding/REST’s past year of commit activity
    C 213 Apache-2.0 17 12 0 Updated Sep 11, 2025
  • SnapKV Public
    FasterDecoding/SnapKV’s past year of commit activity
    Python 297 Apache-2.0 27 18 1 Updated Jul 10, 2025
  • TEAL Public
    FasterDecoding/TEAL’s past year of commit activity
    Python 157 MIT 13 5 1 Updated Feb 15, 2025
  • BitDelta Public
    FasterDecoding/BitDelta’s past year of commit activity
    Jupyter Notebook 204 Apache-2.0 17 3 1 Updated Dec 5, 2024
  • Medusa Public

    Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

    FasterDecoding/Medusa’s past year of commit activity
    Jupyter Notebook 2,684 Apache-2.0 190 50 6 Updated Jun 25, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics