Skip to content
View kssteven418's full-sized avatar

Block or report kssteven418

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. SqueezeAILab/LLMCompiler SqueezeAILab/LLMCompiler Public

    [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

    Python 1.7k 124

  2. SqueezeAILab/SqueezeLLM SqueezeAILab/SqueezeLLM Public

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    Python 691 45

  3. Squeezeformer Squeezeformer Public

    [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

    Python 252 18

  4. I-BERT I-BERT Public

    [ICML'21 Oral] I-BERT: Integer-only BERT Quantization

    Python 247 36

  5. LTP LTP Public

    [KDD'22] Learned Token Pruning for Transformers

    Python 98 18

  6. BigLittleDecoder BigLittleDecoder Public

    [NeurIPS'23] Speculative Decoding with Big Little Decoder

    Python 92 10