Skip to content
View KexinFeng's full-sized avatar

Block or report KexinFeng

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. djl djl Public

    Forked from deepjavalibrary/djl

    An Engine-Agnostic Deep Learning Framework in Java

    Java

  2. EAGLE EAGLE Public

    Forked from SafeAILab/EAGLE

    EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty

    Python

  3. flash-attention flash-attention Public

    Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python

  4. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 64.3k 11.7k