Skip to content
View qianlihuang's full-sized avatar
  • Shanghai, China

Highlights

  • Pro

Block or report qianlihuang

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
qianlihuang/README.md

Hi there 👋

Pinned Loading

  1. sglang sglang Public

    Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python

  2. TensorRT-LLM TensorRT-LLM Public

    Forked from NVIDIA/TensorRT-LLM

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

    Python

  3. SpecForge SpecForge Public

    Forked from sgl-project/SpecForge

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    Python

  4. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python