Skip to content
View xuebwang-amd's full-sized avatar

Block or report xuebwang-amd

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. QuaRot QuaRot Public

    Forked from spcl/QuaRot

    Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.

    Python

  2. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  3. transformers transformers Public

    Forked from huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python

  4. inference_results_v5.1 inference_results_v5.1 Public

    Forked from mlcommons/inference_results_v5.1

    This repository contains the results and code for the MLPerf™ Inference v5.1 benchmark.

    HTML