Skip to content
View Haozon's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Haozon

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Haozon/README.md

Hi there 👋

About Me

  • 🔭 I’m currently working on Meituan, focusing on algorithms and engineering related to LLM inference acceleration, familar with model quantization and sparsity.

📊 My GitHub Stats

Popular repositories Loading

  1. search-smpq search-smpq Public

    Python 1

  2. yolov5prune yolov5prune Public

    Forked from midasklr/yolov5prune

    learn the project of pruning yolov5

    Python

  3. OSbook OSbook Public

    Forked from NKU-EmbeddedSystem/OSbook

    Experiments for OS book

    Python

  4. CUDA_Custom_MatMul_Experiment CUDA_Custom_MatMul_Experiment Public

    Forked from eduand-alvarez/CUDA_Custom_MatMul_Experiment

    This project integrates a custom CUDA-based matrix multiplication kernel into a PyTorch deep learning model, leveraging GPU acceleration for matrix operations. The goal is to compare the performanc…

    Python

  5. deepsparse deepsparse Public

    Forked from neuralmagic/deepsparse

    Sparsity-aware deep learning inference runtime for CPUs

    Python

  6. getpaper getpaper Public

    用于从awesome中或者其他地方中下载一定主题的 paper(更新ing)

    Python