Hao Zhang Haozon

🎯 Focusing

1 follower · 13 following

Student

Haozon/README.md

Hi there 👋

About Me

🔭 I’m currently working at Meituan, focusing on algorithms and engineering for LLM inference acceleration; familiar with model quantization and sparsity.

📊 My GitHub Stats

Popular repositories

search-smpq Public

Python 1

yolov5prune Public

Forked from midasklr/yolov5prune

A project for learning how to prune YOLOv5

Python

OSbook Public

Forked from NKU-EmbeddedSystem/OSbook

Experiments for OS book

Python

CUDA_Custom_MatMul_Experiment Public

Forked from eduand-alvarez/CUDA_Custom_MatMul_Experiment

This project integrates a custom CUDA-based matrix multiplication kernel into a PyTorch deep learning model, leveraging GPU acceleration for matrix operations. The goal is to compare the performanc…

Python

deepsparse Public

Forked from neuralmagic/deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Python

getpaper Public

Downloads papers on a given topic from awesome lists or other sources (still being updated)

Python