Skip to content
View AbiGMe's full-sized avatar
🧠
E-com real estate app | Spring Boot·Keycloak·PG·Redis·Docker·Next.js·Flutter
🧠
E-com real estate app | Spring Boot·Keycloak·PG·Redis·Docker·Next.js·Flutter
  • Illinois

Highlights

  • Pro

Block or report AbiGMe

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
10 stars written in C++
Clear filter

LLM inference in C/C++

C++ 90,638 13,889 Updated Dec 1, 2025

Port of OpenAI's Whisper model in C/C++

C++ 44,815 4,972 Updated Nov 20, 2025

Cross-platform, customizable ML solutions for live and streaming media.

C++ 32,172 5,630 Updated Nov 25, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,267 1,900 Updated Dec 1, 2025

A Fast and Easy to use microframework for the web.

C++ 4,548 507 Updated Nov 22, 2025

An industrial-grade C++ implementation of RAFT consensus algorithm based on brpc, widely used inside Baidu to build highly-available distributed systems.

C++ 4,183 912 Updated Oct 25, 2024

a distributed deep learning platform

C++ 3,583 1,273 Updated Nov 26, 2025

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,555 122 Updated Mar 23, 2025

A library of GPU kernels for sparse matrix operations.

C++ 277 54 Updated Nov 24, 2020

Single-thread, end-to-end C++ implementation of the Bitnet (1.58-bit weight) model

C++ 13 4 Updated Nov 17, 2024