🧠
E-com real estate app | Spring Boot·Keycloak·PG·Redis·Docker·Next.js·Flutter
  • Illinois

Stars

LLM Inference, Alignment, DPO

Also: preference tuning and reward optimization, high throughput, RL, HPO, KV cache compression, reasoning methods, jailbreaking, and TTT and computation.
125 repositories

random search, hill climbing, policy gradient

Python 145 68 Updated Sep 17, 2018

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,874 682 Updated Oct 11, 2025

Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch

Python 180 8 Updated Jun 20, 2025

An implementation of PPO in Pytorch

Python 100 11 Updated Nov 27, 2025

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 374 24 Updated Jul 8, 2025

LLM KV cache compression made easy

Python 700 78 Updated Nov 27, 2025

Curated list of datasets and tools for post-training.

4,025 331 Updated Nov 10, 2025

LLM101n: Let's build a Storyteller

35,718 1,946 Updated Aug 1, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,190 982 Updated Jul 1, 2024

An autoregressive character-level language model for making more things

Python 3,476 876 Updated Jun 4, 2024

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 392 57 Updated Jun 10, 2025

A simple, performant and scalable Jax LLM!

Python 2,011 433 Updated Dec 1, 2025

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference

Python 78 21 Updated Sep 15, 2025

LLM inference in C/C++

C++ 90,638 13,889 Updated Dec 1, 2025

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,555 122 Updated Mar 23, 2025

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…

Python 8,795 767 Updated Nov 30, 2025

Jlama is a modern LLM inference engine for Java

Java 1,200 138 Updated Oct 12, 2025

Inference code for Llama models

Python 58,955 9,819 Updated Jan 26, 2025

Implementation of Direct Preference Optimization

Python 17 1 Updated Jul 17, 2023

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,502 259 Updated Aug 13, 2024

Structured outputs for LLMs

Python 11,905 897 Updated Nov 28, 2025

[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Python 671 63 Updated Jun 28, 2025

Implementation for MatMul-free LM.

Python 3,037 197 Updated Jul 21, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,267 1,900 Updated Dec 1, 2025

Avalanche: an End-to-End Library for Continual Learning based on PyTorch.

Python 1,989 309 Updated Mar 11, 2025

Code for "Supermasks in Superposition"

Python 124 22 Updated Oct 3, 2023

Collection of reinforcement learning algorithms

Python 2,821 564 Updated Jun 17, 2024

Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023

Python 20 2 Updated Nov 4, 2024

🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking

Python 212 63 Updated Nov 19, 2025