Stars
MiMo-Audio: Audio Language Models are Few-Shot Learners
Implementation for FP8/INT8 Rollout for RL training without performence drop.
FlashInfer: Kernel Library for LLM Serving
Development repository for the Triton language and compiler
slime is an LLM post-training framework for RL Scaling.
从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!
vLLM Documentation in Chinese Simplified / vLLM 中文文档
An open-source framework for self-supervised recommender systems.
AgentScope: Agent-Oriented Programming for Building LLM Applications
[WSDM'2024 Oral] "LLMRec: Large Language Models with Graph Augmentation for Recommendation"
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
4 bits quantization of LLaMA using GPTQ
Open, Multi-modal Catalog for Data & AI
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Create a knowledge graph out of unstructed legal text - use said knowledge graph in a graph augmented retrieval augmented generation pipeline
LLMとKGを使った推薦システム(使用了LLM和KG可以生成比较文字的商品推荐系统)
Knowledge graph-based retrieval augmeted generation demonstrator for EGC 2024
Hugging Face RoBERTa with Flash Attention 2