A high-throughput and memory-efficient inference and serving engine for LLMs
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
SGLang is a fast serving framework for large language models and vision language models.
🔥 MaxKB is a powerful, easy-to-use open-source platform for building enterprise-grade agents.
Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO on 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Fully Open Framework for Democratized Multimodal Training
OSS RL environment + evals toolkit
A higher-performance OpenAI LLM service than vLLM serve: a pure C++ implementation built with GPRS + TensorRT-LLM + Tokenizers.cpp, supporting chat and function calling, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.
PyTorch DTensor-native training library for LLMs/VLMs with out-of-the-box Hugging Face support
Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.
A notification system that monitors the cryptocurrency trading activity of AI large models in the nof1.ai Alpha Arena
Generates improved prompts from a basic prompt
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling