Starred repositories
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
The spatial IDE for recursive multi-agent orchestration. It's like an Obsidian graph-view that you work directly inside of.
A ComfyUI custom node suite for Qwen3-TTS, supporting 1.7B and 0.6B models, Custom Voice, Voice Design, Voice Cloning and Fine-Tuning.
A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
Suno AI's Bark model in C/C++ for fast text-to-speech generation
Proxy that allows you to use ollama as a copilot like Github copilot
My collection of skills for productivity and automation.
A TTS that fits in your CPU (and pocket)
SimpleMem: Efficient Lifelong Memory for LLM Agents
A comprehensive guide to running autonomous AI coding loops using Geoff Huntley's Ralph methodology. View as formatted guide below 👇
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Open Source AI Platform - AI Chat with advanced features that works with every LLM
A tiny LM that does inference entirely at compile time
gary149 / llama-agent
Forked from ggml-org/llama.cppAgents in llama.cpp
Model Context Protocol Servers
jim-plus / llm-abliteration
Forked from Orion-zhen/abliterationMake abliterated models with transformers, easy and fast
Best practices for distilling large language models.
Harness the power of Docker, Python, and Ollama for streamlined image analysis with Ollama-Vision. Quick setup, GPU acceleration, and advanced processing in one package.
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
A curated list of skills, tools, tutorials, and capabilities for AI coding agents (Claude, Codex, Copilot, VS Code)
A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require e…
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.