Starred repositories
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
The spatial IDE for recursive multi-agent orchestration. It's like an Obsidian graph-view that you work directly inside of.
A ComfyUI custom node suite for Qwen3-TTS, supporting 1.7B and 0.6B models, Custom Voice, Voice Design, Voice Cloning and Fine-Tuning.
A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
Suno AI's Bark model in C/C++ for fast text-to-speech generation
Proxy that allows you to use ollama as a copilot like Github copilot
My collection of skills for productivity and automation.
A TTS that fits in your CPU (and pocket)
SimpleMem: Efficient Lifelong Memory for LLM Agents
A comprehensive guide to running autonomous AI coding loops using Geoff Huntley's Ralph methodology. View as formatted guide below 👇
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Open Source AI Platform - AI Chat with advanced features that works with every LLM
A tiny LM that does inference entirely at compile time
Model Context Protocol Servers
Best practices for distilling large language models.
Harness the power of Docker, Python, and Ollama for streamlined image analysis with Ollama-Vision. Quick setup, GPU acceleration, and advanced processing in one package.
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
A curated list of skills, tools, tutorials, and capabilities for AI coding agents (Claude, Codex, Copilot, VS Code)
A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require e…
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Foundational Models for State-of-the-Art Speech and Text Translation