An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
Updated Jul 25, 2025 - Python
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
End-to-end platform for building voice-first multimodal agents
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
Conversational voice AI agents
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
Low-latency AI companion voice chat in 60 lines of code, using faster_whisper and ElevenLabs input streaming
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)
The AI Podcast Studio: generate podcast scripts and their audio versions with a team of AI workers in a Podcast Studio 🎙️📜
Full Python wrapper for the ElevenLabs API.
Automatically generate engaging AI podcasts from nothing but an episode title.
Custom TTS Integration using ElevenLabs API
Acid Reflux for your Ears!
Hitchcock: a multi-agent movie maker, powered by mahilo
This chatbot lets you use your microphone to communicate with GPT-4. It responds with a voice using OpenAI text-to-speech, and it uses Pinecone to store long-term information and retrieve it as context. API keys for OpenAI and Pinecone are required. Tested on Windows.
Your own personal assistant, built with ChatGPT, Whisper, and ElevenLabs TTS
Imagine translating your speech or anybody's speech to any language you want within minutes. Check this out...
A production-ready voice agent implementation using LiveKit and Python, featuring advanced conversational AI capabilities and optional telephony integration. It provides intelligent turn detection, function calling, comprehensive logging, and audio enhancement.