Skip to content
#

vibevoice-microsoft

Here are 7 public repositories matching this topic...

Language: All
Filter by language

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.

  • Updated Oct 2, 2025
  • Python

A Gradio-based demo for end-to-end vision-to-speech inference: Extract text or descriptions from images using Qwen2.5-VL-7B-Instruct, then convert to natural speech audio via Microsoft VibeVoice-Realtime-0.5B.

  • Updated Dec 8, 2025
  • Python

Improve this page

Add a description, image, and links to the vibevoice-microsoft topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vibevoice-microsoft topic, visit your repo's landing page and select "manage topics."

Learn more