Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.
Updated Mar 11, 2025 - Python
A multimodal chat interface with many tools.
A VLM-driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
A chatbot UI for RAG, multimodal, and text completion. (Supports Transformers, llama.cpp, MLX, vLLM.)
AUTOMATIC1111: Software for tensor operations, saving tensor data in .safetensors format. ComfyUI: UI library, possibly managing tensor data safely with *.safetensors. InvokeAI: ML platform using *.safetensors for secure tensor storage.
Play with Orpheus TTS, a Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been fine-tuned to deliver human-level speech synthesis 🔥🗣️
A journalist that knows lots of news about AI!📰💻
A Gradio app that uses Gemini to transcribe audio and summarize it in Thai governmental format
This project is a multi-agent customer service chatbot designed for an e-commerce platform. The chatbot employs specialized agents that handle distinct tasks to ensure efficient and accurate interactions. It aims to enhance user experience by streamlining order processing, answering FAQs, and providing personalized recommendations.
EduAI is a Python chatbot that enhances programming learning with abundant resources, text-to-speech accessibility, and interactive Q&A, fostering universal programming knowledge access.
Converse effortlessly in more than 50 languages!
Chatbot - Your Personal Culinary Advisor: Discover What to Cook Next!
SAT-Landforms-Classifier is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify satellite images into different landform categories using the SiglipForImageClassification architecture.
Build amazing AI and RAG-powered applications, plain and simple🪂
A web-based application that generates descriptive captions for uploaded images using Hugging Face’s "Salesforce/blip-image-captioning-large" model. Built with Gradio and deployed on Hugging Face Spaces, the app provides a simple interface for transforming images into meaningful text descriptions.
A Dify plugin for semantic search across 110 million academic publications, powered by abstracts-search.
AI-Assisted Code Generation with CodeGen and Gradio
Implement LLMs from scratch.