Shravan: AI YouTube Video Processing with Speech-to-Text, Q&A, and Translation

📌 Overview

Shravan is an AI-powered tool designed to process YouTube videos by converting speech to text, translating transcripts, summarizing content, and enabling contextual Q&A. This project leverages on Speech Recognition, Advance NLP, Hugging Face, and Seq2Seq LLMs

✨ Features

🎤 Video-to-Audio Conversion: Extracts audio from locally downloaded YouTube videos.
📝 Speech-to-Text Transcription: Converts audio chunks into text using PyDub and Wave2Vec.
🌍 Translation Support: Translates transcripts into a user-specified language.
📄 Summarization: Provides concise summaries of the original transcript.
🤖 Contextual Q&A: Uses FAISS + FLAN-T5 to allow users to ask questions about the video content.
⚡ Fast and Efficient: Optimized pipeline for processing large video files quickly.

🛠️ Installation & Setup

1️⃣ Clone the Repository

git clone https://github.com/sagarvk24/EchoTranscribe-AI-YouTube-Video-Processing-with-Speech-to-Text-Q-A-and-Translation.git
cd EchoTranscribe

📚 Technologies Used

🤗 Hugging Face Transformers (For Q&A & Summarization)
🎙️ PyDub, HuggingSound and Wave2Vec (ASR and Speech-To-Text)
📦 FAISS (Vector Database for efficient Q&A)
🌍 Google Translate API (Text Translation)
🏗 PyTorch (Model Inference)

🤝 Contributing

Contributions are welcome! Feel free to fork the repo, create a branch, and submit a pull request.

git checkout -b feature-branch

📜 License

This project is licensed under the MIT License. See LICENSE for details.

📩 Contact

For any queries, reach out via LinkedIn or open an issue on GitHub.

🚀 Happy Transcribing & Exploring!

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
EchoTranscribe - AI YouTube Video Processing with Speech-to-Text, Q&A, and Translation.ipynb		EchoTranscribe - AI YouTube Video Processing with Speech-to-Text, Q&A, and Translation.ipynb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Shravan: AI YouTube Video Processing with Speech-to-Text, Q&A, and Translation

📌 Overview

✨ Features

🛠️ Installation & Setup

1️⃣ Clone the Repository

📚 Technologies Used

🤝 Contributing

📜 License

📩 Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

sagarvk24/Shravan-AI-YouTube-Video-Processing-with-Speech-to-Text-Q-A-and-Translation

Folders and files

Latest commit

History

Repository files navigation

Shravan: AI YouTube Video Processing with Speech-to-Text, Q&A, and Translation

📌 Overview

✨ Features

🛠️ Installation & Setup

1️⃣ Clone the Repository

📚 Technologies Used

🤝 Contributing

📜 License

📩 Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages