YouTube Subtitle Scraper

This project is a YouTube subtitle scraper that extracts subtitle text from YouTube videos. It sends a POST request to savesubs.com api.

Features

Sends POST requests to an API endpoint to extract subtitle URLs.
Cleans subtitle text by removing newlines, extra spaces, and special characters.
Handles retries and logs errors for failed requests.

Installation

Clone the repository:

git clone https://github.com/faisal-fida/youtube-subtitle-scraper.git

Navigate to the project directory:
```
cd youtube-subtitle-scraper
```
Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Run the scraper by executing the app.py file:
```
python app.py
```
The script will print the scraped data, including the title, duration, uploader, and cleaned subtitles.

Example

from scraper import run_scraper

yt_url = "https://www.youtube.com/watch?v=3ckGtkuflsM"

yt_data = run_scraper(yt_url)

if yt_data:
    print(yt_data)

Logging

The scraper uses Python's built-in logging module to log information, warnings, and errors. Logs are printed to the console for easy debugging.

Contributing

Contributions are welcome! Please open an issue or submit a pull request for any improvements or bug fixes.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
app.py		app.py
config.py		config.py
scraper.py		scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

YouTube Subtitle Scraper

Features

Installation

Usage

Example

Logging

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

faisal-fida/youtube-subtitle-scraper

Folders and files

Latest commit

History

Repository files navigation

YouTube Subtitle Scraper

Features

Installation

Usage

Example

Logging

Contributing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages