Edubase Downloader + OCR

Back up and download your purchased Edubase books as PDFs, because nobody likes proprietary ebook readers.

Setup (on Windows)

git clone https://github.com/tobiaswuerth/edubase-downloader.git
cd edubase-downloader
py -m venv .venv
.\.venv\Scripts\activate
pip install -r requirements.txt

Setup OCR (if necessary):

Download and install Tesseract from https://github.com/UB-Mannheim/tesseract/wiki
Add the installation directory to the PATH system environment variable (so that tesseract.exe can be found)
Open https://github.com/tesseract-ocr/tessdata/
- Download the languages you intend to use:
  - Note: the code is set up for German (Deutsch) and English; if you need something else, adjust the default in ocr.py
  - Note: English is available by default and needs no additional download
  - For German, download deu.traineddata
- Put the files into the tessdata sub-directory of the installation (e.g. \Tesseract-OCR\tessdata\deu.traineddata)
Download and install Ghostscript from https://ghostscript.com/releases/gsdnld.html
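
To confirm that the PATH changes took effect, a small check like the following can help. It is only a sketch and not part of this repo: the function name find_ocr_tools is made up for illustration, and the Ghostscript executable names (gswin64c, gswin32c, gs) are the usual ones but are an assumption about your installation.

```python
import shutil

# Candidate executable names per tool (assumption: standard installer names).
# Ghostscript ships as gswin64c/gswin32c on Windows and gs on Unix-like systems.
TOOLS = {
    "tesseract": ["tesseract"],
    "ghostscript": ["gswin64c", "gswin32c", "gs"],
}


def find_ocr_tools(tools=TOOLS):
    """Return {tool name: resolved path or None} for each OCR dependency."""
    found = {}
    for name, candidates in tools.items():
        # shutil.which searches PATH the same way the shell would.
        paths = (shutil.which(exe) for exe in candidates)
        found[name] = next((p for p in paths if p), None)
    return found


if __name__ == "__main__":
    for name, path in find_ocr_tools().items():
        print(f"{name}: {path or 'NOT FOUND on PATH'}")
```

If Tesseract is found, running tesseract --list-langs in a terminal should list deu once deu.traineddata is in the tessdata directory.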

Usage

Update the config.yaml file with your Edubase login credentials
Run py .\main.py

This will:

1. Open a new browser window
2. Log in using your credentials
3. Find all your books
4. Let you choose which book to download
5. Download the PDF to the /downloads/ directory
6. OCR the PDF (if OCR is set up correctly)
7. Ask whether to download another one; if so, go back to step 4 and repeat
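
The choose, download, repeat part of this flow can be pictured roughly as below. This is an illustrative sketch, not the code in main.py: choose_book, run, and the download callback are hypothetical names, and the callback merely stands in for the real download and OCR steps.

```python
def choose_book(titles, ask=input):
    """Print the titles and prompt until a valid 1-based number is entered."""
    for number, title in enumerate(titles, start=1):
        print(f"{number}. {title}")
    while True:
        answer = ask("Book number: ").strip()
        try:
            index = int(answer)
        except ValueError:
            index = 0  # not a number: treat as out of range
        if 1 <= index <= len(titles):
            return titles[index - 1]
        print(f"Please enter a number between 1 and {len(titles)}.")


def run(titles, download, ask=input):
    """Repeat choose -> download until the user declines; return processed titles."""
    done = []
    while True:
        title = choose_book(titles, ask)
        download(title)  # hypothetical callback: download the PDF, then OCR it
        done.append(title)
        if ask("Download another one? [y/N] ").strip().lower() != "y":
            return done
```

Passing ask as a parameter keeps the loop testable without a real terminal; in normal use the default input is enough.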

Legal

This code does not "crack" any copy protection. It simply takes automated screenshots of every single page of a purchased document/book. Since Edubase cannot guarantee the existence of, or support for, their reader app if they go out of business, this repo was created to save, back up and preserve purchased Edubase documents in the widely adopted and well-documented PDF format.
