Skip to content
/ OCR Public

This project contains the text extracted from images using Optical Character Recognition (OCR). The extracted text is saved in a text file with the same name as the input image.

Notifications You must be signed in to change notification settings

itsfuad/OCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Image to Text Conversion using OCR

This project contains the text extracted from images using Optical Character Recognition (OCR). The extracted text is saved in a text file with the same name as the input image.

Requirements

To run the OCR processing, you need the following Python packages:

  • pillow
  • pytesseract
  • opencv-python
  • numpy

You can install these packages using the following command:

pip install -r requirements.txt

Project Structure

  • main.py: The main script to process images and extract text.
  • requirements.txt: List of required Python packages.
  • output/: Directory containing the extracted text files.
  • target/: Directory containing the input images to be processed.

Usage

  1. Ensure you have Tesseract OCR installed on your system. You can download it from here.

  2. Place the images you want to process in the target/ directory.

  3. Run the main.py script:

python main.py
  1. The extracted text will be saved in the output/ directory with the same name as the input image but with a .txt extension.

License

This project is licensed under the MIT License.

About

This project contains the text extracted from images using Optical Character Recognition (OCR). The extracted text is saved in a text file with the same name as the input image.

Resources

Stars

Watchers

Forks

Languages