You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Extract text from PDFs using Google Vision API. This script converts PDF pages to images, preprocesses them for OCR accuracy, and uses Google Vision API for text extraction. It supports parallel processing for efficiency and saves extracted text in a structured format for each PDF.
A comprehensive privacy protection system that detects and redacts Personally Identifiable Information (PII) from Indian documents and images. Utilizes advanced OCR technology and machine learning models to identify sensitive data including Aadhaar numbers, PAN cards, names, emails, phone numbers, and addresses, ensuring compliance with Indian data