A modern Flutter application that leverages Google Vertex AI for Optical Character Recognition (OCR) and Text-to-Speech (TTS) capabilities. Instantly extract text from images and have it read aloud, with support for multiple languages.
- Text Recognition: Extract text from images using Google Vertex AI OCR.
- Text-to-Speech: Listen to recognized text in various languages.
- Image Viewer: Fullscreen image preview with zoom and pan.
- Language Selection: Easily switch between supported languages for TTS.
- Modern UI: Clean, responsive, and easy-to-use interface.
- Flutter SDK
- Dart SDK
- Google Cloud Project with Vertex AI enabled
- Firebase project (for authentication, if used)
- Clone the repository:
git clone <your-repo-url>
cd vertex-ai-ocr-tts
- Install dependencies:
flutter pub get
- Configure Firebase & Google Services:
  - Place your google-services.json in android/app/.
  - Place your GoogleService-Info.plist in ios/Runner/.
  - Update lib/firebase_options.dart as needed.
- Run the app:
flutter run
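Before running the app, it can help to confirm that the Firebase configuration files from the configuration step are where the build expects them. The sketch below is a hypothetical helper (the name `check_firebase_config` is an assumption and not part of this repository); it only checks for the three paths listed above and prints any that are missing.

```shell
#!/bin/sh
# Hypothetical preflight helper (not part of the repository).
# Reports which Firebase config files are missing under the given
# project root (defaults to the current directory).
# Returns 0 if all files exist, 1 otherwise.
check_firebase_config() {
  root="${1:-.}"
  missing=0
  for f in \
    android/app/google-services.json \
    ios/Runner/GoogleService-Info.plist \
    lib/firebase_options.dart
  do
    if [ ! -f "$root/$f" ]; then
      echo "missing: $f"
      missing=1
    fi
  done
  return "$missing"
}
```

From the repository root, `check_firebase_config && flutter run` would start the app only when all three files are present.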
lib/
  main.dart     # App entry point
  screens/      # UI screens (home, result, fullscreen viewer)
  services/     # Vertex AI, TTS, translation logic
  utils/        # App theming, text detection helpers
  widgets/      # Reusable UI components
- Select or capture an image.
- Extract text using Vertex AI OCR.
- Listen to the recognized text with TTS.
- Switch languages as needed.
Pull requests are welcome! For major changes, please open an issue first to discuss what you would like to change.
The lib/firebase_options.dart file is auto-generated by the Firebase CLI and contains your project's Firebase configuration, including API keys and identifiers. This file is sensitive and should never be committed to version control.
- If you need to regenerate it, run:
flutterfire configure
- Always ensure this file is listed in your .gitignore.
- If accidentally committed, rotate your API keys and update the file.
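One way to apply the .gitignore advice above is sketched here. The helper name `ensure_ignored` is hypothetical; it appends an entry to a .gitignore file only when that exact line is not already present, so it is safe to run more than once.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository).
# Adds an entry to a .gitignore file unless that exact line already exists.
# $1: entry to ignore; $2: ignore file (defaults to .gitignore).
ensure_ignored() {
  entry="$1"
  ignore_file="${2:-.gitignore}"
  touch "$ignore_file"
  if ! grep -qxF "$entry" "$ignore_file"; then
    # If the file does not end with a newline, add one first so the
    # new entry does not get fused onto the last existing line.
    if [ -s "$ignore_file" ] && [ -n "$(tail -c 1 "$ignore_file")" ]; then
      echo >> "$ignore_file"
    fi
    printf '%s\n' "$entry" >> "$ignore_file"
  fi
}

# Usage, from the repository root:
#   ensure_ignored lib/firebase_options.dart
#   git rm --cached lib/firebase_options.dart   # stop tracking, keep the local file
```

Note that `git rm --cached` only stops tracking the file going forward. Earlier commits still contain it, which is why rotating the keys is still recommended after an accidental commit.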
For more details, see the FlutterFire documentation.
Made with ❤️ by Execute to mark the end of Semester 4