Demo video: Demo.mp4
This project provides a unified Gradio interface that supports both:
- Multi-Domain Image Classification using pre-trained specialized models
- Zero-Shot Image Classification using Google's SigLIP and SigLIP2 models
It lets users run image classification across a wide variety of tasks and experiment with open-vocabulary classification through a simple web interface.
The multi-domain mode supports 18+ domains, including:
- Age classification
- Gender classification
- Emotion detection
- Deepfake quality assessment
- Dog breed classification
- Waste classification
- Food classification (Indian/Western)
- Traffic density
- Leaf disease detection (rice)
- Alphabet sign language detection
- Gym workout pose classification
- Bird species classification
- Clip Art 126, Painting 126, Sketch 126
- MNIST and Fashion MNIST
- Multi-source 121 classification
Each model is integrated via its own module for efficient classification using domain-specific architectures.
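For illustration, here is a minimal sketch of what one such domain module might look like. The module layout matches the project structure shown below, but the function name and model ID are placeholders, not the repo's actual code:

```python
# gender_classification.py -- illustrative sketch of a domain module.
from PIL import Image
from transformers import pipeline

_pipe = None  # lazily initialized so the model downloads only on first use

def classify_gender(image: Image.Image) -> dict:
    """Return {label: probability}, the format Gradio's gr.Label expects."""
    global _pipe
    if _pipe is None:
        # "<gender-model-id>" is a placeholder; the repo pins its own checkpoint.
        _pipe = pipeline("image-classification", model="<gender-model-id>")
    return {r["label"]: r["score"] for r in _pipe(image)}
```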
Zero-shot classification uses two open-vocabulary models:
- `google/siglip-so400m-patch14-384`
- `google/siglip2-so400m-patch14-384`
How it works:
- Accepts a user-uploaded image
- Takes a comma-separated list of labels
- Compares prediction probabilities using both SigLIP and SigLIP2 models
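Under the hood, SigLIP scores each image–label pair with an independent sigmoid rather than a softmax over labels. A minimal sketch of that computation with one of the checkpoints above, following standard Hugging Face usage (the image path and prompt template are illustrative):

```python
import torch
from PIL import Image
from transformers import AutoModel, AutoProcessor

ckpt = "google/siglip-so400m-patch14-384"  # same pattern applies to the SigLIP2 checkpoint
model = AutoModel.from_pretrained(ckpt)
processor = AutoProcessor.from_pretrained(ckpt)

image = Image.open("example.jpg")           # illustrative path
labels = ["cat", "dog", "horse"]            # from the comma-separated user input
texts = [f"a photo of a {l}" for l in labels]

inputs = processor(text=texts, images=image, padding="max_length", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits_per_image  # shape: (1, num_labels)
probs = torch.sigmoid(logits)[0]               # SigLIP uses sigmoid, not softmax
print({l: round(p.item(), 4) for l, p in zip(labels, probs)})
```

Running the same code against the SigLIP2 checkpoint and putting the two probability dicts side by side is essentially what the comparison view shows.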
Clone the repository:

```bash
git clone https://github.com/PRITHIVSAKTHIUR/SigLIP2-MultiDomain-App.git
cd SigLIP2-MultiDomain-App
```

Install the required packages with pip:

```bash
pip install -r requirements.txt
```
A minimal requirements.txt might include:

```
torch
transformers
gradio
Pillow
```
Run the app:

```bash
python app.py
```

This launches a Gradio interface in your default web browser.
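As a rough idea of how app.py might wire the domain modules to the UI, here is a simplified sketch; the imports and function names are hypothetical, matching the module layout shown under the project structure below:

```python
import gradio as gr
# Hypothetical imports; one classify function per domain module
from gender_classification import classify_gender
from dog_breed import classify_dog_breed

TASKS = {
    "Gender": classify_gender,
    "Dog Breed": classify_dog_breed,
    # ... one entry per domain module
}

def run(task_name, image):
    # Dispatch the uploaded image to the selected domain classifier
    return TASKS[task_name](image)

demo = gr.Interface(
    fn=run,
    inputs=[gr.Dropdown(choices=list(TASKS), label="Task"), gr.Image(type="pil")],
    outputs=gr.Label(num_top_classes=5),
    title="Multi-Domain / Zero-Shot Image Classification",
)

if __name__ == "__main__":
    demo.launch()
```

The actual app exposes the task list in a sidebar rather than a dropdown, but the dispatch pattern is the same.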
Multi-domain classification:
- Use the sidebar to choose your desired classification task (e.g., Age, Dog Breed, Deepfake).
- Upload an image.
- Click "Classify / Predict" to see the result.

Zero-shot classification:
- Upload an image.
- Enter labels separated by commas (e.g., "cat, dog, horse").
- Click "Run" to compare SigLIP and SigLIP2 outputs.
Project structure:

```
├── app.py                     # Main Gradio app
├── gender_classification.py   # Domain model modules
├── emotion_classification.py
├── dog_breed.py
├── deepfake_quality.py
├── ...
├── sketch_126.py
└── requirements.txt
```
Models:
- Domain-specific models (e.g., CNNs, ViTs) hosted locally or on the Hugging Face Hub
- SigLIP / SigLIP2 models from Google (used for zero-shot inference)
This project is licensed under the MIT License. See the LICENSE file for details.