siglip2

aimagelab / LLaVA-MORE

LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

vision-and-language llms llava siglip multimodal-llms llama3 llava-llama3 llama3-vision gemma-2 llama3-1 deepseek-r1 siglip2

Updated Apr 24, 2025
Python

PRITHIVSAKTHIUR / deepfake-detector-model-v1

Star

deepfake-detector-model-v1 is a vision-language encoder model fine-tuned from siglip2-base-patch16-512 for binary deepfake image classification. It is trained to detect whether an image is real or generated using synthetic media techniques. The model uses the SiglipForImageClassification architecture.

google detection transformer image-classification gradio deep-fake huggingface-transformers vision-transformer siglip2

Updated May 30, 2025
Python

PRITHIVSAKTHIUR / Facial-Emotion-Detection-SigLIP2

Star

Facial-Emotion-Detection-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224

image-classification emotion-analysis emotion-detection emotion-recognition huggingface-transformers siglip2

Updated Apr 9, 2025
Python

PRITHIVSAKTHIUR / Age-Classification-SigLIP2

Star

Age-Classification-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to predict the age group of a person from an image using the SiglipForImageClassification architecture.

google vit age-detection huggingface-transformers vision-transformer huggingface-models siglip2

Updated Mar 28, 2025
Python

PRITHIVSAKTHIUR / Watermark-Detection-SigLIP2

Star

Watermark-Detection-SigLIP2 is a vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for binary image classification. It is trained to detect whether an image contains a watermark or not, using the SiglipForImageClassification architecture.

detection image-classification gradio watermark huggingface-transformers vision-transformer siglip2

Updated May 1, 2025
Python

PRITHIVSAKTHIUR / Augmented-Waste-Classifier-SigLIP2

Star

Augmented-Waste-Classifier-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224

google image-classification vit hazard-detection hazard-assessment waste-management huggingface-transformers siglip2

Updated May 25, 2025
Python

PRITHIVSAKTHIUR / siglip2-mini-explicit-content

Star

siglip2-mini-explicit-content is an image classification vision-language encoder model fine-tuned from siglip2-base-patch16-512 for a single-label classification task. It is designed to classify images into categories related to explicit, sensual, or safe-for-work content using the SiglipForImageClassification architecture.

google image-classification gradio nsfw-recognition nsfw-filter nsfw-classifier nsfw-detection huggingface-transformers siglip2

Updated May 20, 2025
Python

PRITHIVSAKTHIUR / Human-Action-Recognition

Star

Human-Action-Recognition is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for multi-class human action recognition. It uses the SiglipForImageClassification architecture to predict human activities from still images.

recognition human action huggingface-transformers siglip2

Updated Apr 11, 2025
Python

PRITHIVSAKTHIUR / SigLIP2-MultiDomain-App

Star

SigLIP2 is a vision-language encoder model fine-tuned from google/siglip2-base-patch16-224

encoder image-classification gradio multidomain huggingface-transformers vision-language-model siglip2

Updated May 1, 2025
Python

PRITHIVSAKTHIUR / Anime-Classification-v0.1

Star

Anime-Classification-v1.0 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify anime-related images using the SiglipForImageClassification architecture.

google anime image-classification gradio huggingface huggingface-transformers siglip2 anime-type

Updated Apr 20, 2025
Python

PRITHIVSAKTHIUR / nsfw-image-detection

Star

nsfw-image-detection is a vision-language encoder model fine-tuned from siglip2-base-patch16-256 for multi-class image classification. Built on the SiglipForImageClassification architecture, the model is trained to identify and categorize content types in images, especially for explicit, suggestive, or safe media filtering.

google gradio nsfw nsfw-recognition nsfw-data nsfw-classifier nsfw-detection huggingface-transformers vision-transformer siglip2

Updated May 12, 2025
Python

PRITHIVSAKTHIUR / x-bot-profile-detection

Star

x-bot-profile-detection is a SigLIP2-based classification model designed to detect profile authenticity types on social media platforms (such as X/Twitter). It categorizes a profile image into four classes: bot, cyborg, real, or verified. Built on google/siglip2-base-patch16-224.

bot twitter detection torch image-classification x gradio huggingface-transformers siglip2

Updated May 3, 2025
Python

PRITHIVSAKTHIUR / Mnist-Digits-SigLIP2

Star

classify handwritten digits (0-9)

numbers mnist-classification image-classification vit gradio digits-recognition digits-classification huggingface-transformers siglip2 0-9

Updated Mar 28, 2025
Python

jesus3476 / Fire-Detection-Siglip2

Star

Fire-Detection-Siglip2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to detect fire, smoke, or normal conditions using the SiglipForImageClassification architecture.

google smoke image-classification llama vit normal fire-detection huggingface huggingface-transformers siglip siglip2

Updated Jun 2, 2025
Python

PRITHIVSAKTHIUR / Fashion-Mnist-SigLIP2

Star

Fashion-Mnist-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images into Fashion-MNIST categories using the SiglipForImageClassification architecture.

google image-classification clothing fashion-mnist huggingface-transformers vision-transformer siglip2

Updated Mar 21, 2025
Python

PRITHIVSAKTHIUR / Multilabel-GeoSceneNet

Star

Multilabel-GeoSceneNet is a vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for multi-label image classification. It is designed to recognize and label multiple geographic or environmental elements in a single image using the SiglipForImageClassification architecture.

map geospatial landscape spaces gradio huggingface-transformers hugging-face siglip vision-encoder siglip2 geoscenenet

Updated Apr 23, 2025
Python

PRITHIVSAKTHIUR / Deepfake-vs-Real-8000

Star

Deepfake vs Real is a dataset designed for image classification, distinguishing between deepfake and real images.

detection vit deepfake vision-transformer siglip2

Updated Mar 27, 2025
Python

PRITHIVSAKTHIUR / open-deepfake-detection

Star

open-deepfake-detection is a vision-language encoder model fine-tuned from siglip2-base-patch16-512 for binary image classification. It is trained to detect whether an image is fake or real using the OpenDeepfake-Preview dataset. The model uses the SiglipForImageClassification architecture.

google image-classification image-recognition gradio deepfake-detection huggingface-transformers siglip2

Updated May 22, 2025
Python

PRITHIVSAKTHIUR / Face-Mask-Detection

Star

Face-Mask-Detection is a binary image classification model based on google/siglip2-base-patch16-224, trained to detect whether a person is wearing a face mask or not. This model can be used in public health monitoring, access control systems, and workplace compliance enforcement.

gradio face-mask-detection facemask-detection huggingface-transformers face-mask-classification vision-transformer siglip2

Updated May 12, 2025
Python

PRITHIVSAKTHIUR / Food-101-93M

Star

Food-101-93M is a fine-tuned image classification model built on top of google/siglip2-base-patch16-224 using the SiglipForImageClassification architecture. It is trained to classify food images into one of 101 popular dishes, derived from the Food-101 dataset.

food image-classification huggingface-transformers vision-transformer siglip2

Updated Apr 7, 2025
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Here are 54 public repositories matching this topic...

aimagelab / LLaVA-MORE

PRITHIVSAKTHIUR / deepfake-detector-model-v1

PRITHIVSAKTHIUR / Facial-Emotion-Detection-SigLIP2

PRITHIVSAKTHIUR / Age-Classification-SigLIP2

PRITHIVSAKTHIUR / Watermark-Detection-SigLIP2

PRITHIVSAKTHIUR / Augmented-Waste-Classifier-SigLIP2

PRITHIVSAKTHIUR / siglip2-mini-explicit-content

PRITHIVSAKTHIUR / Human-Action-Recognition

PRITHIVSAKTHIUR / SigLIP2-MultiDomain-App

PRITHIVSAKTHIUR / Anime-Classification-v0.1

PRITHIVSAKTHIUR / nsfw-image-detection

PRITHIVSAKTHIUR / x-bot-profile-detection

PRITHIVSAKTHIUR / Mnist-Digits-SigLIP2

jesus3476 / Fire-Detection-Siglip2

PRITHIVSAKTHIUR / Fashion-Mnist-SigLIP2

PRITHIVSAKTHIUR / Multilabel-GeoSceneNet

PRITHIVSAKTHIUR / Deepfake-vs-Real-8000

PRITHIVSAKTHIUR / open-deepfake-detection

PRITHIVSAKTHIUR / Face-Mask-Detection

PRITHIVSAKTHIUR / Food-101-93M

Improve this page

Add this topic to your repo