sudheesh4

sudheesh4

Pinned Loading

-POC-VLM -POC-VLM Public

A PyTorch implementation of a Vision-Language Model that aligns image and text representations in a shared embedding space using contrastive learning.

Python
EBT_test EBT_test Public

A minimal PyTorch demo of Energy-Based model on MNIST/Fashion-MNIST using a frozen Vision Transformer backbone (DINOv2).

Python
People_Anonymizer People_Anonymizer Public

[TESTING] A tool that automatically detects and segments people in videos using DETR and SAM 2, then replaces their appearances with distinct solid colors, effectively anonymizing individuals while…

Python
Query_Video_OpenCLIP Query_Video_OpenCLIP Public

[TESTING] Use OpenCLIP to analyze videos and measure how well their visual content matches one or more text prompts

Python
Mini_JEPA_TEST Mini_JEPA_TEST Public

A lightweight JEPA-based demo that learns to fill in masked image regions using a webcam feed.
ChittiAssist ChittiAssist Public

A simple android cloud-based assistant (Google-Gemini/PaLM + Langchain powered) through FLASK based server. Query with text or image.

Java