sudheesh4

Pinned

  1. -POC-VLM Public

    A PyTorch implementation of a Vision-Language Model that aligns image and text representations in a shared embedding space using contrastive learning.

    Python

  2. EBT_test Public

    A minimal PyTorch demo of an energy-based model on MNIST/Fashion-MNIST using a frozen Vision Transformer backbone (DINOv2).

    Python

  3. People_Anonymizer Public

    [TESTING] A tool that automatically detects and segments people in videos using DETR and SAM 2, then replaces their appearances with distinct solid colors, effectively anonymizing individuals while…

    Python

  4. Query_Video_OpenCLIP Public

    [TESTING] Uses OpenCLIP to analyze videos and measure how well their visual content matches one or more text prompts.

    Python

  5. Mini_JEPA_TEST Public

    A lightweight JEPA-based demo that learns to fill in masked image regions using a webcam feed.

  6. ChittiAssist Public

    A simple cloud-based Android assistant (powered by Google Gemini/PaLM and LangChain) served through a Flask-based server. Query with text or an image.

    Java