Pinned Loading
-
-POC-VLM
-POC-VLM PublicA PyTorch implementation of a Vision-Language Model that aligns image and text representations in a shared embedding space using contrastive learning.
Python
-
EBT_test
EBT_test PublicA minimal PyTorch demo of Energy-Based model on MNIST/Fashion-MNIST using a frozen Vision Transformer backbone (DINOv2).
Python
-
People_Anonymizer
People_Anonymizer Public[TESTING] A tool that automatically detects and segments people in videos using DETR and SAM 2, then replaces their appearances with distinct solid colors, effectively anonymizing individuals while…
Python
-
Query_Video_OpenCLIP
Query_Video_OpenCLIP Public[TESTING] Use OpenCLIP to analyze videos and measure how well their visual content matches one or more text prompts
Python
-
Mini_JEPA_TEST
Mini_JEPA_TEST PublicA lightweight JEPA-based demo that learns to fill in masked image regions using a webcam feed.
-
ChittiAssist
ChittiAssist PublicA simple android cloud-based assistant (Google-Gemini/PaLM + Langchain powered) through FLASK based server. Query with text or image.
Java
If the problem persists, check the GitHub status page or contact support.