machuofan

Pinned

  1. CVMI-Lab/ResKD Public

    [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".

    Python · 31 stars · 1 fork

  2. CVMI-Lab/CoDet Public

    [NeurIPS 2023] CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

    Python · 123 stars · 6 forks

  3. FoundationVision/Groma Public

    [ECCV 2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python · 582 stars · 45 forks

  4. FoundationVision/UniTok Public

    [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

    Python · 494 stars · 10 forks