Skip to content
View enjoyyi00's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report enjoyyi00

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. FoundationVision/VAR FoundationVision/VAR Public

    [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

    Jupyter Notebook 8.6k 553

  2. FoundationVision/Waver FoundationVision/Waver Public

    Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

    836 96

  3. FoundationVision/Infinity FoundationVision/Infinity Public

    [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    Python 1.5k 84

  4. FoundationVision/Liquid FoundationVision/Liquid Public

    (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators

    Python 636 34

  5. FoundationVision/Groma FoundationVision/Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python 582 45

  6. FoundationVision/UniTok FoundationVision/UniTok Public

    [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

    Python 494 10