Stars
Ongoing research training transformer models at scale
Machine Learning and Computer Vision Engineer - Technical Interview Questions
Lists of company-wise questions available on LeetCode Premium. Every CSV file in the companies directory corresponds to a list of questions on LeetCode for a specific company based on the leetcode …
Customized Inference Engine for Multiverse Models
Practicing algorithm problems is all about patterns; labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Retrieval and Retrieval-augmented LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) …
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Official inference repo for FLUX.1 models
Official repository of In-Context LoRA for Diffusion Transformers
A generative world for general-purpose robotics & embodied AI learning.
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
[ECCV 2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Official implementations for paper: Anydoor: zero-shot object-level image customization
🚀 Cross attention map tools for huggingface/diffusers
High-resolution models for human tasks.
AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for…
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance


