[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Reproduction of semantic segmentation using a Masked Autoencoder (MAE)
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
[ECCV 2024] PyTorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling
Unofficial PyTorch implementation of Masked Autoencoders that Listen
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"
Implementation of the proposed LVMAE, from the paper "Extending Video Masked Autoencoders to 128 Frames", in PyTorch
Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)
[CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
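Most of the repositories above build on the same core recipe: randomly mask a large fraction of patch tokens, encode only the visible ones, and reconstruct the rest. A minimal NumPy sketch of that random-masking step is below (the function name `random_masking` and the 75% ratio follow common MAE convention; this is an illustrative sketch, not code from any repo listed here).

```python
import numpy as np

def random_masking(patches, mask_ratio=0.75, seed=0):
    """MAE-style random masking: keep a random subset of patch tokens.

    patches: (num_patches, dim) array of patch embeddings.
    Returns (visible, mask, restore_idx) where mask[i] == 1 marks a
    masked-out patch and restore_idx un-shuffles tokens for the decoder.
    """
    rng = np.random.default_rng(seed)
    n = patches.shape[0]
    n_keep = int(n * (1 - mask_ratio))

    # Shuffle patch indices; the first n_keep stay visible to the encoder.
    shuffle_idx = rng.permutation(n)
    keep_idx = shuffle_idx[:n_keep]
    visible = patches[keep_idx]

    # Binary mask in the original patch order (1 = masked).
    mask = np.ones(n, dtype=np.int64)
    mask[keep_idx] = 0

    # Inverse permutation: restores original order after the encoder,
    # so the decoder can insert mask tokens at the right positions.
    restore_idx = np.argsort(shuffle_idx)
    return visible, mask, restore_idx

# Example: 196 patches (a 14x14 grid) with 768-dim embeddings.
patches = np.zeros((196, 768))
visible, mask, restore_idx = random_masking(patches, mask_ratio=0.75)
print(visible.shape)  # (49, 768): only 25% of patches reach the encoder
```

The asymmetry this enables (a heavy encoder on 25% of tokens, a light decoder on all of them) is what makes MAE-style pretraining cheap; the video variants above (VideoMAE, LVMAE) apply the same idea over space-time tubes with even higher mask ratios.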