MikeWangWZHL

Pinned

  1. X-PLUG/MobileAgent (Public)

    Mobile-Agent: The Powerful GUI Agent Family

    Python, 7.6k stars, 772 forks

  2. Solo-Performance-Prompting (Public)

    Repo for the paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"

    Python, 349 stars, 32 forks

  3. VidIL (Public)

    PyTorch code for "Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners"

    Python, 116 stars, 1 fork

  4. VDLM (Public)

    Repo for the paper at https://arxiv.org/abs/2404.06479

    Python, 30 stars, 1 fork

  5. Paxion (Public)

    Repo for the paper "Paxion: Patching Action Knowledge in Video-Language Foundation Models" (NeurIPS 2023 Spotlight)

    Python, 37 stars, 1 fork

  6. PAPO (Public)

    Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"

    Python, 117 stars, 8 forks