Skip to content
View lyuwenyu's full-sized avatar
👀
👀
  • Harbin Institute of Technology
  • Beijing, China

Block or report lyuwenyu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
lyuwenyu/README.md

👋 Hi there, I'm lyuwenyu

🌱 Interests: computer vision and multimodal large language model

🔭 Publications: Google scholar

📬 Reach out to me: lyuwenyu@foxmail.com

Pinned Loading

  1. RT-DETR RT-DETR Public

    [CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

    Python 3.6k 413

  2. PP-InsCapTagger PP-InsCapTagger Public

    Instance Capability Tagger(InsCapTagger) is a multimodal data capability tagging model. 多模态数据能力标签模型,可用于图文数据分析和处理(e.g. 基于信息密度的数据过滤���案、基于模型能力的数据配比方案)。 🔥 🔥 🔥

    8

  3. PaddlePaddle/PaddleDetection PaddlePaddle/PaddleDetection Public

    Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

    Python 13.4k 2.9k

  4. PaddlePaddle/PaddleMIX PaddlePaddle/PaddleMIX Public

    Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

    Python 626 209