🚩 Revolutionizing.
  • Beet Labs
  • Rio de Janeiro

saulocatharino/README.md

Saulo Catharino

Pinned

  1. Video-LLaMA Public

    Forked from DAMO-NLP-SG/Video-LLaMA

    Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

    Python

  2. VisionLLM Public

    Forked from OpenGVLab/VisionLLM

    VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

  3. Voice-Identification Public

    Forked from AKBoles/Voice-Identification

    Project to explore speaker and voice identification. Further speech recognition tasks will follow.

    Jupyter Notebook 1

  4. whisper Public

    Forked from openai/whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    Python

  5. YOLOX Public

    Forked from Megvii-BaseDetection/YOLOX

    YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

    Python