Skip to content
View garrett4wade's full-sized avatar
  • Tsinghua University
  • Beijing, China

Block or report garrett4wade

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. inclusionAI/AReaL inclusionAI/AReaL Public

    Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

    Python 3.3k 264

  2. openpsi-project/ReaLHF openpsi-project/ReaLHF Public archive

    Super-Efficient RLHF Training of LLMs with Parameter Reallocation

    Python 330 21

  3. revisiting_marl revisiting_marl Public

    Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)

    Python 23 1

  4. cugae cugae Public

    CUDA implementation of Generalized Advantage Estimation (GAE)

    Python 4

  5. scaling_marl scaling_marl Public

    Python