Skip to content
View vwxyzjn's full-sized avatar
😃
😃

Block or report vwxyzjn

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. allenai/open-instruct allenai/open-instruct Public

    AllenAI's post-training codebase

    Python 3.4k 466

  2. lm-human-preference-details lm-human-preference-details Public

    RLHF implementation details of OAI's 2019 codebase

    Python 196 12

  3. cleanrl cleanrl Public

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

    Python 8.4k 906

  4. ppo-implementation-details ppo-implementation-details Public

    The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

    Python 887 119

  5. cleanba cleanba Public

    CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

    Python 118 11

  6. portwarden portwarden Public

    Create Encrypted Backups of Your Bitwarden Vault with Attachments

    Go 628 38