vwxyzjn

Follow

😃

Costa Huang vwxyzjn

😃

Follow

Exploiting physical rewards @periodiclabs. Prev: RL @allenai @huggingface.

1.7k followers · 127 following

@huggingface
Philadelphia, PA
04:45 (UTC -05:00)
https://costa.sh
@vwxyzjn

Achievements

Achievements

Pinned Loading

allenai/open-instruct allenai/open-instruct Public

AllenAI's post-training codebase

Python 3.4k 466
lm-human-preference-details lm-human-preference-details Public

RLHF implementation details of OAI's 2019 codebase

Python 196 12
cleanrl cleanrl Public

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8.4k 906
ppo-implementation-details ppo-implementation-details Public

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 887 119
cleanba cleanba Public

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

Python 118 11
portwarden portwarden Public

Create Encrypted Backups of Your Bitwarden Vault with Attachments

Go 628 38