Yihe Deng yihedeng9

Personal website: https://yihe-deng.notion.site/Yihe-Deng-167ab2d2c1fb80b3a76dfb120f716c84

47 followers · 6 following

Pinned

rlhf-summary-notes Public

A brief and partial summary of RLHF algorithms.

135 stars · 3 forks

OpenVLThinker Public

OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement

Python · 116 stars · 6 forks

DuoGuard Public

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Python · 27 stars · 4 forks

STIC Public

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python · 70 stars · 4 forks

uclaml/SPIN Public

The official implementation of Self-Play Fine-Tuning (SPIN)

Python · 1.2k stars · 102 forks

uclaml/PDE Public

Official repo of Progressive Data Expansion: data, code and evaluation

Jupyter Notebook · 29 stars · 1 fork