ZhengShenghan

Pinned

  1. github/codeql (Public)

    CodeQL: the libraries and queries that power security researchers around the world, as well as code scanning in GitHub Advanced Security

    CodeQL · 9.8k stars · 2k forks

  2. harbor-framework/harbor (Public)

    Framework for evaluating and improving agents

    Python · 2.8k stars · 1.2k forks

  3. benchflow-ai/skillsbench (Public)

    SkillsBench evaluates how well skills work and how effective agents are at using them.

    PDDL · 1.4k stars · 320 forks

  4. harbor-framework/terminal-bench-3 (Public)

    Measuring agents' ability to get work done on a computer

    Python · 268 stars · 317 forks

  5. benchflow-ai/benchflow (Public)

    Research infra for creating RL environments, post-training, and evals

    Python · 275 stars · 34 forks