dashitongzhi/README.md

╔═══════════════════════════════════════════╗
║                                           ║
║   ██╗  ██╗ ██████╗   ██████╗ ██╗          ║
║   ██║ ██╔╝ ██╔══██╗ ██╔═══██╗ ██║         ║
║   █████╔╝  ██████╔╝ ████████║ ██║         ║
║   ██╔██╗   ██╔══██╗ ██╔═══██║ ██║         ║
║   ██║ ██╗  ██║  ██║ ██║   ██║ ████████╗   ║
║   ╚═╝  ╚═╝ ╚═╝  ╚═╝ ╚═╝   ╚═╝ ╚═══════╝   ║
║                                           ║
╚═══════════════════════════════════════════╝

⚡ KRAL ⚡


About Me

Hi, I'm Cao Hanzhe (Kral), a CS student, AI researcher, and open-source enthusiast.

  • 🔭 Building Multi-Agent Systems that reason, debate, and collaborate to solve complex real-world problems
  • 🧠 Advancing Reinforcement Learning: from RLHF reward modeling to agentic RL with multi-turn reasoning
  • 🤖 Bridging the gap between simulation and real-world robotics: sim-to-real transfer, embodied AI
  • 🌐 Pushing the frontier of LLM Reasoning: test-time compute scaling, search-augmented generation, tool-use agents
  • 🏗️ Creator of MingJian (明鉴), an evidence-driven multi-agent simulation platform for strategic decision-making
  • 📬 Reach me at: cajd6876@gmail.com

What I Do

🤖 Multi-Agent Orchestration

Designing agent systems where multiple LLMs collaborate through debate protocols, evidence chains, and structured reasoning, not just simple tool-calling.

🧠 Reinforcement Learning

Implementing and fixing core RL code, from PettingZoo parallel environments to LinUCB contextual bandits. Contributing fixes upstream to pytorch/rl and Pearl.

🦾 Robotics & Sim-to-Real

Working with robosuite and NVIDIA IsaacLab to build robust simulation pipelines that transfer to real robots. Fixing core physics engine bugs and resource management issues.

🛡️ AI Safety & Evaluation

Building automated safety checks: prompt injection detection, red-teaming frameworks, and LLM evaluation harnesses. Contributing to the Giskard AI safety platform.


Featured Project

🏗️ MingJian (明鉴): Multi-Agent Decision Platform

AI-powered multi-agent platform for evidence-driven scenario simulation and strategic decision-making

  • ⭐ 19 stars · Python · FastAPI
  • 🏛️ Supports corporate and military strategic domains
  • 🎭 Multi-agent debate protocol with evidence chains
  • 📊 Real-time scenario simulation engine
  • 🔗 github.com/dashitongzhi/MingJian

Tech Stack

Languages

Python · Rust · TypeScript · Swift · C++

AI / ML

PyTorch · LangChain · HuggingFace · OpenAI

Infrastructure

Docker · Linux · FastAPI · ROS


Open Source Contributions

Active contributor to 30+ AI and agent projects across GitHub: fixing core bugs, adding safety features, and improving developer experience

| Category | Projects | Highlights |
| --- | --- | --- |
| 🤖 Agent Frameworks | rllm, notte, Composio/agent-orchestrator, stakpak/agent | Core bug fixes, session management, async improvements |
| 🧠 Reinforcement Learning | pytorch/rl, facebookresearch/Pearl, alibaba/ROLL | Fixing PettingZoo parallel env bugs, LinUCB tensor squeeze, agentic LR scheduler |
| 🦾 Robotics | robosuite, IsaacLab, SmolVM | Resource leak fixes, docstring corrections, sim-to-real improvements |
| 🔧 AI Infrastructure | Kokoro-FastAPI, any-llm, Art, burr | Error message fixes, kwargs passthrough, install automation |
| 🛡️ AI Safety | Giskard-AI, Agent-R1 | LLM-based prompt injection detection, red-teaming checks |
| 📦 Dev Tools | visidata, cc-switch, hermecore, go-micro | Shell command fixes, metadata parsing, CI improvements |

Achievements

  • 🏆 Starstruck: repository earned 16+ stars
  • 🦈 Pull Shark: merged 30+ pull requests across major open-source projects
  • 📊 330+ contributions in the last year
  • 🌍 Contributed to projects from Meta, PyTorch, Alibaba, NVIDIA, Apache, and more
  • 🚀 Built and maintained MingJian, a production-grade multi-agent platform

Ask Me About

Multi-Agent Systems · Reinforcement Learning · Robotics · LLM Reasoning · AI Safety · Sim-to-Real


Currently Working On

  • 🏗️ MingJian v2: enhanced multi-agent debate protocol with evidence chain validation
  • 🧠 Agentic RL: multi-turn reinforcement learning for LLM agents
  • 🦾 IsaacLab Contributions: improving sim-to-real transfer pipelines
  • 🛡️ Prompt Injection Detection: building automated LLM safety evaluation tools

Quote

"The question of whether machines can think is about as relevant as the question of whether submarines can swim." - Edsger W. Dijkstra


Let's Connect

I'm always open to collaboration on multi-agent systems, RL research, and robotics projects.

If you're building something in the AI agent space, let's talk! 🚀

Popular repositories

  1. MingJian

    AI-powered multi-agent platform for evidence-driven scenario simulation and strategic decision-making. Supports corporate and military domains with a debate protocol.

    Python 29 3

  2. NNovel

    A tool to assist with novel writing

    Python 3 1

  3. HoST

    Forked from InternRobotics/HoST

    [RSS 2025 Best Systems Paper Finalist] Official implementation of "Learning Humanoid Standing-up Control across Diverse Postures"

    Python 1

  4. TextOp

    Forked from TeleHuman/TextOp

    TextOp: Real-time Interactive Text-Driven Humanoid Robot Motion Generation and Control

    Python 1

  5. liquid-glass-react

    Forked from rdev/liquid-glass-react

    Apple's Liquid Glass effect for React

    TypeScript 1

  6. liquid-glass

    Forked from callstack/liquid-glass

    Liquid Glass in React Native

    TypeScript 1