🛡️ Agent-Adversary

Forge more reliable AI Agents through rigorous adversarial testing, real-time X-Ray diagnostics, and automated governance.

Agent-Adversary is an enterprise-grade red-teaming framework designed specifically for the Agentic AI era. While traditional LLM benchmarks focus on static knowledge, Agent-Adversary probes the dynamic logic, tool-calling safety, and multi-modal resilience of your agents.

🌟 Why Agent-Adversary?

As AI move from "chatbots" to "agents" with system access, the attack surface expands exponentially. Agent-Adversary provides the "X-Ray" vision needed to secure these workflows.

🔍 Reasoning X-Ray: Visualize agent decision trees using D3.js to pinpoint exactly where logic fails under pressure.
🖼️ Multi-Modal Exploits: First-of-its-kind support for testing Vision-Language Models (VLM) against OCR injection and visual jailbreaks.
🐝 Swarm Resilience: Stress-test multi-agent systems against Byzantine failures and cross-agent prompt pollution.
🛡️ Live Patching: Automatically generate and apply prompt-level hotfixes to mitigate detected vulnerabilities in real-time.
🌐 Distributed Hub: Orchestrate massive red-teaming campaigns across a global fleet of workers with HMAC-signed integrity.

🛠️ Architecture

Agent-Adversary operates as a closed-loop security system:

Connect: Link via Shell, Browser (Playwright), Docker, or Cloud APIs.
Attack: Execute evolved payloads (Jailbreak, Logic Traps, Multi-modal).
Observe: Capture sub-millisecond telemetry and reasoning traces.
Audit: Generate executive-ready HTML security attestation reports.

🚀 Quick Start

Installation

git clone https://github.com/xbtlin/Agent-Adversary.git
cd Agent-Adversary
pip install -e .

Run your first Benchmark

# Test a local shell-based agent against all logic traps
adversary bench --connector shell --agent "./my_agent.sh" --scenario all

Start the X-Ray Dashboard

adversary dashboard

Access the interactive visualization at http://localhost:8000

📊 Key Features

1. Advanced Diagnostics (X-Ray)

Identify "Reasoning Spikes" and "Logic Loops" before they hit production. Our D3-powered engine renders the agent's internal thought process as an interactive graph.

2. Enterprise Governance

Immutable Audit Trail: Append-only logs for every attack and system modification.
Consensus Judging: Utilizes a panel of diverse LLMs (GPT-4o, Claude 3.5) to reach an objective safety verdict.
Compliance Ready: Generate reports that align with emerging AI safety standards.

3. Adaptive Red-Teaming

The framework learns from the agent's failures. Using a genetic algorithm, it mutates payloads to bypass specific system instructions, simulating a persistent human adversary.

🤝 Contributing

We welcome safety researchers and developers! Please read our CONTRIBUTING.md and SECURITY.md.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Built with ❤️ for a safer AI future.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
agent_adversary		agent_adversary
assets		assets
cli		cli
docs		docs
examples		examples
templates		templates
tests		tests
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
DEVELOPMENT_PLAN.md		DEVELOPMENT_PLAN.md
PHASE_10_PLAN.md		PHASE_10_PLAN.md
PHASE_1_SUMMARY.md		PHASE_1_SUMMARY.md
PHASE_3_PLAN.md		PHASE_3_PLAN.md
PHASE_4_PLAN.md		PHASE_4_PLAN.md
PHASE_5_PLAN.md		PHASE_5_PLAN.md
PHASE_6_PLAN.md		PHASE_6_PLAN.md
PHASE_7_PLAN.md		PHASE_7_PLAN.md
PHASE_8_PLAN.md		PHASE_8_PLAN.md
PHASE_9_PLAN.md		PHASE_9_PLAN.md
README.md		README.md
SECURITY.md		SECURITY.md
SPRINT_LOGS.md		SPRINT_LOGS.md
audit_report.html		audit_report.html
latest_report.md		latest_report.md
metrics.prom		metrics.prom
mock_agent.sh		mock_agent.sh
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛡️ Agent-Adversary

🌟 Why Agent-Adversary?

🛠️ Architecture

🚀 Quick Start

Installation

Run your first Benchmark

Start the X-Ray Dashboard

📊 Key Features

1. Advanced Diagnostics (X-Ray)

2. Enterprise Governance

3. Adaptive Red-Teaming

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🛡️ Agent-Adversary

🌟 Why Agent-Adversary?

🛠️ Architecture

🚀 Quick Start

Installation

Run your first Benchmark

Start the X-Ray Dashboard

📊 Key Features

1. Advanced Diagnostics (X-Ray)

2. Enterprise Governance

3. Adaptive Red-Teaming

🤝 Contributing

📄 License

About

Resources

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages