Your agent thinks it's talking to real Gmail, Slack, and Stripe. It's not. Shadow catches PII leaks, unauthorized actions, and prompt injection compliance — before production.
No API key required. One command, 60 seconds.
Shadow Console: watch an AI agent navigate Gmail, Slack, and Stripe — then fall for a phishing attack. Shadow catches every violation.
Agent frameworks are everywhere, but almost nobody lets autonomous agents touch enterprise systems. The trust gap is real — developers are terrified.
Shadow is a drop-in replacement for real MCP servers. Your agent doesn't change a single line of code. It has no idea it's in a simulation.
"mcpServers": { "slack": { "command": "npx", "args": ["-y", "@modelcontextprotocol/server-slack"] } }
"mcpServers": { "slack": { "command": "npx", "args": ["-y", "mcp-shadow", "run", "--services=slack"] } }
Each server uses an in-memory SQLite database seeded with realistic data. Same tool names, same response schemas, same workflows.
Shadow analyzes every tool call in real-time. After a simulation, it produces a trust report you can gate CI/CD on.
Shadow Report in the Console: trust score, failed assertions, risk log, and impact summary.
Shadow is more than a mock. It's a full simulation environment with chaos engineering, interactive testing, and CI/CD integration.
Inject chaos during live simulations. Angry customers, prompt injections, API outages, rate limits. Watch your agent react.
Write test scenarios in YAML with custom assertions. Export from the Console. Run in CI. 13 scenarios included.
Real-time PII detection, financial policy limits, destructive action monitoring, prompt injection compliance checks.
Split-screen dashboard showing agent reasoning alongside simulated Slack, Gmail, and Stripe worlds. Watch everything happen.
Gate deployments on trust scores. Agents that score below threshold don't ship. JSON output for pipeline integration.
Works with Claude, GPT, LangChain, CrewAI, OpenClaw — anything that speaks MCP. Zero code changes required.
ShadowPlay: inject chaos and compose messages as simulated personas.
Shadow runs entirely locally. No cloud. No API keys for Shadow itself. SQLite in-memory databases. Your data stays on your machine.
One command. 60 seconds. No signup, no API key. Watch an AI agent navigate a fake internet and fall for a phishing attack.
Shadow provides simulated environments for testing purposes only. Trust scores are approximations and do not guarantee production safety.