Patronus AI Secures $50M Series B to Build Digital Worlds for Stress‑Testing AI Agents

Patronus AI disclosed on June 25, 2026 that it has closed a $50 million Series B financing round, bringing total capital raised to $70 million. The funding, led by Greenfield Partners with participation from Notable Capital, Lightspeed, Datadog and Samsung, will be used to expand its simulated digital‑world platform that evaluates AI agents under realistic, high‑stress scenarios.

Building “Digital World Models” for Agent Stress‑Testing

The startup creates replica environments of websites, internal systems and other operational contexts, allowing AI agents to be tested after training via reinforcement learning. By rewarding successful task completion and penalising errors, the platform surfaces shortcuts and failure modes that traditional benchmarks miss.

Funding Surge and Revenue Explosion: 15‑Fold Growth in One Year

Revenue growth: 15× increase over the past 12 months.
Total funding: $70 million after the new round.
Investors: Greenfield Partners (lead), Notable Capital, Lightspeed, Datadog, Samsung.
Customers: Frontier AI labs and emerging AI startups seeking reliable agent evaluation.

Why Simulated Environments Are Becoming Critical for Reliable AI Agents

Benchmarks alone no longer prove an agent’s ability to execute multi‑step, real‑world tasks. Simulated digital worlds give agents exposure to rare or unpredictable conditions—similar to how Waymo trains autonomous cars—ensuring models can handle edge cases without human oversight. This capability addresses a growing industry need for accountability and safety as agents move from answering questions to executing financial analyses, travel bookings, and other high‑stakes operations.

Future Outlook: Expansion Beyond Verifiable Tasks and Competitive Landscape

Co‑founder Anand Kannappan notes that while current offerings focus on verifiable problems (software engineering, finance), the roadmap includes harder‑to‑verify domains. The company sees its main competition as internal evaluation teams within AI labs, positioning itself as a turnkey, fully automated alternative to human‑data firms such as Mercor and Surge.

With capital to scale infrastructure and broaden industry coverage, Patronus AI is poised to become a cornerstone of AI‑agent safety, potentially shaping standards for how future autonomous systems are validated.