Horizon3.ai Announces Breakthrough Research Making Autonomous AI Cyber Defense Safe to Deploy

SAN FRANCISCO--(BUSINESS WIRE)--Horizon3.ai, the AI-native proactive security leader, today announced new research that addresses one of the most critical barriers to adopting AI in cybersecurity: making autonomous defense systems predictable, controllable, and safe for real-world deployment.

AI-powered security tools have long promised speed and adaptability, but security teams have been reluctant to trust them in production. The issue is not intelligence. It is unpredictability. When facing adaptive attackers, even highly capable AI agents can behave in ways that introduce operational risk.

Horizon3.ai’s research addresses this challenge with a new tool-mediated architecture that makes stability a property of the system itself, not the underlying AI model. The AI remains responsible for strategy, but every action is constrained to a finite, pre-approved catalog and executed through deterministic, validated tools. This ensures the system remains controllable and stable, even under adversarial pressure.

Drawing from our vast training dataset derived from 250,000 real pentests, the results were tested and validated on 161 organizations from 25 different industries and demonstrated measurable impact and repeatable behavior across production configurations:

Same outcome, different paths: Across 40 runs under varying conditions, the system explored different action sequences but consistently converged on identical defensive outcomes with zero variance in attacker success.
Proven risk reduction: The system reduced attacker expected success, measured as game value, by 59 percent compared to a strong deterministic baseline.
Every single policy deployment strictly improves defense: On all 250+ real enterprise attack graphs, every EDR policy change made the defender’s position strictly better, with no risk of regression. This forms the foundation for safe, automatic policy tuning.
Zero off-catalog hallucinations across 420+ deployments: Even when using different AI models and temperatures, the system never proposed an action outside the approved catalog, eliminating dangerous or unauthorized changes.
Under adversarial pressure: The system improved observability by 4.7×, automatically adjusting EDR logging policies to provide a more accurate, real-time understanding of the environment.
Model-agnostic safety: Both Claude Sonnet 4 and the significantly smaller Claude Haiku 4.5 operated fully within the constrained action space across hundreds of deployments, demonstrating that safety is independent of model capability.
Converges in an average of just 3 rounds: The system reached a strong defensive posture in an average of approximately three rounds, with 97.7 percent of the total improvement often occurring in the first round, aligning with typical SOC maintenance windows.

The research includes formal mathematical proofs, grounded in control theory and game theory, showing that the system always gets stronger with every policy change, remains stable even when attackers try new tactics, and steadily builds a more accurate picture of the network.

Most importantly, this enables a new operational capability: safe, automatic tuning of critical defenses in live environments. The NodeZero® AI-native Proactive Security Platform can now autonomously adjust EDR policies, including Microsoft Defender, with the assurance that changes will not degrade the overall defensive posture.

“Security teams have been waiting for AI that can match an attacker’s creativity without introducing operational risk. Today we’ve delivered that. By combining powerful AI reasoning with tightly constrained, pre-approved actions, we’ve made autonomous defense not just intelligent, but predictable, controllable, and provably stable for live production environments. This changes the game: organizations can now safely let AI continuously tune and strengthen their defenses in real time,” said Snehal Antani, CEO of Horizon3.ai.

The findings also challenge the assumption that only the most advanced AI models can deliver reliable results. The research shows that while more capable models can improve performance, safety and stability come from the architecture itself. Even smaller, more cost-efficient models can operate safely within this framework, enabling organizations to deploy AI-driven defense without relying on expensive frontier models.

To read the full research report, visit: https://arxiv.org/abs/2605.03034

The work marks a significant step toward fully autonomous learning loops between AI attackers and AI defenders, grounded in real-world attack data rather than synthetic environments. It establishes a practical foundation for deploying AI-driven cyber defense systems that are both effective and trustworthy at scale.

About HORIZON3.ai

Horizon3.ai is the AI-native proactive security company redefining how organizations validate and strengthen their defenses. It is the company behind NodeZero®, the world’s best and most experienced AI hacker, trusted by 4 of the Fortune 10, global banks, top pharmaceutical and semiconductor manufacturers, and critical infrastructure operators.

NodeZero enables organizations to proactively hack, fix, verify, and repeat testing on-demand across their environment, resulting in stronger defenses and measurable improvements in cyber resilience. Founded by former U.S. Special Operations members and industry experts, Horizon3.ai is trusted by more than 5,500 customers who have executed over 250,000 production-safe pentests.

Follow HORIZON3.ai on LinkedIn and X.

Contacts

Media Contact
Stephen Gates
press@horizon3.ai

Industry:

More News From Horizon3.ai

Horizon3.ai and Brinqa Partner to Help Enterprises Prioritize Real-World Cyber Exposure

SAN FRANCISCO & AUSTIN, Texas--(BUSINESS WIRE)--Horizon3.ai, the AI-native proactive security leader, and Brinqa, a leader in AI-powered exposure management, today announced a strategic technology partnership designed to help enterprises operationalize Continuous Threat Exposure Management (CTEM), strengthen risk-based vulnerability management (RBVM) programs, and advance Adversarial Exposure Validation (AEV) initiatives through validated, attacker-informed exposure intelligence. The partnershi...

Horizon3.ai Launches Rapid Response to Secure the Era of AI-Powered Attacks

SAN FRANCISCO--(BUSINESS WIRE)--Horizon3.ai, the AI-native proactive security leader, today announced Rapid Response, a new capability that helps organizations identify, prioritize, and respond to emerging threats in an era of AI-driven vulnerability discovery and exploit development. As exploit windows continue to shrink, organizations are struggling to determine which threats represent real, exploitable risk to their business before attackers can weaponize them. Rapid Response helps security...

Horizon3.ai NodeZero® Assessed “Awardable” for Department of War Work in the Tradewinds Solutions Marketplace

SAN FRANCISCO--(BUSINESS WIRE)--Horizon3.ai, the AI-Native Proactive Security Company behind NodeZero®, today announced that NodeZero has achieved “Awardable” status through the Tradewinds Solutions Marketplace (TSM). The Tradewinds Solutions Marketplace is the premier offering of the Department of War (DoW) Chief Digital and Artificial Intelligence Office (CDAO). It serves as a digital environment of post-competition video pitches designed to accelerate the procurement of Artificial Intelligen...

Back to Newsroom

Services & Solutions

Services

Solutions For

Resources

Education

Why Business Wire

Horizon3.ai Announces Breakthrough Research Making Autonomous AI Cyber Defense Safe to Deploy

Contacts

Horizon3.ai

Contacts

Horizon3.ai and Brinqa Partner to Help Enterprises Prioritize Real-World Cyber Exposure

Horizon3.ai Launches Rapid Response to Secure the Era of AI-Powered Attacks

Horizon3.ai NodeZero® Assessed “Awardable” for Department of War Work in the Tradewinds Solutions Marketplace

Horizon3.ai

Contacts