-

Horizon3.ai Announces Breakthrough Research Making Autonomous AI Cyber Defense Safe to Deploy

SAN FRANCISCO--(BUSINESS WIRE)--Horizon3.ai, the AI-native proactive security leader, today announced new research that addresses one of the most critical barriers to adopting AI in cybersecurity: making autonomous defense systems predictable, controllable, and safe for real-world deployment.

AI-powered security tools have long promised speed and adaptability, but security teams have been reluctant to trust them in production. The issue is not intelligence. It is unpredictability. When facing adaptive attackers, even highly capable AI agents can behave in ways that introduce operational risk.

Horizon3.ai’s research addresses this challenge with a new tool-mediated architecture that makes stability a property of the system itself, not the underlying AI model. The AI remains responsible for strategy, but every action is constrained to a finite, pre-approved catalog and executed through deterministic, validated tools. This ensures the system remains controllable and stable, even under adversarial pressure.

Drawing from our vast training dataset derived from 250,000 real pentests, the results were tested and validated on 161 organizations from 25 different industries and demonstrated measurable impact and repeatable behavior across production configurations:

  • Same outcome, different paths: Across 40 runs under varying conditions, the system explored different action sequences but consistently converged on identical defensive outcomes with zero variance in attacker success.
  • Proven risk reduction: The system reduced attacker expected success, measured as game value, by 59 percent compared to a strong deterministic baseline.
  • Every single policy deployment strictly improves defense: On all 250+ real enterprise attack graphs, every EDR policy change made the defender’s position strictly better, with no risk of regression. This forms the foundation for safe, automatic policy tuning.
  • Zero off-catalog hallucinations across 420+ deployments: Even when using different AI models and temperatures, the system never proposed an action outside the approved catalog, eliminating dangerous or unauthorized changes.
  • Under adversarial pressure: The system improved observability by 4.7×, automatically adjusting EDR logging policies to provide a more accurate, real-time understanding of the environment.
  • Model-agnostic safety: Both Claude Sonnet 4 and the significantly smaller Claude Haiku 4.5 operated fully within the constrained action space across hundreds of deployments, demonstrating that safety is independent of model capability.
  • Converges in an average of just 3 rounds: The system reached a strong defensive posture in an average of approximately three rounds, with 97.7 percent of the total improvement often occurring in the first round, aligning with typical SOC maintenance windows.

The research includes formal mathematical proofs, grounded in control theory and game theory, showing that the system always gets stronger with every policy change, remains stable even when attackers try new tactics, and steadily builds a more accurate picture of the network.

Most importantly, this enables a new operational capability: safe, automatic tuning of critical defenses in live environments. The NodeZero® AI-native Proactive Security Platform can now autonomously adjust EDR policies, including Microsoft Defender, with the assurance that changes will not degrade the overall defensive posture.

“Security teams have been waiting for AI that can match an attacker’s creativity without introducing operational risk. Today we’ve delivered that. By combining powerful AI reasoning with tightly constrained, pre-approved actions, we’ve made autonomous defense not just intelligent, but predictable, controllable, and provably stable for live production environments. This changes the game: organizations can now safely let AI continuously tune and strengthen their defenses in real time,” said Snehal Antani, CEO of Horizon3.ai.

The findings also challenge the assumption that only the most advanced AI models can deliver reliable results. The research shows that while more capable models can improve performance, safety and stability come from the architecture itself. Even smaller, more cost-efficient models can operate safely within this framework, enabling organizations to deploy AI-driven defense without relying on expensive frontier models.

To read the full research report, visit: https://arxiv.org/abs/2605.03034

The work marks a significant step toward fully autonomous learning loops between AI attackers and AI defenders, grounded in real-world attack data rather than synthetic environments. It establishes a practical foundation for deploying AI-driven cyber defense systems that are both effective and trustworthy at scale.

About HORIZON3.ai

Horizon3.ai is the AI-native proactive security company redefining how organizations validate and strengthen their defenses. It is the company behind NodeZero®, the world’s best and most experienced AI hacker, trusted by 4 of the Fortune 10, global banks, top pharmaceutical and semiconductor manufacturers, and critical infrastructure operators.

NodeZero enables organizations to proactively hack, fix, verify, and repeat testing on-demand across their environment, resulting in stronger defenses and measurable improvements in cyber resilience. Founded by former U.S. Special Operations members and industry experts, Horizon3.ai is trusted by more than 5,500 customers who have executed over 250,000 production-safe pentests.

Follow HORIZON3.ai on LinkedIn and X.

Contacts

Media Contact
Stephen Gates
press@horizon3.ai

Horizon3.ai


Release Versions

Contacts

Media Contact
Stephen Gates
press@horizon3.ai

Social Media Profiles
More News From Horizon3.ai

Horizon3.ai Research Reveals Growing Divide Between Security Leaders and Practitioners

SAN FRANCISCO--(BUSINESS WIRE)--There is a growing disconnect between how security is reported at the executive level and how risk is experienced by those operating security programs day to day, according to new research from Horizon3.ai, the AI-native proactive security leader. That gap is reflected in the data: 97% of CISOs say they are confident their endpoint protection would detect attacker behavior, yet only 12% report testing that capability within the last three months. Just 30% of orga...

Horizon3.ai Accelerates Channel Investment at Global Partner Conference: Americas

SAN FRANCISCO--(BUSINESS WIRE)--Horizon3.ai, the AI-native proactive security leader, today announced expanded investment in its global partner ecosystem at its Global Partner Conference: Americas, signaling a continued shift toward partner-led growth and scaled delivery of offensive security outcomes. Now in its second year, the event brings together more than 100 partners in Orlando, Florida, reflecting strong momentum across the channel. Partners are a key driver of growth, with 32 percent o...

Horizon3.ai Named to Fast Company’s Annual List of the World’s Most Innovative Companies of 2026

SAN FRANCISCO--(BUSINESS WIRE)--Horizon3.ai is proud to have been named to Fast Company’s prestigious list of the World’s Most Innovative Companies of 2026. This year’s list shines a spotlight on businesses that are shaping industry and culture through their innovations. Alongside the World’s 50 Most Innovative Companies, Fast Company recognizes 720 honorees across 59 sectors and regions. “NodeZero® is the most experienced AI hacker in the world. It relentlessly seeks ways to compromise your ne...
Back to Newsroom