-

Armis Launches First-of-Its-Kind Benchmark Report Warning of Critical Security Gaps in AI-Native Development

Research reveals 100% of leading generative AI models fail to generate secure code for critical development scenarios

SAN FRANCISCO--(BUSINESS WIRE)--Armis, the cyber exposure management & security company, is warning that the rapid enterprise adoption of AI-native development is outpacing critical security safeguards, leaving organizations exposed to systemic vulnerabilities.

New research from Armis Labs’ Trusted Vibing Benchmark Report, which evaluates 18 leading generative AI models across 31 test scenarios, reveals a 100% failure rate in generating secure code. These vulnerabilities are most prevalent in high-risk areas like memory buffer overflows, design file uploads and authentication systems. Therefore, organizations should immediately implement AI-native application security controls to reduce risk.

“The era of vibe coding is here, but speed should not come at the cost of security,” said Nadir Izrael, CTO and Co-Founder of Armis. “Our research finds that the worst offenders are the same ones selling security solutions for the very vulnerabilities their models create. If the industry continues to integrate autonomous code without oversight, we aren't just halting velocity – we are accelerating technical debt.”

The report identifies a concerning variance in security across the AI landscape:

  • Universal Blind Spots: Even the most advanced models produce vulnerable code in over 30% of scenarios. This is compounded by a dangerous perception gap. The 2026 Armis Cyberwarfare Report indicates that 77% of global IT decision-makers trust the integrity and security of the third-party code used in their most critical applications, despite 16% admitting they do not know if it is thoroughly checked for high-severity vulnerabilities.
  • The Performance Gap: Not all models are created equal. For example, Gemini 3.1 Pro emerges as a leader in security posture, while older proprietary models show significantly higher vulnerability counts and a lack of baseline security guardrails.
  • Cost vs. Security: A higher cost does not necessarily mean better safety. Low-cost open-source models, such as Qwen 3.5 and Minimax M2.5, provide highly competitive security performance at a fraction of the price.

“Organizations are currently playing a subjective guessing game with AI-generated code,” added Izrael. “To effectively move forward, application security must evolve from ‘scanner management’ to true ‘risk management.’ Security teams need to stop drowning in signal noise and start using AI-native controls that can prioritize findings based on real business impact.”

The Trusted Vibing Benchmark Report, which will be regularly updated by the pioneering team at Armis Labs, measures how leading commercial and open-source AI models generate secure code and resist producing critical vulnerabilities across various scenarios. It focuses on four core areas: testing generated code using "atomic" features or functions, the choice of prompt, the choice of test harness, and the choice of application security tool.

Armis Centrix™ for Application Security helps organizations secure their entire software supply chain through AI-powered detection, contextualization and remediation.

For a closer look at the report findings and key takeaways, read our blog here.

About Armis

Armis, the cyber exposure management & security company, protects the entire attack surface and manages the organization’s cyber risk exposure in real time. In a rapidly evolving, perimeter-less world Armis ensures that organizations continuously see, protect and manage all critical assets – from the ground to the cloud. Armis secures Fortune 100, 200 and 500 companies as well as national governments, state and local entities to help keep critical infrastructure, economies and society stay safe and secure 24/7. Armis is a privately held company headquartered in California.

Contacts

Media Contacts:
Rebecca Cradick
Vice President, Global Communications
Armis
pr@armis.com

Armis

Details
Headquarters: San Francisco, San Francisco
Website: www.armis.com
CEO: Yevgeny Dibrov
Employees: 696
Organization: PRI

Release Versions

Contacts

Media Contacts:
Rebecca Cradick
Vice President, Global Communications
Armis
pr@armis.com

Social Media Profiles
More News From Armis

Armis Appoints Simon Mouyal as Chief Marketing Officer

SAN FRANCISCO--(BUSINESS WIRE)--Armis, the cyber exposure management & security company, today announced the appointment of Simon Mouyal as Chief Marketing Officer. In this role, Mouyal will oversee Armis’ global marketing strategy and execution to accelerate category leadership and demand for Armis Centrix™, the Armis Cyber Exposure Management Platform. During the next phase of Armis’ expansion, he will scale the global marketing organization and help further drive the company’s rapid grow...

Armis Warns Cyberwarfare Threats at Global Tipping Point as AI Accelerates Escalation

SAN FRANCISCO--(BUSINESS WIRE)--Armis, the cyber exposure management & security company, is warning that cyberwarfare threats have reached a global pressure-cooker moment. As emerging technologies accelerate cyber operations and geopolitical tensions worsen, attackers are increasingly targeting the infrastructure, information, and systems that underpin global stability. “Geopolitical tensions, AI acceleration, and unresolved security gaps are colliding, bringing the state of cyberwarfare to...

Armis Announces Armis Centrix™ for Vulnerability Management Detection and Response to Reduce Cyber Risk and Eliminate Operational Friction

SAN FRANCISCO--(BUSINESS WIRE)--Armis, the cyber exposure management & security company, today announced Armis Centrix™ for Vulnerability Management Detection and Response. The solution enables security teams to precisely identify and validate vulnerabilities across all of an organization’s assets in real time. Armis’ unified approach to vulnerability assessment results in greater accuracy, faster detection time and reduced operational costs. "Waiting weeks for a vulnerability scan that sti...
Back to Newsroom