-

Upwind Unveils Novel Approach to High-Speed, Precise Detection of Malicious AI Prompts with Nvidia AI

New research demonstrates real-time detection of malicious LLM prompts in production environments without sacrificing latency or cost efficiency

SAN FRANCISCO--(BUSINESS WIRE)--Upwind, the runtime-first cloud security platform leader today unveiled the results of research from RSAC Conference demonstrating that malicious Large Language Model (LLM) prompts can be detected with approximately 95% precision, while maintaining sub-millisecond inference for real-time traffic with Nvidia technology. In the evaluation, advanced LLM reasoning was applied only to a small subset of high-risk requests, avoiding the latency and cost overhead that has made many AI security approaches impractical at scale.

Upwind has demonstrates that malicious Large Language Model (LLM) prompts can be detected with approximately 95% precision, while maintaining sub-millisecond inference for real-time traffic with Nvidia technology.

Share

As enterprises move generative AI into production, with Gartner predicting that more than 80% will use generative AI APIs, models, or deployed enabled applications in production this year, application security is undergoing a fundamental shift. The interface itself, natural language, has become the attack surface. Unlike traditional exploits that target code vulnerabilities or malformed packets, LLM threats are embedded in language, manipulating meaning and intent. As these models move into enterprise workflows, they introduce new threat categories including prompt injection, jailbreaks, data exfiltration and social engineering. Traditional security controls are poorly suited to these threats.

“LLMs don’t just process input, they interpret intent,” said Moshe Hassan, VP Research & Innovation, at Upwind. “That changes the security model entirely. Organizations aren’t just trying to block bad code anymore, they have to stop attempts that twist language and manipulate systems. Our research with Nvidia shows you can do that effectively in live production environments, without slowing things down or driving up costs.”

A Three-Stage Architecture Built for Production

Rather than relying on a single heavyweight model or static rules, Upwind engineered a layered detection system designed around challenges including, latency, cost, false-positive tolerance and explainability.

The system operates in three stages:

Stage 1: LLM Traffic Identification
A lightweight classifier filters traffic to determine whether a request is even LLM-bound. This stage ran in under a millisecond and achieved 99.88% accuracy, ensuring that semantic analysis is applied only when necessary.

Stage 2: Semantic Threat Detection
Once a request was identified as heading to an LLM, the next challenge was determining whether it was malicious. The team analyzed these requests using the Nvidia nv-embedcode-7b-v1 model, deployed through NVIDIA NIM microservices. After testing multiple models, nv-embedcode-7b-v1 proved most effective at distinguishing normal prompts from malicious prompts, including indirect jailbreaks and prompt injections, while running on infrastructure fast enough for real-time protection. This stage achieved 94.53% detection accuracy, while maintaining inference times well under 0.1 milliseconds, demonstrating that high-quality AI security can operate at production speed and scale.

Stage 3: Selective LLM Validation
As part of a progressive, multi-stage workflow, high-risk or uncertain cases are escalated to NVIDIA Nemotron-3-Nano-30B model for a more reliable determination. NVIDIA NeMo Guardrails is also integrated to apply predefined rules and structured output formats, ensuring responses remain consistent and aligned with security policies. This selective escalation improves accuracy and decision confidence while keeping the system efficient.

From Detection to Actionable Security

Detection alone isn’t enough in modern cloud environments, where a flagged prompt is just one piece of a much larger puzzle. By embedding LLM threat detection directly into Upwind’s runtime and cloud visibility platform, malicious prompts are surfaced not as isolated model outputs, but as actionable security events within the broader cloud ecosystem.

As AI adoption accelerates, language-based threats are becoming an operational reality. The research from Upwind, with Nvidia, proves organizations don’t have to choose between innovation and security. To learn more about this research and Upwind, visit www.upwind.io

About Upwind

Upwind is the next-generation cloud security platform built to lead the Runtime revolution. Headquartered in San Francisco, California, Upwind brings together a unified vision for cloud and application-layer protection, empowering organizations to run faster, detect threats earlier and secure their environments with unmatched precision. The company was founded by Amiram Shachar and the founding team behind Spot.io (acquired by NetApp for $450 million) and is backed by leading investors including Bessemer, Salesforce Ventures, Greylock, Cyberstarts, Leaders Fund, Craft Ventures, TCV, Alta Park, Cerca Partners, Swish Ventures and Penny Jar Capital. Upwind has raised $430 million since its founding in 2022 and is trusted by forward-thinking enterprises globally to bring real-time runtime intelligence to modern cloud security. For more information or to schedule a demo, visit www.upwind.io.

Contacts

About Upwind
Reesha Dedhia
reesha.dedhia@upwind.io

More News From Upwind

Upwind Partners with Microsoft to Deliver Runtime Security for Azure Workloads

SAN FRANCISCO--(BUSINESS WIRE)--Upwind, the runtime-first cloud security leader, today announced a partnership with Microsoft to deliver a unified Azure security solution to enterprises worldwide. The partnership brings together runtime protection, posture management, and vulnerability detection in a single experience, giving organizations continuous and integrated visibility across their Azure environments. Available on the Microsoft Marketplace, the solution offers deep alignment with Azure’s...

Upwind Doubles Down on India and Expands Footprint Across Asia-Pacific and Japan to Meet Growing Demand for Real-Time Cloud and AI Security

SAN FRANCISCO--(BUSINESS WIRE)--Upwind, the runtime-first cloud security leader, today announced an expansion of its presence in India as part of a broader scale-up across Asia-Pacific and Japan (APJ), as enterprises face a new era of real-time cloud and AI risk. Building on its established offices in Mumbai, Bangalore, Pune, Singapore, Tokyo, and Sydney, Upwind has grown its global customer base by 200% year over year and more than tripled its APJ workforce in the past three months alone, refl...

Upwind Runtime-First Cloud Security Platform Leader Integrates With New Extended Plan for AWS Security Hub

SAN FRANCISCO--(BUSINESS WIRE)--Upwind, the runtime-first cloud security leader, today announced that its cloud-native application protection platform (CNAPP) is now integrated with the Extended plan in AWS Security Hub, Amazon Web Services’ unified security solution. This integration enables customers to gain deep, real-time visibility across their AWS workloads while reducing alert noise and focusing on the risks that matter most. By combining AWS detection services with Upwind’s runtime cont...
Back to Newsroom