Upwind Unveils Novel Approach to High-Speed, Precise Detection of Malicious AI Prompts with Nvidia AI

New research demonstrates real-time detection of malicious LLM prompts in production environments without sacrificing latency or cost efficiency

SAN FRANCISCO--(BUSINESS WIRE)--Upwind, the runtime-first cloud security platform leader today unveiled the results of research from RSAC Conference demonstrating that malicious Large Language Model (LLM) prompts can be detected with approximately 95% precision, while maintaining sub-millisecond inference for real-time traffic with Nvidia technology. In the evaluation, advanced LLM reasoning was applied only to a small subset of high-risk requests, avoiding the latency and cost overhead that has made many AI security approaches impractical at scale.

Upwind has demonstrates that malicious Large Language Model (LLM) prompts can be detected with approximately 95% precision, while maintaining sub-millisecond inference for real-time traffic with Nvidia technology.
Share

As enterprises move generative AI into production, with Gartner predicting that more than 80% will use generative AI APIs, models, or deployed enabled applications in production this year, application security is undergoing a fundamental shift. The interface itself, natural language, has become the attack surface. Unlike traditional exploits that target code vulnerabilities or malformed packets, LLM threats are embedded in language, manipulating meaning and intent. As these models move into enterprise workflows, they introduce new threat categories including prompt injection, jailbreaks, data exfiltration and social engineering. Traditional security controls are poorly suited to these threats.

“LLMs don’t just process input, they interpret intent,” said Moshe Hassan, VP Research & Innovation, at Upwind. “That changes the security model entirely. Organizations aren’t just trying to block bad code anymore, they have to stop attempts that twist language and manipulate systems. Our research with Nvidia shows you can do that effectively in live production environments, without slowing things down or driving up costs.”

A Three-Stage Architecture Built for Production

Rather than relying on a single heavyweight model or static rules, Upwind engineered a layered detection system designed around challenges including, latency, cost, false-positive tolerance and explainability.

The system operates in three stages:

Stage 1: LLM Traffic Identification
A lightweight classifier filters traffic to determine whether a request is even LLM-bound. This stage ran in under a millisecond and achieved 99.88% accuracy, ensuring that semantic analysis is applied only when necessary.

Stage 2: Semantic Threat Detection
Once a request was identified as heading to an LLM, the next challenge was determining whether it was malicious. The team analyzed these requests using the Nvidia nv-embedcode-7b-v1 model, deployed through NVIDIA NIM microservices. After testing multiple models, nv-embedcode-7b-v1 proved most effective at distinguishing normal prompts from malicious prompts, including indirect jailbreaks and prompt injections, while running on infrastructure fast enough for real-time protection. This stage achieved 94.53% detection accuracy, while maintaining inference times well under 0.1 milliseconds, demonstrating that high-quality AI security can operate at production speed and scale.

Stage 3: Selective LLM Validation
As part of a progressive, multi-stage workflow, high-risk or uncertain cases are escalated to NVIDIA Nemotron-3-Nano-30B model for a more reliable determination. NVIDIA NeMo Guardrails is also integrated to apply predefined rules and structured output formats, ensuring responses remain consistent and aligned with security policies. This selective escalation improves accuracy and decision confidence while keeping the system efficient.

From Detection to Actionable Security

Detection alone isn’t enough in modern cloud environments, where a flagged prompt is just one piece of a much larger puzzle. By embedding LLM threat detection directly into Upwind’s runtime and cloud visibility platform, malicious prompts are surfaced not as isolated model outputs, but as actionable security events within the broader cloud ecosystem.

As AI adoption accelerates, language-based threats are becoming an operational reality. The research from Upwind, with Nvidia, proves organizations don’t have to choose between innovation and security. To learn more about this research and Upwind, visit www.upwind.io

About Upwind

Upwind is the next-generation cloud security platform built to lead the Runtime revolution. Headquartered in San Francisco, California, Upwind brings together a unified vision for cloud and application-layer protection, empowering organizations to run faster, detect threats earlier and secure their environments with unmatched precision. The company was founded by Amiram Shachar and the founding team behind Spot.io (acquired by NetApp for $450 million) and is backed by leading investors including Bessemer, Salesforce Ventures, Greylock, Cyberstarts, Leaders Fund, Craft Ventures, TCV, Alta Park, Cerca Partners, Swish Ventures and Penny Jar Capital. Upwind has raised $430 million since its founding in 2022 and is trusted by forward-thinking enterprises globally to bring real-time runtime intelligence to modern cloud security. For more information or to schedule a demo, visit www.upwind.io.

Contacts

About Upwind
Reesha Dedhia
reesha.dedhia@upwind.io

Industry:

More News From Upwind

Upwind Security Names Joe Sullivan, Former CSO of Facebook, Uber, and Cloudflare, as Strategic Advisor

SAN FRANCISCO--(BUSINESS WIRE)--Upwind, the leader in runtime-first cloud security, today announced that Joe Sullivan has joined the company as a Strategic Advisor. Sullivan is one of the most decorated and widely recognized security leaders in the technology industry, having served as the first Chief Security Officer at Facebook, Uber, and Cloudflare, and as the first federal prosecutor in Silicon Valley dedicated full-time to prosecuting cybercrime. His appointment reflects the significance o...

Upwind Named to Fast Company’s Annual List of the World’s Most Innovative Companies of 2026

SAN FRANCISCO--(BUSINESS WIRE)--Upwind, the runtime-first cloud security leader, is proud to have been named to Fast Company’s prestigious list of the World’s Most Innovative Companies of 2026. The recognition highlights Upwind’s role in redefining how enterprises secure modern cloud and AI-driven environments. This year’s list shines a spotlight on businesses that are shaping industry and culture through innovation. “Cloud and AI have already redefined how modern businesses operate, and securi...

Upwind Partners with Microsoft to Deliver Runtime Security for Azure Workloads

SAN FRANCISCO--(BUSINESS WIRE)--Upwind, the runtime-first cloud security leader, today announced a partnership with Microsoft to deliver a unified Azure security solution to enterprises worldwide. The partnership brings together runtime protection, posture management, and vulnerability detection in a single experience, giving organizations continuous and integrated visibility across their Azure environments. Available on the Microsoft Marketplace, the solution offers deep alignment with Azure’s...

Back to Newsroom

Services & Solutions

Services

Solutions For

Resources

Education

Why Business Wire

Upwind Unveils Novel Approach to High-Speed, Precise Detection of Malicious AI Prompts with Nvidia AI

Contacts

Upwind

Contacts

Upwind Security Names Joe Sullivan, Former CSO of Facebook, Uber, and Cloudflare, as Strategic Advisor

Upwind Named to Fast Company’s Annual List of the World’s Most Innovative Companies of 2026

Upwind Partners with Microsoft to Deliver Runtime Security for Azure Workloads

Upwind

Contacts