-

Red Hat AI Factory with NVIDIA Accelerates the Path to Scalable Production AI

New co-engineered offering combines Red Hat AI Enterprise and NVIDIA's accelerated computing software to provide a unified foundation for building, deploying, and scaling AI-enabled applications

RALEIGH, N.C.--(BUSINESS WIRE)--Red Hat, the world’s leading provider of open source solutions, today announced the Red Hat AI Factory with NVIDIA, a co-engineered software platform that combines Red Hat AI Enterprise and NVIDIA AI Enterprise to provide an end-to-end AI solution optimized for organizations deploying AI at scale. Red Hat AI Factory with NVIDIA is the latest milestone in the companies’ deep collaboration, accelerating the delivery of the newest AI innovations to enterprise customers today while also delivering Day 0 support for NVIDIA hardware architectures.

With enterprise AI spending expected to reach over $1 trillion by 20291, driven in large part by agentic AI applications, organizations are looking to shift their strategies toward high-density, agentic workflows and address the resulting demands on AI inference and infrastructure. To help organizations keep pace, Red Hat AI Factory with NVIDIA empowers IT operations teams to streamline management of both traditional infrastructure and the evolving demands of the AI stack.

Red Hat AI Factory with NVIDIA accelerates the path to production AI and delivers the software platform for AI factories, running on accelerated computing infrastructure that fuels higher performance for the models and NVIDIA GPUs driving the inference stack. The platform is supported on AI factory infrastructure from leading systems manufacturers, including Cisco, Dell Technologies, Lenovo and Supermicro. This empowers IT administrators and operations teams to scale and maintain AI deployments with the same operational rigor and predictability as any enterprise workload.

This co-engineered software platform integrates the open source collaboration, engineering and support expertise of both Red Hat and NVIDIA to deliver a trusted, enterprise-grade solution. The Red Hat AI Factory with NVIDIA provides a highly scalable foundation for AI deployments across any environment, whether on-premises, in the cloud or at the edge. It includes core capabilities for high-performance AI inference, model tuning, customization and agent deployment and management, with a focus on security. This allows organizations to maintain architectural control from the datacenter to the public cloud, delivering:

  • Accelerated time-to-value: Advance to production AI with streamlined workflows and instant access to pre-configured models, including the indemnified IBM Granite family, NVIDIA Nemotron, and NVIDIA Cosmos open models, delivered as NVIDIA NIM microservices. Additionally, organizations can further align models to enterprise data using NVIDIA NeMo, reducing tuning time and cost.
  • Optimized performance and cost: Maximize infrastructure usage and bolster inference performance with a unified, high-performance serving stack. Red Hat AI Factory with NVIDIA delivers built-in observability capabilities and taps Red Hat AI inference capabilities powered by vLLM, NVIDIA TensorRT-LLM, and NVIDIA Dynamo to meet strict AI service level objectives. This helps organizations reduce the total cost of ownership (TCO) for AI by optimizing the connection between models and NVIDIA GPUs.
  • Intelligent GPU orchestration: Enable on-demand access to GPU resources through intelligent orchestration and pooled infrastructure, with automatic checkpointing to protect long-running jobs and maintain more predictable compute costs in dynamic environments.
  • Strengthened enterprise posture: Leveraging the flexible and stable foundation of Red Hat Enterprise Linux, organizations benefit from advanced security and compliance capabilities built-in from the start that help to lower risk, save time and mitigate downtime. This delivers a security-hardened foundation for mission-critical AI workloads that require isolation and continuous verification. NVIDIA DOCA microservices build on this foundation, creating a zero-trust architecture and delivering AI runtime security across the infrastructure.

Availability

Red Hat AI Factory with NVIDIA is available now.

Supporting Quotes

Chris Wright, chief technology officer and senior vice president, Global Engineering, Red Hat
"The shift from AI experimentation to industrial-scale, enterprise-wide production requires a fundamental change in how we manage the AI computing stack. We’re accelerating the path to deploy AI and move quickly to production using Red Hat AI Factory with NVIDIA. With a stable, high-performance foundation driven by our proven hybrid cloud offerings, we’re enabling our customers to own their AI strategy and scale with the same rigor they apply to their core IT platforms."

Justin Boitano, vice president, Enterprise AI Platforms, NVIDIA
“Enterprises are building AI factories that turn data into intelligence at scale during inference, requiring production-grade infrastructure and software that span the hybrid cloud. Red Hat AI Factory with NVIDIA provides the software foundation that helps organizations keep pace with rapid infrastructure innovation while reliably building and deploying the next generation of agentic AI applications.”

Jeremy Foster, senior vice president and general manager, Cisco Compute
“Cisco is focused on helping customers move AI from experimentation to production, securely, at scale, and across distributed environments. By supporting Red Hat AI Factory with NVIDIA, Cisco enables organizations to deploy and operate AI on a consistent, enterprise-grade infrastructure foundation, from data center to edge. Together, we’re giving customers a simpler, more reliable way to run AI as a mission-critical workload, with the performance, security, and operational control they expect from their core infrastructure.”

Ihab Tarazi, senior vice president and chief technology officer, Infrastructure Solutions Group, Dell Technologies
“Enterprises are moving quickly to operationalize their AI investments, but that requires a robust, integrated infrastructure that can run reliably across their hybrid environments. Together with Red Hat and NVIDIA, we will bring customers new levels of integration, further accelerating enterprise AI outcomes."

Vlad Rozanovich, senior vice president, Infrastructure Solutions Group, Lenovo
“The next era of enterprise AI is about real-time action and tangible business return, and that requires an industrial-strength, hybrid foundation. We can bring a scalable, enterprise-grade platform that combines Lenovo’s inferencing-optimized infrastructure with Red Hat AI Factory with NVIDIA, to give customers the real-time advantage – a resilient foundation for agentic AI that is deployable and manageable anywhere they operate.”

Vik Malyala, president and managing director, EMEA, and senior vice president, Technology and AI, Supermicro
“Supermicro has an extensive portfolio of Red Hat-certified systems and is dedicated to delivering the most advanced accelerated computing infrastructure for AI factories. Our validated solutions for the Red Hat AI Factory with NVIDIA help ensure that customers can combine our high-performance, purpose-built systems with a robust, enterprise-grade software platform. This simplifies the deployment and scaling of mission-critical AI enterprise workloads, helping organizations achieve faster time-to-value and predictable, efficient operations across the hybrid cloud.”

Francisco Criado, senior vice president, Cloud, Security and AI, TD SYNNEX
"As a leading end-to-end distributor and a long-standing partner to both Red Hat and NVIDIA, TD SYNNEX is excited to bring the Red Hat AI Factory with NVIDIA to our channel partners and their customers, as a complementary addition to the TD SYNNEX Destination AI program. This optimized, enterprise-grade solution removes the complexity of building and deploying AI, helping organizations operationalize their AI investments across the hybrid cloud and accelerate their journey to real business value."

Neil Anderson, vice president, GS&A Cloud & Infrastructure Solutions, WWT
“WWT is committed to helping organizations move beyond AI experimentation to successfully scale production deployments across their IT environments. Red Hat AI Factory with NVIDIA helps meet this need by offering a validated platform to simplify deployments, accelerate time-to-value and provide operational consistency for clients.”

Additional Resources

Connect with Red Hat

About Red Hat

Red Hat is the open hybrid cloud technology leader, delivering a trusted, consistent and comprehensive foundation for transformative IT innovation and AI applications. Its portfolio of cloud, developer, AI, Linux, automation and application platform technologies enables any application, anywhere—from the datacenter to the edge. As the world's leading provider of enterprise open source software solutions, Red Hat invests in open ecosystems and communities to solve tomorrow's IT challenges. Collaborating with partners and customers, Red Hat helps them build, connect, automate, secure and manage their IT environments, supported by consulting services and award-winning training and certification offerings.

Forward-Looking Statements

Except for the historical information and discussions contained herein, statements contained in this press release may constitute forward-looking statements within the meaning of the Private Securities Litigation Reform Act of 1995. Forward-looking statements are based on the company’s current assumptions regarding future business and financial performance. These statements involve a number of risks, uncertainties and other factors that could cause actual results to differ materially. Any forward-looking statement in this press release speaks only as of the date on which it is made. Except as required by law, the company assumes no obligation to update or revise any forward-looking statements.

Red Hat, Red Hat Enterprise Linux, OpenShift and the Red Hat logo are trademarks or registered trademarks of Red Hat, LLC, or its subsidiaries in the U.S. and other countries. Linux® is the registered trademark of Linus Torvalds in the U.S. and other countries.

1IDC, Agentic AI to Dominate IT Budget Expansion Over Next Five Years, Exceeding 26% of Worldwide IT Spending, and $1.3 Trillion in 2029, August 26, 2025

Contacts

Media Contacts:
Jessie Beach
jbeach@redhat.com
+1 (919) 602-2836

Red Hat, Inc.

Details
Headquarters: Raleigh, North Carolina
CEO: Matt Hicks
Employees: 22,000
Organization: OTH

Release Versions

Contacts

Media Contacts:
Jessie Beach
jbeach@redhat.com
+1 (919) 602-2836

More News From Red Hat, Inc.

Red Hat Launches Red Hat AI Enterprise to Deliver a Unified AI Platform that Spans from Metal to Agents

RALEIGH, N.C.--(BUSINESS WIRE)--Red Hat, the world's leading provider of open source solutions, today announced Red Hat AI Enterprise, an integrated AI platform for deploying and managing AI models, agents and applications across the hybrid cloud. It joins the Red Hat AI portfolio which includes Red Hat AI Inference Server, Red Hat OpenShift AI and Red Hat Enterprise Linux AI. Red Hat is also introducing Red Hat AI 3.3, bringing significant updates and enhancements across the company’s entire A...

Red Hat Expands Collaboration with NVIDIA to Pair Enterprise Open Source with Rack-Scale AI for Faster, Production-Ready Innovation

RALEIGH, N.C.--(BUSINESS WIRE)--Red Hat, the world's leading provider of open source solutions, today announced a landmark expansion of its collaboration with NVIDIA to align enterprise open source technologies to the rapidity of enterprise AI evolution and rack-scale AI advances. As the industry moves beyond individual servers toward unified, high-density systems, Red Hat aims to deliver the starting point for this transformation with Red Hat Enterprise Linux for NVIDIA, a specialized edition...

Red Hat OpenShift Service on AWS with Hosted Control Planes in AWS GovCloud Achieves FedRAMP High Authorization

RALEIGH, N.C.--(BUSINESS WIRE)--Red Hat, the world's leading provider of open source solutions, today announced that Red Hat OpenShift Service on AWS in AWS GovCloud now offers support for hosted control plane architecture, achieving an incremental addition to the service’s Federal Risk and Authorization Management Program (FedRAMP) High Authorization. The authorization for the hosted control plane architecture builds upon the existing FedRAMP High approval previously granted to Red Hat OpenShi...
Back to Newsroom