-

Omdia: China Hyperscalers Commercialize AI Amid Export Restrictions but Modern GPUs Remain Limited

LONDON--(BUSINESS WIRE)--What are the biggest cloud providers in Asia doing to meet the rising demand for AI inference? Omdia’s latest research offers an in-depth look at the evolving challenges of AI inference operations, the key trade-offs between throughput, latency, and support for diverse AI models, and the possible solutions. The report provides detailed coverage of companies such as Huawei, Baidu, Alibaba, ByteDance, Tencent, NAVER, and SK Telecom Enterprise. It examines which GPUs, AI accelerators, and AI-optimized CPUs these companies offer, their pricing, the stockpile of NVIDIA GPUs, their AI service portfolios, and the current status of their own AI models and custom chip projects.

Despite heavy stockpiling of NVIDIA H800 and H20 GPUs during 2024 and early 2025, prior to the imposition of US export controls, these high-performance chips are difficult to find in Chinese cloud services, suggesting they are primarily used for the hyperscalers’ own model development projects. Similarly, there are relatively few options that use any of the Chinese AI chip projects; exceptions include Baidu’s on-premises cloud products and some Huawei Cloud services, although they remain limited. Chinese hyperscale companies are well advanced in adopting best practices such as decoupled prefill and generation and publish seminal research in fundamental AI; however, the research papers often mention that the training runs are carried out using Western GPUs, with a few notable exceptions.

“The real triumph in Chinese semiconductors has been CPUs rather than accelerators,” says Omdia Principal Analyst and author of the report, Alexander Harrowell. “Chinese Arm-based CPUs are clearly in production at scale and are usually optimized for parallel workloads in a way like Amazon Web Services’ Graviton series. Products such as Alibaba’s YiTian 710 offer an economically attractive solution for serving the current generation of small AI models such as Alibaba Qwen3 in the enterprise, where the user base is relatively small and workload diversity is high.”

If modern GPUs are required, the strongest offering Omdia found was the GPU-as-a-service product SK Telecom is building in partnership with Lambda Labs. Omdia observed significant interest in moving Chinese workloads outside the great firewall in hopes of accessing modern GPUs and potentially additional training data. Among other important findings, nearly all companies now offer models-as-a-service platforms that enable fine-tuning and other customizations, making this one of the most common ways for enterprises to access AI capabilities. Chinese hyperscalers are especially interested in supporting AI applications at the edge. For example, ByteDance, offers a pre-packaged solution to monitor restaurant kitchens and report whether chefs are wearing their hats.

ABOUT OMDIA

Omdia, part of Informa TechTarget, Inc. (Nasdaq: TTGT), is a technology research and advisory group. Our deep knowledge of tech markets grounded in real conversations with industry leaders and hundreds of thousands of data points, make our market intelligence our clients’ strategic advantage. From R&D to ROI, we identify the greatest opportunities and move the industry forward.

Contacts

Fasiha Khan: fasiha.khan@omdia.com

More News From Omdia

Omdia: More Than Half of 45–54 Year Olds Now Watch Mobile Video While Watching TV

LONDON--(BUSINESS WIRE)--New primary research data from Omdia reveals that media multitasking is no longer just a Gen Z habit. More than half of adults aged 45–54 now watch video clips on their mobile phones while watching television, highlighting a major shift in viewing behavior and the growing fragmentation of attention across screens. According to Omdia’s latest consumer research, 52% of US viewers aged 45–54 reported watching video clips on their phones while watching TV in November 2025,...

Omdia: Global PC Shipments to Decline 12% in 2026 Amid Severe Memory and Storage Supply Challenges

LONDON--(BUSINESS WIRE)--Worldwide shipments of desktops, notebooks and workstations in 2026 are expected to decline by 12% to 245 million units, according to the latest outlook from Omdia. This forecast is grounded in sharp increases in memory and storage prices - particularly the expected minimum 60% rise in 1Q26. Further upward price pressure is anticipated throughout the remaining quarters of the year, though subsequent increases are expected to be more moderate. Since 1Q25, the costs of ma...

Omdia: Global Smartphone Shipments to Fall 7% in 2026 Amid Memory Constraints and Geopolitical Pressures

LONDON--(BUSINESS WIRE)--Global smartphone shipments are forecast to decline by around 7% year-on-year in 2026 according to Omdia’s latest outlook. This projection based on Q1 memory price assumptions, which indicate that pricing pressure and constrained supply will begin to ease in the second half of the year. The global smartphone market will face significant challenges in 2026 as tightening memory supply and elevated pricing place increasing cost pressures for vendors. Memory now accounts fo...
Back to Newsroom