MLCommons™ Releases MLPerf™ Training v1.0 Results

The latest benchmark submission round includes over 650 ML Training performance results for leading ML models, software, and hardware

SAN FRANCISCO--(BUSINESS WIRE)--Today, MLCommons, an open engineering consortium, released new results for MLPerf Training v1.0, the organization's machine learning training performance benchmark suite. MLPerf Training measures the time it takes to train machine learning models to a standard quality target in a variety of tasks including image classification, object detection, NLP, recommendation, and reinforcement learning. In its fourth round, MLCommons added two new benchmarks to evaluate the performance of speech-to-text and 3D medical imaging tasks.
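The measurement itself is simple to state: start a clock, train the reference model, and stop the clock once a predefined quality target is reached. The Python sketch below illustrates that idea only; the names train_one_epoch, evaluate, and target_quality are placeholders, not the actual MLPerf harness, which fixes the reference models, datasets, and quality targets for each benchmark.

    import time

    def time_to_train(model, train_one_epoch, evaluate, target_quality, max_epochs=100):
        # Train until a quality target is met and return the elapsed wall-clock
        # time, the core quantity MLPerf Training reports. All arguments here
        # are illustrative placeholders, not the actual MLPerf harness API.
        start = time.perf_counter()
        for epoch in range(1, max_epochs + 1):
            train_one_epoch(model)          # one pass over the training data
            quality = evaluate(model)       # e.g. validation accuracy or Dice score
            if quality >= target_quality:   # stop as soon as the target is met
                return time.perf_counter() - start, epoch
        raise RuntimeError("quality target not reached within max_epochs")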

MLPerf Training is a full-system benchmark that tests machine learning models, software, and hardware. With MLPerf, MLCommons has a reliable and consistent way to track performance improvements over time, and results from a "level playing field" benchmark drive competition, which in turn drives performance. Compared to the last submission round, the best benchmark results improved by up to 2.1X, showing substantial improvement in hardware, software, and system scale.

Similar to past MLPerf Training results, the submissions consist of two divisions: closed and open. Closed submissions use the same reference model to ensure a level playing field across systems, while participants in the open division are permitted to submit a variety of models. Within each division, submissions are further classified by availability: commercially available, in preview, or research, development, and internal (RDI).

New MLPerf Training Benchmarks to Advance ML Tasks and Performance

As industry adoption and use cases for machine learning expand, MLPerf will continue to evolve its benchmark suites to evaluate new capabilities, tasks, and performance metrics. With the MLPerf Training v1.0 round, MLCommons included two new benchmarks to measure performance for speech-to-text and 3D medical imaging. These new benchmarks leverage the following reference models:

  • Speech-to-Text with RNN-T: RNN-T (Recurrent Neural Network Transducer) is an automatic speech recognition (ASR) model that is trained on a subset of LibriSpeech. Given a sequence of speech input, it predicts the corresponding text. RNN-T is MLCommons’ reference model and is commonly used in production speech-to-text systems.
  • 3D Medical Imaging with 3D U-Net: The 3D U-Net architecture is trained on the KiTS19 dataset to find and segment cancerous cells in the kidneys. The model identifies whether each voxel within a CT scan belongs to healthy tissue or a tumor, and is representative of many medical imaging tasks; a minimal scoring sketch follows this list.
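
To make the new 3D medical imaging task concrete, the short sketch below computes a Dice score, the standard overlap metric for voxel-wise segmentation, on toy 3D masks. The arrays and numbers are illustrative assumptions; the official quality target and evaluation procedure are defined in the MLPerf Training rules.

    import numpy as np

    def dice_score(prediction, label):
        # Dice coefficient between predicted and reference 3D segmentation
        # masks (1 = tumor voxel, 0 = healthy tissue). Illustrative only.
        prediction = prediction.astype(bool)
        label = label.astype(bool)
        intersection = np.logical_and(prediction, label).sum()
        total = prediction.sum() + label.sum()
        return 2.0 * intersection / total if total > 0 else 1.0

    # Toy 4x4x4 volumes standing in for CT-scan segmentation masks.
    pred = np.zeros((4, 4, 4), dtype=np.uint8)
    ref = np.zeros((4, 4, 4), dtype=np.uint8)
    pred[1:3, 1:3, 1:3] = 1   # predicted tumor region (8 voxels)
    ref[1:3, 1:3, 1:4] = 1    # reference tumor region (12 voxels)
    print(f"Dice: {dice_score(pred, ref):.3f}")  # 2*8 / (8+12) = 0.800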

MLPerf Training v1.0 results further MLCommons’ goal to provide benchmarks and metrics that level the industry playing field through the comparison of ML systems, software, and solutions. The latest benchmark round received submissions from 13 organizations and released over 650 peer-reviewed results for machine learning systems spanning from edge devices to data center servers. Submissions this round included software and hardware innovations from Dell, Fujitsu, Gigabyte, Google, Graphcore, Habana Labs, Inspur, Intel, Lenovo, Nettrix, NVIDIA, PCL & PKU, and Supermicro. To view the results, please visit https://mlcommons.org/en/training-normal-10/.

“We’re thrilled to see the continued growth and enthusiasm from the MLPerf community, especially as we’re able to measure significant improvement across the industry with the MLPerf Training benchmark suite,” said Victor Bittorf, Co-Chair of the MLPerf Training Working Group. “Congratulations to all of our submitters in this v1.0 round; we’re excited to continue our work together, bringing transparency to machine learning system capabilities.”

“The industry progress highlighted in this round of results is outstanding,” said John Tran, Co-Chair of the MLPerf Training Working Group. “The training benchmark suite is at the center of MLCommons’ mission to push machine learning innovation forward for everyone, and we’re incredibly pleased both with the engagement from this round’s submissions and with the increasing interest in MLPerf benchmark results from businesses looking to adopt AI solutions.”

Additional information about the Training v1.0 benchmarks will be available at https://mlcommons.org/en/training-normal-10/.

About MLCommons

MLCommons is an open engineering consortium with a mission to accelerate machine learning innovation, raise all boats, and increase machine learning’s positive impact on society. The foundation for MLCommons began with the MLPerf benchmark in 2018, which rapidly scaled as a set of industry metrics to measure machine learning performance and promote transparency of machine learning techniques. In collaboration with its 50+ founding partners (global technology providers, academics, and researchers), MLCommons is focused on collaborative engineering work that builds tools for the entire machine learning industry through benchmarks and metrics, public datasets, and best practices.

For additional information on MLCommons and details on becoming a Member or Affiliate of the organization, please visit http://mlcommons.org/ or contact participation@mlcommons.org.

Contacts

MLCommons

