Latest MLCommons results announced: Sharing AI achievements of the fifth-generation Xeon Scalable Processor

Publisher:EE小广播Latest update time:2024-03-29 Source: EEWORLD Reading articles on mobile phones Scan QR code
Read articles on your mobile phone anytime, anywhere

The latest MLPerf test results for the 5th Gen Xeon Scalable processors showcase the progress Intel and its ecosystem partners have made in improving generative AI performance.


Recently, MLCommons announced the results of the MLPerf v4.0 benchmark for AI reasoning. Among them, the fifth-generation Intel® Xeon® Scalable Processor (hereinafter referred to as the "fifth-generation Xeon") with built-in Intel® Advanced Matrix Extensions (Intel® AMX) performed well in the test, further demonstrating Intel's commitment to promoting "AI everywhere" through rich and competitive solutions. As of now, Intel is still the only CPU manufacturer that has submitted MLPerf test results. Compared with the results of the fourth-generation Xeon in the MLPerf reasoning v3.1 benchmark, the test results of the fifth-generation Xeon are an average of 1.42 times higher.


“We continue to improve AI performance across our broad portfolio of CPUs and accelerators in industry benchmarks,” said Zane Ball, Intel vice president and general manager of product management for the Data Center and AI Group. “This new MLCommons result shows that the AI ​​solutions we offer are able to meet the evolving and diverse AI needs of our customers. At the same time, Intel Xeon processors provide customers with a cost-effective option for rapid AI deployment.”


Intel products have demonstrated leading training and inference performance in multiple rounds of MLPerf benchmark tests to date. The test results have also established an industry standard for customers to evaluate the AI ​​performance of products.


Test results for the fifth-generation Xeon:


Compared with the performance of the fourth-generation Xeon in the MLPerf inference v3.1 performance benchmark, the fifth-generation Xeon, which has been optimized by hardware and software, has an average performance improvement of 1.42 times. Among them, for the GPT-J model with software optimizations such as continuous batching, the performance of the fifth-generation Xeon has increased by about 1.8 times compared with the test results of v3.1; similarly, thanks to MergedEmbeddingBag and other optimizations based on Intel AMX, the test results of DLRMv2 show a performance improvement of about 1.8 times and an accuracy of 99.9%.


image.png

5th Generation Intel® Xeon® Scalable Processors


At the same time, Intel is very proud to work with a wide range of OEM partners including Cisco, Dell, Quanta, Supermicro and Wiwynn to help them submit MLPerf test results based on their own products. Intel not only began submitting test results based on the fourth-generation Xeon in 2020, but the Xeon Scalable processor is also the host CPU for many accelerators in the products participating in MLPerf testing.


In addition, the 5th Generation Xeon can be evaluated on the Intel® Developer Cloud, where users can perform small and large-scale AI training (such as large language models or generative AI), run large-scale inference workloads, and manage AI computing resources.


Notes: Workload and related configuration descriptions. Results may vary.


Reference address:Latest MLCommons results announced: Sharing AI achievements of the fifth-generation Xeon Scalable Processor

Previous article:The fastest fiber optic data transmission to date reaches 301TB/s, which is 4.5 million times the average broadband speed in the UK
Next article:Zhejiang Mobile, Qualcomm and ZTE jointly completed the global commercial debut of 5G-A downlink three-carrier aggregation + 1024QAM, achieving a breakthrough in single-user rate

Recommended ReadingLatest update time:2024-11-16 09:43

Infineon launches CoolSiC MOSFET 400 V, redefining power density and efficiency of AI server power supplies
Munich, Germany, June 25, 2024 - As artificial intelligence (AI) processors become increasingly power-hungry, server power supplies (PSUs) must deliver more power without exceeding the size of server racks, primarily because of the surge in energy requirements of advanced GPUs. By the end of this decade, eac
[Power Management]
Infineon launches CoolSiC MOSFET 400 V, redefining power density and efficiency of AI server power supplies
Industry丨The era of AI-defined cockpit is coming
With the widespread application of artificial intelligence technology , its enabling role in various industries is becoming increasingly significant. In particular, driven by generative artificial intelligence, the cabin experience is ushering in an unprecedented transformation. In this transformation, the co
[Automotive Electronics]
MediaTek Unveils Next-Generation Chromebook, Smart TV and Display Chips at COMPUTEX 2024, Bringing AI to More Products
June 4, 2024 – MediaTek COMPUTEX showcases advanced AI technologies and innovative applications in a wide range of fields. Vice Chairman and CEO Dr. Li-Hsing Tsai delivered a keynote speech on June 4, talking about how MediaTek’s technology can enable the ubiquitous AI era and continue to change mobile communication
[Home Electronics]
Graphcore Launches IPU Developer Cloud to Solve the World’s Toughest AI Problems
Graphcore officially released the IPU-based developer cloud, which is free for Chinese customers, universities, research institutions and individual researchers to use, so that cutting-edge machine intelligence innovators can easily obtain IPU for cloud training and reasoning of cutting-edge AI models, thereby achievi
[Internet of Things]
Graphcore Launches IPU Developer Cloud to Solve the World’s Toughest AI Problems
The United States suddenly upgraded its AI chip export ban, and Nvidia, ASML, Biren Technology, and Moore Thread responded
According to foreign media reports, on October 17, local time in the United States, the Biden administration stated that it plans to stop exporting to China more advanced AI chips designed by companies such as Nvidia, and will impose broader restrictions on advanced chips and chip manufacturing tools. Expanded to more
[Semiconductor design/manufacturing]
The United States suddenly upgraded its AI chip export ban, and Nvidia, ASML, Biren Technology, and Moore Thread responded
New Features in NVIDIA AI Enterprise 2.1
NVIDIA today announced the general availability of NVIDIA AI Enterprise 2.1, the latest version of its end-to-end AI and data analytics software suite optimized and certified to enable enterprises to deploy and scale AI applications across bare metal, virtualized environments, containers and the cloud.
[Network Communication]
New Features in NVIDIA AI Enterprise 2.1
Leapmotor uses BlackBerry QNX to build a new generation of AI smart seats for its new electric SUV
Leapmotor Uses BlackBerry QNX to Build Next-Generation AI Cockpit for Its New Electric SUV Hangzhou, China – November 18, 2021 – BlackBerry today announced that Leapmotor Technology Co., Ltd. (“Leapmotor”), a Chinese technology-based smart electric vehicle brand, has adopted BlackB
[Automotive Electronics]
Leapmotor uses BlackBerry QNX to build a new generation of AI smart seats for its new electric SUV
Tesla's self-developed AI chip improves performance and reduces costs by 20%
(Image source: www.futurecar.com) According to foreign media reports, Tesla's vice president of hardware engineering Pete Bannon recently revealed details about Tesla's future FDS (full self-driving) system Autopilot chipset. Bannon said that the new AI chip is 21 times faster than the current Nvidia chip and cos
[Automotive Electronics]
Tesla's self-developed AI chip improves performance and reduces costs by 20%
Latest Network Communication Articles
Change More Related Popular Components

EEWorld
subscription
account

EEWorld
service
account

Automotive
development
circle

About Us Customer Service Contact Information Datasheet Sitemap LatestNews


Room 1530, 15th Floor, Building B, No.18 Zhongguancun Street, Haidian District, Beijing, Postal Code: 100190 China Telephone: 008610 8235 0740

Copyright © 2005-2024 EEWORLD.com.cn, Inc. All rights reserved 京ICP证060456号 京ICP备10001474号-1 电信业务审批[2006]字第258号函 京公网安备 11010802033920号