Alibaba Cloud launches FeiTian intelligent computing platform, which can improve AI training efficiency by 11 times
On August 30, Alibaba Cloud announced the official launch of the full-stack intelligent computing solution "Feitian Intelligent Computing Platform" and launched two ultra-large-scale intelligent computing centers, providing powerful intelligent computing services for various scientific research, public services and corporate institutions in both public cloud and private cloud modes. It can increase the utilization rate of computing resources by more than 3 times, AI training efficiency by 11 times, and reasoning efficiency by 6 times.
The FeiTian intelligent computing platform has been widely used within Alibaba, supporting the development of cutting-edge AI and e-commerce intelligent technologies at the DAMO Academy. It has also served institutions and companies such as Xiaopeng Motors, Deepin Technology, SAIC Motor, China Meteorological Administration, and China Southern Power Grid, supporting industries such as autonomous driving, new drug research and development, weather forecasting, and industrial energy to significantly improve AI training efficiency.
It is understood that the platform provides an overall solution of integrated computing power and big data AI integrated platform based on Alibaba Cloud Panjiu infrastructure. It can run on servers with various chip types such as X86, GPU, ARM, etc., realizing "one cloud, multiple cores" and achieving 90% kilocalorie parallel computing efficiency with up to 10 times IO optimization and 5 times communication performance optimization.
In terms of green technology, FeiTian Intelligent Computing reduces carbon emissions per unit computing power in five aspects: technical emission reduction, energy structure optimization, regional layout optimization, supply chain carbon reduction, and resource utilization optimization. In terms of technical emission reduction, energy consumption is reduced through liquid cooling, power supply technology, and intelligent operation and maintenance, with the lowest PUE reaching 1.09.
At the same time, developers can perform data storage, data governance, data analysis, model development, model training and reasoning on the platform. It also provides pre-trained models, as well as model capabilities in the fields of speech, image, natural language processing, decision-making, etc., to facilitate developers to better accelerate the development of AI applications.
Currently, the platform is supporting the construction of two ultra-large-scale intelligent computing centers. Among them, the Zhangbei Intelligent Computing Center has a construction scale of 12 EFLOPS (120 billion floating-point operations per second) AI computing power, which will exceed Google's 9 EFLOPS and Tesla's 1.8 EFLOPS, becoming the world's largest intelligent computing center. The Ulanqab Intelligent Computing Center has a construction scale of 3 EFLOPS (300 billion floating-point operations per second) AI computing power and is located in the Inner Mongolia hub of "Eastern Data and Western Computing".
Cai Yinghua, President of Global Sales at Alibaba Cloud Intelligence, said that intelligent computing is not only about scale, but also needs to be green, efficient and have industrial practice. Computing is a huge and complex system. Without systematic core technical capabilities, piling up hardware will not produce computing power, let alone bring actual industrial value.
It is understood that intelligent computing is different from general computing. It requires massive data to train AI models, and computing power is lost in data migration and synchronization. The minimum computing power output of 1,000 calories is often only about 40%. This leads to high costs for intelligent computing and restricts the development of the industry. Alibaba Cloud has changed the problem of intelligent computing loss through systematic technological innovation, and increased the efficiency of 1,000 calories of parallel computing to more than 90%.
For example, in terms of communication technology, Alibaba Cloud uses a high-performance self-developed Solar-RDMA network to achieve an end-to-end minimum latency of 2 microseconds. Combined with Alibaba Cloud's self-developed non-blocking communication technology, the data exchange speed in the computing process is increased by up to 5 times. At the same time, the application of green technologies such as natural air cooling and liquid cooling reduces the energy consumption of the intelligent computing center, with a PUE as low as 1.09.
At the AI development layer, Alibaba Cloud provides a big data + AI integrated platform to support the entire development and operation process. In particular, in the model training phase, it provides a distributed training framework that can automatically combine and optimize distributed strategies, increasing training efficiency by more than 11 times. In addition, Alibaba Cloud provides users with a one-stop general reasoning optimization tool that can perform operations such as quantization, pruning, sparsification, and distillation on algorithm models, which can increase reasoning efficiency by more than 6 times.
Not long ago, Xiaopeng Motors built the "Fuyao" intelligent computing center in Ulanqab based on Feitian Intelligent Computing, with a computing power of 600PFLOPS, which is the largest intelligent computing center for autonomous driving in China, and has accelerated the training of autonomous driving models by nearly 170 times. Based on Feitian Intelligent Computing, Haomo Auto has achieved a 128-card parallel efficiency of over 96%, reducing the cost of autonomous driving model training by 62%, increasing the training speed by 110%, and significantly shortening the model iteration cycle.
In the field of life sciences, after using the FeiTian intelligent computing platform, Deepin Technology has improved cluster performance optimization by more than 100% , and increased the efficiency of molecular dynamics simulation training by 5 times . In the industrial field, Zhiji Auto has used high-performance computing to improve the efficiency of industrial simulation by 25% , and the efficiency of intelligent driving training by 70%, accelerating the development and launch of new models. Shandong Dezhou Electric Power uses AI to conduct review and prediction with an accuracy rate of 98% , and the time taken is reduced from 1 hour to a few minutes .
In the field of urban governance, Sichuan Chengyi Expressway has reduced the accident rate by 60% through vehicle-road collaborative optimization using digital twins. Chongqing Water Affairs has achieved a 95% accuracy in water conservancy dispatch forecasting through remote sensing data and simulation deduction; China Southern Power Grid and China Meteorological Administration have used intelligent computing capabilities to improve the accuracy and stability of weather forecasts.
In addition, FeiTian Intelligent Computing also supports Alibaba's artificial intelligence practice, supporting Alibaba AI's 1 trillion calls per day and serving 1 billion people worldwide. Among them, the training speed of Pailitao has increased by 200 times, and the full training time of 1 billion pictures has been shortened from 2.5 months to 8 hours. The DAMO Academy's large model M6 only uses 512 GPUs and completes the training of a 10 trillion parameter model in 10 days, with an energy consumption of only 1% of GPT-3 with the same parameter scale.
Previous article:Alibaba Cloud launches super intelligent computing center with total computing power reaching 12 EFLOPS
Next article:Tianshu Zhixin launched DeepSpark, an open platform for top 100 applications, making it easier to choose computing power
Recommended ReadingLatest update time:2024-11-16 10:33
- Popular Resources
- Popular amplifiers
- Wi-Fi 8 specification is on the way: 2.4/5/6GHz triple-band operation
- Three steps to govern hybrid multicloud environments
- Microchip Accelerates Real-Time Edge AI Deployment with NVIDIA Holoscan Platform
- Keysight Technologies FieldFox handheld analyzer with VDI spread spectrum module to achieve millimeter wave analysis function
- Qualcomm launches its first RISC-V architecture programmable connectivity module QCC74xM, supporting Wi-Fi 6 and other protocols
- Microchip Launches Broadest Portfolio of IGBT 7 Power Devices Designed for Sustainable Development, E-Mobility and Data Center Applications
- Infineon Technologies Launches New High-Performance Microcontroller AURIX™ TC4Dx
- Rambus Announces Industry’s First HBM4 Controller IP to Accelerate Next-Generation AI Workloads
- NXP FRDM platform promotes wireless connectivity
- Innolux's intelligent steer-by-wire solution makes cars smarter and safer
- 8051 MCU - Parity Check
- How to efficiently balance the sensitivity of tactile sensing interfaces
- What should I do if the servo motor shakes? What causes the servo motor to shake quickly?
- 【Brushless Motor】Analysis of three-phase BLDC motor and sharing of two popular development boards
- Midea Industrial Technology's subsidiaries Clou Electronics and Hekang New Energy jointly appeared at the Munich Battery Energy Storage Exhibition and Solar Energy Exhibition
- Guoxin Sichen | Application of ferroelectric memory PB85RS2MC in power battery management, with a capacity of 2M
- Analysis of common faults of frequency converter
- In a head-on competition with Qualcomm, what kind of cockpit products has Intel come up with?
- Dalian Rongke's all-vanadium liquid flow battery energy storage equipment industrialization project has entered the sprint stage before production
- Allegro MicroSystems Introduces Advanced Magnetic and Inductive Position Sensing Solutions at Electronica 2024
- Car key in the left hand, liveness detection radar in the right hand, UWB is imperative for cars!
- After a decade of rapid development, domestic CIS has entered the market
- Aegis Dagger Battery + Thor EM-i Super Hybrid, Geely New Energy has thrown out two "king bombs"
- A brief discussion on functional safety - fault, error, and failure
- In the smart car 2.0 cycle, these core industry chains are facing major opportunities!
- The United States and Japan are developing new batteries. CATL faces challenges? How should China's new energy battery industry respond?
- Murata launches high-precision 6-axis inertial sensor for automobiles
- Ford patents pre-charge alarm to help save costs and respond to emergencies
- New real-time microcontroller system from Texas Instruments enables smarter processing in automotive and industrial applications
- Unboxing the imxm8plus development board from PHYTEC
- EEWORLD University ---- OpenCV 3 with Python 3 Tutorial
- Siemens 230RC opens for 3 seconds and closes for 3 seconds
- [Environmental Expert's Smart Watch] Part 10: Status Light and Mode Switching
- Disassembly of 3.0 expansion dock, PD to HDMI conversion chip Bridgestone PS176HDMQFN48GTR2-B0 schematic diagram reference
- Three issues about LDO load regulation, linear regulation and voltage drop
- How to release the object is part of a locked union
- Lighting capacitor
- [National Technology N32G457 Review] Temperature and atmospheric pressure detection
- 2021 National Electric Competition