October 16, 2024 - At the SEMiBAY2024 "HBM and Memory Technology and Application Forum", Xiong Dapeng, founder, chairman and CEO of Yizhu Technology, delivered a speech entitled "Beyond the Limits: Technical Challenges and Solutions Facing High-Computing Power Chips".
Dr. Xiong Dapeng argued that large AI models are pushing demand for computing power to an inflection point, and that hardware architecture will become one of the key paths to meeting that demand. Future growth in computing power, he said, will be centered on storage units.
Opportunities and Challenges in the Era of Big Models
In the era of large AI models, model capabilities have improved significantly as data, computing power, and parameter counts keep growing. Dr. Xiong Dapeng pointed out that large models have moved from quantitative change to qualitative change: once a model is large enough, emergent capabilities similar to human "enlightenment" appear, and its reasoning ability improves markedly. This shift suggests that the last mile of AI application is about to open up, and that commercial deployment will drive demand for AI computing power to an inflection point.
Omdia's latest report, "Cloud Computing and Data Center Artificial Intelligence Processor Forecast", shows that the market size of GPUs and other acceleration chips used for cloud computing and data center artificial intelligence has grown from less than $10 billion in 2022 to $78 billion in 2024, and is expected to reach $151 billion by 2029. However, the market may see a clear inflection point in 2026, and the growth momentum will shift from technology adoption to changes in demand for artificial intelligence applications.
In addition, IDC predicts that future AI servers will focus on improving computing power and processing efficiency (energy efficiency ratio) to handle more complex and larger-scale AI applications. By 2027, inference is expected to account for 72.6% of AI computing power, and in the longer term the split is expected to reach 95% for inference and 5% for training.
Application implementation requires hardware architecture breakthroughs
However, the performance of existing chip hardware is no longer improving fast enough to meet the rapidly growing computing power requirements of algorithm models. Moore's Law, the golden rule that once guided the development of the semiconductor industry, now faces unprecedented challenges. A report from the Economic Research Institute of Guosen Securities pointed out that the parameter scale of large models grows 35-fold every 18 months, while chips following Moore's Law improve only 2-fold over the same period. Exploring and developing new hardware architectures has therefore become one of the key paths to a breakthrough in computing power.
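To give a sense of how quickly that gap widens, the following is a minimal sketch that compounds the two growth rates quoted in the report. It assumes both rates hold constant and compound over successive 18-month periods, which the report as cited here does not state; the function name `growth_after` is a hypothetical helper introduced only for illustration.

```python
def growth_after(months: float, factor_per_period: float,
                 period_months: float = 18.0) -> float:
    """Cumulative growth after `months`, assuming a constant growth
    factor per period (an assumption made for this illustration)."""
    return factor_per_period ** (months / period_months)


# Figures quoted in the article: 35x for model parameters, 2x for chips.
MODEL_FACTOR = 35.0
CHIP_FACTOR = 2.0

for months in (18, 36, 54):
    model = growth_after(months, MODEL_FACTOR)
    chip = growth_after(months, CHIP_FACTOR)
    print(f"{months} months: models x{model:,.0f}, "
          f"chips x{chip:,.0f}, gap x{model / chip:,.1f}")
```

Under these assumptions the gap itself grows by a factor of 17.5 every 18 months, which is why incremental process scaling alone cannot close it.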
Dr. Xiong Dapeng emphasized that under existing hardware architectures, AI chips face the "three walls" problem: the storage wall, the energy consumption wall, and the compilation wall. The storage wall refers to memory access speed failing to keep up with the processing speed of the computing units, which creates a performance bottleneck.
The storage wall in turn gives rise to the energy consumption wall and the compilation wall. The energy consumption wall means that as chip performance improves, power consumption and heat dissipation become the main factors limiting further gains. The compilation wall means that as AI models grow more complex, the volume of data and computing tasks the compiler must handle increases dramatically, which makes static compilation optimization very difficult and manual optimization costly and time-consuming.
Storage and computing integration opens up the second growth curve of computing power
To address this challenge, Yizhu Technology adopted a new chip design concept, the "storage-computing integrated super-heterogeneous" architecture, which it says greatly reduces data transfer latency and improves overall computing efficiency and energy efficiency.
Dr. Xiong Dapeng pointed out that breaking the "three walls" of AI chips requires starting from the first principle of computing power (Amdahl's law) and significantly reducing the amount of data moved, so that the F value approaches 0 and effective computing power density grows linearly. The industry currently has two main solutions: in-memory computing and near-memory computing.
In-memory computing integrates storage and computing functions to reduce data transfer latency and improve performance and energy efficiency. In an ideal state, F=0, which enables seamless integration of storage and computing. Near-memory computing integrates storage units and computing units through advanced packaging to increase memory access bandwidth, reduce data transfer latency, and improve overall computing efficiency.
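The Amdahl's-law argument above can be illustrated with a short sketch. The article does not define F explicitly; the code assumes F is the fraction of execution time spent on data movement that additional computing units cannot speed up, and uses the standard form of Amdahl's law, speedup = 1 / (F + (1 - F) / N). The function name `amdahl_speedup` and the sample values of F and N are hypothetical and chosen only for illustration.

```python
def amdahl_speedup(f: float, n: float) -> float:
    """Amdahl's-law speedup with n-fold more compute, where f is the
    fraction of time that does not benefit from it (assumed here to be
    data movement; the article does not define F precisely)."""
    if not 0.0 <= f <= 1.0:
        raise ValueError("f must be in [0, 1]")
    if n <= 0:
        raise ValueError("n must be positive")
    return 1.0 / (f + (1.0 - f) / n)


# Compare how effective speedup scales with compute for several F values.
for f in (0.5, 0.1, 0.01, 0.0):
    row = ", ".join(
        f"N={n}: {amdahl_speedup(f, n):.1f}x" for n in (10, 100, 1000)
    )
    print(f"F={f}: {row}")
```

With F = 0 the speedup equals N, which is the linear growth the talk refers to. For any F above 0 the speedup is capped at 1/F no matter how much compute is added, so reducing data movement matters more than adding computing units once F dominates.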
Dr. Xiong Dapeng emphasized that storage-computing integrated architectures and related technologies can break through the bottleneck of traditional computing models and raise the ceiling on effective computing power. In his view, the era centered on computing units is coming to an end, and the second growth curve of computing power will be centered on storage units.
Conclusion
Dr. Xiong Dapeng said that since its establishment, Yizhu Technology has been committed to using storage-computing integration to offer a new path for high-computing-power AI chips, one that is more cost-effective, more energy-efficient, and has more room for computing power growth. In March 2023, in response to the AI computing challenges posed by large models such as ChatGPT, Yizhu Technology first proposed the "storage-computing integrated super-heterogeneous" concept, providing a new approach to developing high-computing-power AI chips in the era of large models.
As AI technology continues to advance, demand for computing power keeps growing. Yizhu Technology aims to offer a new direction for AI chip development through its storage-computing integrated architecture, and expects its technology and products to support the development of AI and help move the industry forward. As the technology matures and its applications expand, the company expects AI chip technology to enter a new stage of development.