From May 30 to June 3, 2020, the 47th International Symposium on Computer Architecture (ISCA) was held online. The team of Professors Wei Shaojun and Liu Leibo from Tsinghua University presented a paper titled "GraphABCD: Scaling Out Graph Analytics with Asynchronous Block Coordinate Descent". The talk was given by Yang Yifan, the first author of the paper.
The talk introduced a method that recasts graph computing problems as optimization problems on a reconfigurable architecture and optimizes the graph computing framework with a block coordinate descent algorithm. The method makes full use of the spatial parallelism of the reconfigurable array and offers a new perspective on optimizing the performance of graph computing frameworks, with significant advantages over traditional methods.
Large-scale graph computing algorithms such as PageRank and collaborative filtering are the basis of big data analysis. To solve large-scale graph computing problems efficiently, graph computing frameworks optimize the performance of various graph algorithms for specific computing architectures. However, when optimizing iterative graph algorithms, existing frameworks focus only on the execution time of a single iteration and rarely consider the number of iterations the algorithm needs to converge. As a result, performance optimization of graph computing algorithms has hit a bottleneck. Unless this bottleneck is overcome, it will severely constrain further improvement of graph computing frameworks and limit the development of fields such as big data analysis.
Image source: Department of Micro-Nano Electronics, Tsinghua University
To address this problem, the team of Wei Shaojun and Liu Leibo proposed a block coordinate descent execution model for graph computing frameworks, which optimizes both the number of iterations and the time per iteration of a graph algorithm.
The root limitation of existing graph computing frameworks is that they adopt a globally synchronous parallel execution model: every iteration of the graph computation ends with a barrier for global synchronization. This model not only limits the scalability of the framework, but also makes it impossible to dynamically reduce, during execution, the number of iterations the algorithm needs to converge. This study transforms the graph computing problem into an optimization problem and, for the first time, introduces the block coordinate descent method from optimization into a graph computing framework.
Under the block coordinate descent execution model, the iterative process of a graph algorithm no longer relies on global synchronization. Instead, each iteration selects one or more data blocks, each consisting of a subgraph, and updates them according to the coordinate descent method until the algorithm converges. By analyzing how block coordinate descent parameters such as data block size, selection order, and update method affect convergence speed, the study systematically reduces the number of iterations required for convergence. At the same time, because multiple data blocks need no synchronization with one another, they can be executed asynchronously and concurrently.
The study extends the graph computing framework to reconfigurable chips in an asynchronous execution mode and uses heterogeneous reconfigurable computing resources to reduce the execution time of a single iteration. Experimental results show that, on important graph algorithms such as single-source shortest path, PageRank, and collaborative filtering, the convergence rate improves by 4.8 times and performance by 2 times relative to the current mainstream graph computing framework.
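The contrast between the two execution models can be illustrated with a small sketch. This is not the GraphABCD implementation. It is a sequential Python toy, assuming PageRank with a damping factor of 0.85, a hypothetical four-vertex graph with no dangling vertices, and a fixed cyclic block order. The function names `pagerank_sync` and `pagerank_block` and all parameters are illustrative assumptions. The block-wise version uses a block Gauss-Seidel style update as a stand-in for the "select a block, update it with the latest values" pattern; it does not model asynchronous hardware or the paper's block selection policies.

```python
def pagerank_sync(in_nbrs, out_deg, d=0.85, tol=1e-12, max_sweeps=1000):
    """Globally synchronous baseline: every vertex is recomputed from the
    previous sweep's values, then all new values are swapped in at once."""
    n = len(out_deg)
    x = [1.0 / n] * n
    for sweep in range(1, max_sweeps + 1):
        new = [(1 - d) / n + d * sum(x[u] / out_deg[u] for u in in_nbrs[v])
               for v in range(n)]
        delta = max(abs(a - b) for a, b in zip(new, x))
        x = new  # plays the role of the global barrier between iterations
        if delta < tol:
            return x, sweep
    return x, max_sweeps


def pagerank_block(in_nbrs, out_deg, block_size=2, d=0.85, tol=1e-12,
                   max_sweeps=1000):
    """Block-wise update: one block of vertices is updated at a time, and
    later blocks immediately see the values written by earlier blocks."""
    n = len(out_deg)
    x = [1.0 / n] * n
    # Assumption: contiguous vertex ranges as blocks, visited in cyclic order.
    blocks = [range(i, min(i + block_size, n)) for i in range(0, n, block_size)]
    for sweep in range(1, max_sweeps + 1):
        delta = 0.0
        for block in blocks:
            # Vertices inside one block are computed from the same snapshot.
            updates = [(1 - d) / n + d * sum(x[u] / out_deg[u] for u in in_nbrs[v])
                       for v in block]
            for v, val in zip(block, updates):
                delta = max(delta, abs(val - x[v]))
                x[v] = val  # visible to the next block without a barrier
        if delta < tol:
            return x, sweep
    return x, max_sweeps


# Hypothetical graph: edges 0->1, 0->2, 1->2, 1->3, 2->0, 3->0, 3->2.
IN_NBRS = [[2, 3], [0], [0, 1, 3], [1]]
OUT_DEG = [2, 2, 1, 2]

if __name__ == "__main__":
    ranks_sync, sweeps_sync = pagerank_sync(IN_NBRS, OUT_DEG)
    ranks_block, sweeps_block = pagerank_block(IN_NBRS, OUT_DEG)
    print("synchronous:", sweeps_sync, "sweeps", ranks_sync)
    print("block-wise: ", sweeps_block, "sweeps", ranks_block)
```

Both functions reach the same ranks. What differs is when a newly computed value becomes visible to other vertices. Varying `block_size` or the order of `blocks` is a simple way to see how these parameters affect the number of sweeps on a given graph.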