As the speed gap between ultra-fast CPU cores and relatively slow memories continues to grow, memory systems may become the main bottleneck restricting system performance today. In addition, low power consumption is another important design consideration, especially with the increasing number of battery-powered devices. Low power consumption means longer battery life and device usage time. In common applications, memory power consumption usually accounts for a considerable part of the application processor power consumption, and as memory designs become more complex, larger in capacity, and more layered, memory power consumption tends to increase rapidly. Therefore, reducing memory power consumption is very beneficial to extending battery life. In order to better understand the inherent behavior of various applications, it is necessary to explore the characteristics of memory and establish memory models to determine whether the application involves frequent memory access operations and even help predict the performance of the application.
This paper provides a simple and economical method to dynamically characterize the computation and memory composition of an application with acceptable accuracy.
Methods for describing memory characteristics
If no memory operations are involved, then CPU utilization should scale linearly with the CPU core frequency, and the application cost (defined as the product of CPU utilization and CPU frequency) should remain constant. However, after taking memory accesses into account, CPU utilization is no longer linear with core frequency. At higher frequencies, memory tends to have a greater impact on performance because the CPU has to spend more CPU cycles waiting for memory responses (here we assume that memory frequency does not vary with CPU frequency). In this sense, applications can be divided into two types: compute-bound and memory-bound.
Next, we will describe how to characterize memory characteristics using three different methods and help determine the CPU utilization of the application. Here, hardware performance information is collected by looking at the Performance Monitoring Unit (PMU). Therefore, Marvell's method only works on systems with PMU hardware support.
1. Overall data cache miss rate: Intuitively, a higher data cache miss rate means more memory traffic. To obtain the data cache miss rate, we need to monitor the total number of access operations and misses of the L1 data cache and L2 data cache (if any).
2. Main memory access rate: The occupancy rate of the external memory controller directly indicates the utilization rate of the memory. In order to obtain the main memory access rate value, two types of PMU information must be collected: the total number of cycles occupied by the memory controller; the total number of cycles in the monitoring window.
3. Data stall rate: Pipeline stalls are mainly caused by data dependencies, and the reason why data is unavailable is that the memory access speed is much lower than the CPU speed. Therefore, the number of pipeline stalls reflects the memory traffic. In addition, the number of pipeline stalls also indicates the importance of memory access. Not every memory access has a critical impact on the final performance, so it is quite useful to keep track of memory access operations that affect performance due to data dependencies. Using this method, you can monitor the events caused by data dependencies. In addition, the total number of cycles must be recorded to calculate the data stall rate in each window.
Previous article:Design of serial communication between MSP430 microcontroller and PC based on VB6.0
Next article:Tips for STM32 MCUs: How to use IAR to develop STM32
- MathWorks and NXP Collaborate to Launch Model-Based Design Toolbox for Battery Management Systems
- STMicroelectronics' advanced galvanically isolated gate driver STGAP3S provides flexible protection for IGBTs and SiC MOSFETs
- New diaphragm-free solid-state lithium battery technology is launched: the distance between the positive and negative electrodes is less than 0.000001 meters
- [“Source” Observe the Autumn Series] Application and testing of the next generation of semiconductor gallium oxide device photodetectors
- 采用自主设计封装,绝缘电阻显著提高!ROHM开发出更高电压xEV系统的SiC肖特基势垒二极管
- Will GaN replace SiC? PI's disruptive 1700V InnoMux2 is here to demonstrate
- From Isolation to the Third and a Half Generation: Understanding Naxinwei's Gate Driver IC in One Article
- The appeal of 48 V technology: importance, benefits and key factors in system-level applications
- Important breakthrough in recycling of used lithium-ion batteries
- Innolux's intelligent steer-by-wire solution makes cars smarter and safer
- 8051 MCU - Parity Check
- How to efficiently balance the sensitivity of tactile sensing interfaces
- What should I do if the servo motor shakes? What causes the servo motor to shake quickly?
- 【Brushless Motor】Analysis of three-phase BLDC motor and sharing of two popular development boards
- Midea Industrial Technology's subsidiaries Clou Electronics and Hekang New Energy jointly appeared at the Munich Battery Energy Storage Exhibition and Solar Energy Exhibition
- Guoxin Sichen | Application of ferroelectric memory PB85RS2MC in power battery management, with a capacity of 2M
- Analysis of common faults of frequency converter
- In a head-on competition with Qualcomm, what kind of cockpit products has Intel come up with?
- Dalian Rongke's all-vanadium liquid flow battery energy storage equipment industrialization project has entered the sprint stage before production
- Allegro MicroSystems Introduces Advanced Magnetic and Inductive Position Sensing Solutions at Electronica 2024
- Car key in the left hand, liveness detection radar in the right hand, UWB is imperative for cars!
- After a decade of rapid development, domestic CIS has entered the market
- Aegis Dagger Battery + Thor EM-i Super Hybrid, Geely New Energy has thrown out two "king bombs"
- A brief discussion on functional safety - fault, error, and failure
- In the smart car 2.0 cycle, these core industry chains are facing major opportunities!
- The United States and Japan are developing new batteries. CATL faces challenges? How should China's new energy battery industry respond?
- Murata launches high-precision 6-axis inertial sensor for automobiles
- Ford patents pre-charge alarm to help save costs and respond to emergencies
- New real-time microcontroller system from Texas Instruments enables smarter processing in automotive and industrial applications
- Using FSMC bus for inter-chip communication
- I'm a newbie. I don't understand. I'm asking for help on the problem I encountered when making a spi receiving module in Verilog.
- How feasible is it for the United States to skip 5G and go directly to 6G?
- 77 GHz Radar Solutions
- Showing Goods (7) - Open Source Development Board
- New version of IAR installation problems and solutions
- arm assembly instructions WFI and WFE
- IMX6 EK200 RTL8188EUS WIFI AP
- Methods for supporting national encryption algorithms in embedded devices
- Discussion: 5G and the Internet of Things, have they moved your cheese?