1 Analysis of low power consumption technology
Table 1 gives a low power consumption technology analysis table. As can be seen from Table 1, as the channel width decreases, the dynamic power consumption and static power consumption per unit area are increasing.
The power consumption of the chip can be described as:
In the formula: CeffVdd2fclock is the dynamic power consumption part. Where a is the flip rate at the current frequency; Ceff is the node load capacitance; Vdd is the operating voltage; fclock is the operating frequency. IleakVd is the static power consumption part, where Ileak is the leakage current. From formula (1), we can know the parameters that need to be reduced to reduce chip power consumption.
1.1 Means of reducing dynamic power consumption
1.1.1 Reducing α
There are two ways to reduce α: one is to reduce α by optimizing the logic structure through tools; the other is to achieve low α through coding, such as using flip codes. In fact, assuming that each flip is effective and optimal, afclock can be regarded as a constant, but the actual situation is not the case. There is often redundancy in the design driven by each clock. At the same time, for a certain rated upper-level task itself, it may not be suitable for software and hardware division. For fclock, if the module is not used, the module can be directly gated. There are three means of this gate:
(1) Gating at the clock generation end, configured by software. This means requires the design of such functions at the front end, including positive clock gated and reverse clock gated, and its structure is symmetrical. In fact, when designing, the device lib will provide a standard gated unit, which makes the front-end design easier.
(2) Perform hardware judgment in the module and use the gated clock. For example, there is a memory on the AHB bus as an AHB slave. Since the software frequently accesses the module, frequent gated operation will result in discontinuous operation. If the module is designed internally, the HSEL signal of the AHB becomes high, and the next clock is turned on inside the module, which can save the power consumption of the clock flip. Especially for memory, the power consumption of clock flip and non-flip is quite different.
(3) Use synthesis tools to add gates at the near end without designing at the front end.
In theory, a simple frequency reduction will not bring about a change in power consumption, because the workload is fixed, and a frequency reduction can only bring about an increase in running time. However, the power consumption of the clock tree accounts for almost 30% of the chip power consumption, so when the frequency is appropriately reduced, the power consumption of the clock tree will be reduced.
1.1.2 Reduce Ceff
The reduction of Ceff varies greatly depending on the process selection. Therefore, choosing a suitable process is more conducive to reducing Vdd, which can reduce power consumption in a square relationship. However, due to cost, reliability and business considerations, only one process can be selected, such as the 130 nm process, and the voltage can be changed through DVFS. Its core is: (1) The library under a certain process can work normally within a certain voltage range.
(2) Since the operating frequency required for modules or systems working under different tasks is different, the benefits of DVFS can be calculated. Suppose a system can perform MP3 or MP4 decoding tasks. In this way, when MP3 decoding, the required frequency is only 100 MHz; when MP4 decoding, the required frequency is 200 MHz. Through STA analysis, when working at a voltage of 1.1 V, the system can operate at a frequency of 100 MHz; when working at a voltage of 1.3 V, the system can operate at a frequency of 200 MHz. In this way, DVFS technology can be used for adjustment. Assuming that the flip rate and capacitance have not changed, the required power consumption in the two different working modes is reduced by 64%. Of course, the previous values are all hypothetical, and the actual situation is not so ideal.
1.2 Means of reducing static power consumption
To reduce static power consumption, two methods, Multi-Vdd and Multi-Vth, are available, which will not be described in detail here.
2 DVFS system
If DVFS is based on the needs of the CPU's own OS scheduling, the voltage will only change when the frequency itself needs to change. At this time, it can be considered an open-loop DVFS technology. For example, the OEMidle process in Windows Mobile provides a way to adjust the CPU frequency and voltage according to the CPU occupancy rate. However, when using the open-loop adjustment method, sufficient margin is required, and software, especially the operating system, needs to support it, which is not transparent to the software.
For a closed-loop system, a performance monitor is required to monitor performance and directly adjust voltage and frequency according to performance changes. Figure 1 shows a simple adaptive DVFS system.
In this system, the CPU is a power do-main with variable voltage, called CPU-subsys. However, for other modules, there is another power domain, called peri_subsys, which includes external memory interface (EMI), media coprocessor (MCP), LCD controller (LCDC), and PerRFormaneeMonitor (PM) module related to voltage control, which is used to monitor chip performance positively; The Power Controller (PC) module is used to calculate the control parameters after receiving the performance description of PM and pass it to the Power Supply (PS) module to provide variable voltage Vdd_arm. At the same time, there is a Level shifter between armsubsys and peri subsys.
For PM module, ARM can be configured through the bus, and PM realizes performance monitoring by monitoring the current in the variable voltage area. For operations with high MIPS requirements, the CPU idle time becomes less and the current demand becomes smaller; for operations with low MIPS requirements, the CPU performs intensive operations and the current demand becomes larger. [page]
The core of this design is how to make the PM module adaptively predict the current demand according to a certain algorithm, and the predicted response time and additional power consumption are relatively small, that is, to achieve the timely and appropriate voltage requirements. For the adaptive algorithm, the simple forward linear prediction shown in Figure 2 can be selected.
3 Simulation Experiment and Results
Figure 3 shows the system model. Such a system is constructed so that the test will be based on the given benchmark program that was run on the development board in advance. The power consumption parameters obtained from the test are converted into normalized nop and mac instruction programs according to the CPU load. These two types of instructions are distributed in the middle of the test vector. The CPU behavior model executes the relevant program. The model can only fetch instructions and execute 2-stage pipelines. For nop operations, nop is performed in the execution phase; for mac operations, mac is performed on fixed data in the execution phase, which can simplify the design. The CPU BM is written in Verilog. The CPU has an AHB bus to control access to memory. The MEM module uses the ahb interface to store compiled binary instructions and fix the frequency. The PM Model monitors the flip rate of the CPU BM. After monitoring the flip rate of each stage, it flows into the adaptive filter as input, calculates the required regulation voltage, and gives it to the PS Model; at the same time, it outputs the flip rate to the PC Model.
The PC Model uses the flip rate, clock, and voltage as inputs to calculate the system power consumption. The PS Model adjusts the voltage and frequency according to the voltage adjustment instructions issued by the PM. Since it is an rtl Model, the voltage adjustment is invisible. It is just based on the actual situation. If the voltage goes from low to high, the voltage is adjusted first, and then the frequency is adjusted; vice versa.
For the adaptively selected voltage, it can be implemented as shown in Figure 4. Table 2 shows the relationship between the CPU and bus frequencies when the voltage is implemented according to the 130 nm process. When adjusting the voltage, the clock is paused for several clock cycles. Assuming that the RC parameters of the power supply network remain unchanged, it is considered that the voltage switching is proportional to the switching voltage difference, as shown in Figure 4.
As shown in Figure 5, there is a certain relationship between power consumption, efficiency and adjustment step size. After selecting the adjustment step size reasonably, the efficiency and power consumption can be balanced. When the step size is 25 ms, the power consumption is less than 25% of DVFS, and the efficiency loss is only 1/3. It can be seen that when the total load utilization of CPU resources is 30%, the step size is relatively reasonable.
4 Conclusion
An adaptive dynamic voltage and frequency adjustment method is provided, and a corresponding system model is constructed. The model is simulated on a computer and a set of balanced forward prediction parameters is obtained. The experimental results verify the effectiveness of the adaptive dynamic voltage and frequency adjustment method and provide an effective way to evaluate the dynamic voltage and frequency adjustment simulation.
Previous article:Embedded Applications' Requirements for Microprocessors
Next article:A brief analysis of the design ideas of embedded DVR in financial monitoring
Recommended ReadingLatest update time:2024-11-17 00:19
- Popular Resources
- Popular amplifiers
- Siemens PLC Programming Technology and Application Cases (Edited by Liu Zhenquan, Wang Hanzhi, Yang Kun, etc.)
- Siemens PLC from Beginner to Mastery with Color Illustrations (Yang Rui)
- Experience and skills in using Siemens S7-200PLC (Shang Baoxing)
- Siemens S7-1200-PLC Programming and Application Tutorial (3rd Edition) (Edited by Shi Shouyong)
Professor at Beihang University, dedicated to promoting microcontrollers and embedded systems for over 20 years.
- Innolux's intelligent steer-by-wire solution makes cars smarter and safer
- 8051 MCU - Parity Check
- How to efficiently balance the sensitivity of tactile sensing interfaces
- What should I do if the servo motor shakes? What causes the servo motor to shake quickly?
- 【Brushless Motor】Analysis of three-phase BLDC motor and sharing of two popular development boards
- Midea Industrial Technology's subsidiaries Clou Electronics and Hekang New Energy jointly appeared at the Munich Battery Energy Storage Exhibition and Solar Energy Exhibition
- Guoxin Sichen | Application of ferroelectric memory PB85RS2MC in power battery management, with a capacity of 2M
- Analysis of common faults of frequency converter
- In a head-on competition with Qualcomm, what kind of cockpit products has Intel come up with?
- Dalian Rongke's all-vanadium liquid flow battery energy storage equipment industrialization project has entered the sprint stage before production
- Allegro MicroSystems Introduces Advanced Magnetic and Inductive Position Sensing Solutions at Electronica 2024
- Car key in the left hand, liveness detection radar in the right hand, UWB is imperative for cars!
- After a decade of rapid development, domestic CIS has entered the market
- Aegis Dagger Battery + Thor EM-i Super Hybrid, Geely New Energy has thrown out two "king bombs"
- A brief discussion on functional safety - fault, error, and failure
- In the smart car 2.0 cycle, these core industry chains are facing major opportunities!
- The United States and Japan are developing new batteries. CATL faces challenges? How should China's new energy battery industry respond?
- Murata launches high-precision 6-axis inertial sensor for automobiles
- Ford patents pre-charge alarm to help save costs and respond to emergencies
- New real-time microcontroller system from Texas Instruments enables smarter processing in automotive and industrial applications
- ST MEMS Creative Competition Post 9 - Discussion on issues related to the use of unico
- "Dating Spring" + Spring Scenery in Front of the Door
- Keyboard Man Terminator: Automatic Counterattack Keyboard
- [Free trial of Pingtouge Bluetooth Mesh Gateway Development Kit] + Burning W800 Upgrade_Tools1.0 version cannot be erased
- Why is this rectifier bridge connected like this?
- TWS earphones disassembly
- What is the Internet of Things? What are the trends in the Internet of Things?
- Intelligent rehabilitation detection system [Work submission]
- 3G Wireless Communications (W-CDMA, CDMA2000 and TDS-CDMA) Introduction Textbook
- Monocrystalline Appearance Quality Standards: Monocrystalline Solar Panel Quality Standards: Solar Panels: Monocrystalline Silicon Quality Standards