ARM architecture - ARM's development history and product features
Unlike Intel, AMD and other semiconductor companies that integrate design, manufacturing and sales, ARM has never been involved in chip manufacturing and sales. Since its establishment, the company has always been selling the results of chip research and development to major semiconductor manufacturers as an IP (Intelligence Property) provider. After purchasing IP core licenses, major semiconductor manufacturers can conduct in-depth design on this basis, and then manufacture products and bring them to market. This strategy allows ARM to achieve specialization while also creating a huge ARM kingdom. The CPU core of the ARM architecture has the advantages of low power consumption, low cost, and high performance, and has won strong support from many semiconductor companies. There are more than 70 large companies that produce ARM architecture embedded processors. Each company has its own set of solutions. Together with the derivative products developed by other manufacturers, the ARM architecture has finally blossomed in the 32-bit embedded market and is extremely widely used.
Among the ARM core family, ARM7, ARM9, ARM10 and ARM11 have the greatest influence and have been adopted by heavyweight semiconductor companies including Intel, Texas Instruments (TI), Qualcomm, Motorola, and Atmel. So far, 80% of the world's GSM/3G mobile phones, 99% of CDMA mobile phones and most PDA products use ARM-based embedded processors.
ARM's leading low-energy consumption characteristics can be said to have played a major role. Embedded applications that were just starting out did not demand high computing performance, but were quite sensitive to chip power consumption. Compared with other solutions of the same period, the energy efficiency ratio of the ARM architecture is very obvious: although the 325MHz ARM1020E processor has the lowest absolute performance, under the premise of consuming the same amount of power, its performance far exceeds that of IBM's PowerPC 750, Motorola MPC7455 (Apple's G4+ processor), PowerPC7400 (Apple's G4 processor) and NEC's VR5500 processor, and its energy efficiency ratio is extremely outstanding.
In addition, the application solution of the ARM architecture is very flexible. Since ARM only provides an efficient and streamlined core, each semiconductor manufacturer can design applications according to their own needs. The architecture is flexible and simple, and has strong scalability. For example, manufacturers can add related instruction sets for multimedia signal processing, or add efficient execution units for Java-related applications, or add 3D graphics coprocessors, etc. High scalability also allows ARM to have many derivative architectures, such as Dargonball, StrongARM and XScale architectures popular in the PDA field, which also belong to the ARM system. The third advantage of ARM that has a decisive influence is that it has a large amount of software support. The three major commonly used systems, including Windows CE, Symbian and Palm OS, are designed based on the ARM architecture. Although handheld devices using embedded Linux operating systems are not very common, they are also based on ARM. At present, ARM has firmly occupied the market of mobile phones, PDAs and other handheld electronic products. These fields pay great attention to software compatibility and design continuity. There is no doubt that ARM will continue to maintain its advantages. There may be many other embedded processors with more excellent performance, but this is not the only factor that manufacturers have to consider. In fact, streamlined hardware core, ultra-low energy consumption, flexible design, software support and rich development tools are all the technical foundations for the success of ARM architecture.
ARM Architecture - Evolution of ARM Core Design
Throughout the history of ARM development, technological innovation and process progress have become two major driving forces, which is well reflected in the development process from ARM7 core to ARM11 core. ARM7 only has a simple three-stage pipeline structure including "pre-fetch (Fetch), decode (Decode) and execute (Execute)" units. When using 0.18 micron process, the core area is less than 0.8 square millimeters (different models of ARM7 have different core areas), and its clock frequency is between 50MHz and 110MHz. The computing power corresponding to each MHz is 0.9 Dhrystone MIPS (Dhrystone: a comprehensive benchmark program that tests the integer performance of the processor; MIPS: millions of instructions per second), and the maximum power consumption is 0.4 milliwatts. The later ARM9 core increased the pipeline to five levels and added two new units: memory access (Memory) and write (Write) - the increase in the processor pipeline is conducive to increasing the operating frequency, but at the same time it will bring about the problem of reduced instruction execution efficiency. The highest frequency of the ARM9 core has been raised to 220MHz, which is twice as high as the ARM7 series. At the same time, the instruction performance of ARM9 is also higher than that of ARM7, reaching 1.05 to 1.1 MIPS per MHz frequency, which should be attributed to the multi-faceted improvements of AMR9 in peripheral design. The high frequency plus high instruction efficiency has made the ARM9 core achieve a very obvious performance improvement, which is also the main reason why it is widely sought after by the industry. However, due to the increase in the number of transistors, the chip area of the ARM9 core has also increased slightly to 1.1 square millimeters, and the power consumption range is between 0.85 and 1 milliwatt per hertz.
In the ARM10 core, the pipeline length is further increased to six levels. It adds a "send (Issue)" unit between the pre-fetch and decode units, and the introduction of the 0.15-micron manufacturing process has made the highest frequency of ARM10 exceed 400MHz. ARM10 uses the more advanced V5T instruction set (all previous ARM7s and most ARM9s are V4T versions), and has made great optimizations to the logical architecture (such as using a return stack to reduce the waiting time when a subroutine returns), which greatly improves execution efficiency. Ultimately, its instruction execution performance has increased to 1.24 Dhrystone MIPS per MHz. The soaring operating frequency coupled with higher instruction performance has greatly improved the overall performance of ARM10. However, the number of transistors in ARM10 has also increased a lot. Fortunately, with the help of the 0.15-micron process, its core area has been successfully controlled at the level of 10 square millimeters (with 64KB cache), and the power consumption level has been maintained at 0.85 milliwatts per hertz. The energy efficiency ratio is basically the same as that of the previous generation ARM9. The overall performance is quite outstanding, and it is not a problem to be widely used.
The ARM11 core was released in October 2002. In order to meet the increasingly stringent performance requirements of PDAs, ARM expanded its pipeline length to 8, including pre-fetch, decode, send, conversion/MAC1, execution/MAC2, memory access/MAC3 and write, which belong to the ARM V6 instruction set architecture. ARM11 uses the most advanced 0.13 micron manufacturing process at the time, and the operating frequency can reach up to 500 to 700MHz. If the 90 nanometer process is used, the operating frequency of the ARM11 core will easily reach 1GHz - for embedded processors, this is obviously a very amazing number. Secondly, the logical core of ARM11 has also undergone a lot of improvements, the most important of which is the "static/dynamic combination conversion prediction function". The execution unit of ARM11 contains a 64-bit, 4-state address conversion buffer, which is mainly used to store the most recently used conversion addresses. When the dynamic conversion prediction mechanism is used and the correct address cannot be found in the address buffer, the static conversion prediction function will immediately take its place. In actual tests, the accuracy of dynamic prediction alone is 88%, and the accuracy of static prediction alone is only 77%. The static/dynamic prediction combination mechanism of ARM11 can achieve a high accuracy of 92%. In order to solve the problem of increased energy consumption caused by high speed, ARM11 introduced an intelligent power management technology called "IEM (Intelligent Energy Manager)", which can dynamically adjust the voltage of the processor according to the task load, thereby effectively reducing its own power consumption. This series of improvements allows ARM11 to continue to improve its energy efficiency ratio. On average, it only consumes 0.6 milliwatts of electricity per hertz frequency (0.8 milliwatts when there is cache), and the processor's highest performance can reach 660 Dhrystone MIPS, far exceeding the previous generation of products.
ARM11 has also been greatly enhanced in terms of extended functions, such as supporting single instruction multiple data (SIMD) instructions, which can allow applications that use specific algorithms to achieve two to three times more efficient execution. It supports JAVA decoding function, which is extremely efficient when running Java programs, which has a very positive significance for platforms where Java programs are popular, such as mobile phones and PDAs. Furthermore, ARM11 can cooperate with "coprocessors" to complete some specified tasks, which also allows it to adapt to more applications. In addition, ARM11 supports "TrustZone" security technology, which can identify the system's security code and data, and ensure that they are safely and effectively run simultaneously in the operating system, which can prevent attacks from viruses and malicious codes without sacrificing any system performance. I believe this has strong practical value for digital devices that focus on information security, such as set-top boxes, new generation smart phones, payment devices, network equipment, and mobile phones.
Previous article:ARM architecture learning-uboot analysis of start.S
Next article:Analysis of x86 architecture and arm architecture processors
Recommended ReadingLatest update time:2024-11-16 13:38
- Popular Resources
- Popular amplifiers
Professor at Beihang University, dedicated to promoting microcontrollers and embedded systems for over 20 years.
- Innolux's intelligent steer-by-wire solution makes cars smarter and safer
- 8051 MCU - Parity Check
- How to efficiently balance the sensitivity of tactile sensing interfaces
- What should I do if the servo motor shakes? What causes the servo motor to shake quickly?
- 【Brushless Motor】Analysis of three-phase BLDC motor and sharing of two popular development boards
- Midea Industrial Technology's subsidiaries Clou Electronics and Hekang New Energy jointly appeared at the Munich Battery Energy Storage Exhibition and Solar Energy Exhibition
- Guoxin Sichen | Application of ferroelectric memory PB85RS2MC in power battery management, with a capacity of 2M
- Analysis of common faults of frequency converter
- In a head-on competition with Qualcomm, what kind of cockpit products has Intel come up with?
- Dalian Rongke's all-vanadium liquid flow battery energy storage equipment industrialization project has entered the sprint stage before production
- Allegro MicroSystems Introduces Advanced Magnetic and Inductive Position Sensing Solutions at Electronica 2024
- Car key in the left hand, liveness detection radar in the right hand, UWB is imperative for cars!
- After a decade of rapid development, domestic CIS has entered the market
- Aegis Dagger Battery + Thor EM-i Super Hybrid, Geely New Energy has thrown out two "king bombs"
- A brief discussion on functional safety - fault, error, and failure
- In the smart car 2.0 cycle, these core industry chains are facing major opportunities!
- The United States and Japan are developing new batteries. CATL faces challenges? How should China's new energy battery industry respond?
- Murata launches high-precision 6-axis inertial sensor for automobiles
- Ford patents pre-charge alarm to help save costs and respond to emergencies
- New real-time microcontroller system from Texas Instruments enables smarter processing in automotive and industrial applications
- [Perf-V Evaluation] Exception (interrupt) handling of Hummingbird E203
- Calculation and setting of baud rate of msp430 microcontroller
- RSL10 rich board set!
- What challenges do you face in powering handling and harvesting robots? Let’s talk with Vicor engineers!
- What is the continuity threshold used for? Analysis of the wiring harness tester function
- Does DDR design require back drilling?
- [Integrated subway security control system based on Raspberry Pi 400] Material unboxing - Raspberry Pi 400
- SinA332.0 development board Linux & Qt adjust VGA resolution to 1280x720
- Single chip microcomputer CPU card programming system main frequency setting
- Award-winning live broadcast: Microchip's trusted platform for the CryptoAuthentication family