Author: Ivo Bolsens, CTO of Xilinx
The evolution of chip architectures is accelerating, so having the right tools for the right job is critical.
The world is getting smarter. From the smartphones in our pockets to today’s smart cities managing traffic and transportation systems, AI is pervasive in nearly every industry and is impacting our daily lives. AI generates massive amounts of unstructured data that must often be managed and processed in real time. The demand for hardware is skyrocketing, with increasing reliance on innovations in chip architecture to deliver the necessary performance improvements to keep pace.
The continued improvement of Moore's Law requires thousands of engineers, hundreds of companies, and tens of billions of research and development to achieve. The same is true for artificial intelligence chips. This is not a single specification that fits all applications, nor will it be a market dominated by one company or chip architecture.
So, how can your hardware adapt to the growing demands of AI processing? The answer is Domain Specific Architecture (DSA). DSA is the future of computing, where hardware is customized to run specific workloads. DSA closely matches compute, memory bandwidth, data path, and I/O to the specific needs of application workloads. This provides a new level of processing efficiency compared to general-purpose CPU and GPU architectures.
Disadvantages of ASICs
DSAs can be built using specialized silicon, but there can be drawbacks to doing so. First, a key part of what we want is the need for faster, better, cheaper, and sooner. To keep up with the pace of innovation, manufacturers are creating and delivering innovative services in less time than ever before. More specifically, less time than it takes to design and build a new ASIC-based DSA. This creates a market mismatch between the market need for accelerated innovation and the time it takes to design and build an ASIC.
Second, ASICs are hard to make. The complexity of designing advanced node ASICs increases exponentially, greatly increasing the risk. A small mistake can have a huge impact, including at least non-recurring engineering (NRE) costs. For example, at 7nm, the cost of ASICs was hundreds of millions of dollars in NRE alone, and the cost will rise further as device geometries shrink to 5nm and beyond.
Third, ASIC implementations are not future-proof. Changing industry standards or other fluctuating requirements can quickly render the device obsolete. These are just some of the disadvantages of fixed chips.
So how can the industry continue to make architectural advances and build DSAs fast enough to keep up with the pace of innovation? The solution lies in adaptive computing.
The power of adaptive computing
Adaptive computing is based on FPGA technology, which can dynamically build DSAs in chips. Therefore, adaptive computing allows DSAs to be dynamically updated as requirements change. It frees us from long ASIC design cycles and high NRE costs. It supports over-the-air (OTA) updates of not only software but also hardware, which is especially important as processing becomes more distributed. For example, the 2011 Mars Rover Curiosity and the more recently launched Perseverance both include adaptive computing technology.
Perseverance uses adaptive computing for its comprehensive visual processing. It is built using an FPGA-based platform that accelerates AI and non-AI visual tasks, including image correction, filtering, detection, and matching. The images sent back to NASA by Perseverance will be processed using adaptive computing.
If a new algorithm is invented during the eight months, or a hardware bug is discovered, Perseverance cannot be upgraded en route. In this case, adaptive computing allows hardware updates to be sent remotely via air or space. These updates can be done as quickly and easily as software updates. When deployment is remote, such remote hardware updates are not only convenient, but necessary.
Adaptive computing can be deployed from the cloud to the edge to the endpoint, bringing the latest architectural innovations to every part of an end-to-end application. This is made possible by a wide range of adaptive computing platforms - from high-capacity devices on PCIe accelerator cards in the data center to small, low-power devices suitable for the endpoint processing required by IoT devices.
Adaptive computing can be used to build optimized DSAs in all forms, from latency-sensitive applications such as autonomous driving and real-time video streaming to signal processing in 5G and data processing for unstructured databases. With today’s hardware abstraction tools, software and AI developers can now take advantage of their strengths without having to become hardware experts.
Adaptive computing accelerates your entire application
AI inference rarely exists in isolation. It is part of a larger data analysis and processing chain, often with multiple front-end and back-end stages using traditional (non-AI) implementations. The embedded AI portions of these systems can certainly benefit from AI acceleration, but so can the non-AI portions. The flexibility of adaptive computing lends itself to accelerating both AI and non-AI processing tasks. We call this whole application acceleration, and it will become increasingly important as AI permeates more and more applications.
We've discussed the power of adaptive computing, but there are more tricks.
Start a new system
Adaptive computing makes entirely new systems possible that are not possible with other technologies. Not only does it enable rapid architectural innovation, it also enables switching between architectures while the system is running. These new systems deliver significant performance gains, power reductions, and cost reductions that are not possible with fixed chips such as CPUs, GPUs, and ASICs.
For example, in a modern vehicle, there are many cameras, each monitored by software and, increasingly, AI. Processing for the front and rear cameras is typically mutually exclusive—depending on the direction of travel, only one camera can be monitored at a time. Adaptive compute allows them to share processing resources. When moving forward, only the front video stream needs to be processed, and when moving backward, only the rear video stream needs to be processed. When the car moves from “drive” to “reverse,” the hardware is reconfigured to implement the rear camera processing algorithm, which may be different from the front-facing algorithm. This provides higher overall performance, but at the same time reduces power consumption and cost. For the end user, this means more features for less money.
in conclusion
Moore's Law is dead.
However, the spirit of Moore's Law remains.
Advances in adaptive computing and architectures such as DSAs help sustain the faster, better, cheaper quest we have developed. At the same time, adaptive computing meets the market demand for "faster" by eliminating the long design cycles required to build new ASIC-based DSAs. This trend will continue to drive rapid architectural advances, thus perpetuating the spirit of Moore's Law.
Previous article:AMI and Lattice Collaborate to Develop Flexible Firmware Security Solution Platform
Next article:Lattice releases Mach-NX, why does it focus on the security market
- Popular Resources
- Popular amplifiers
- Huawei's Strategic Department Director Gai Gang: The cumulative installed base of open source Euler operating system exceeds 10 million sets
- Analysis of the application of several common contact parts in high-voltage connectors of new energy vehicles
- Wiring harness durability test and contact voltage drop test method
- Sn-doped CuO nanostructure-based ethanol gas sensor for real-time drunk driving detection in vehicles
- Design considerations for automotive battery wiring harness
- Do you know all the various motors commonly used in automotive electronics?
- What are the functions of the Internet of Vehicles? What are the uses and benefits of the Internet of Vehicles?
- Power Inverter - A critical safety system for electric vehicles
- Analysis of the information security mechanism of AUTOSAR, the automotive embedded software framework
Professor at Beihang University, dedicated to promoting microcontrollers and embedded systems for over 20 years.
- LED chemical incompatibility test to see which chemicals LEDs can be used with
- Application of ARM9 hardware coprocessor on WinCE embedded motherboard
- What are the key points for selecting rotor flowmeter?
- LM317 high power charger circuit
- A brief analysis of Embest's application and development of embedded medical devices
- Single-phase RC protection circuit
- stm32 PVD programmable voltage monitor
- Introduction and measurement of edge trigger and level trigger of 51 single chip microcomputer
- Improved design of Linux system software shell protection technology
- What to do if the ABB robot protection device stops
- Keysight Technologies Helps Samsung Electronics Successfully Validate FiRa® 2.0 Safe Distance Measurement Test Case
- Innovation is not limited to Meizhi, Welling will appear at the 2024 China Home Appliance Technology Conference
- Innovation is not limited to Meizhi, Welling will appear at the 2024 China Home Appliance Technology Conference
- Huawei's Strategic Department Director Gai Gang: The cumulative installed base of open source Euler operating system exceeds 10 million sets
- Download from the Internet--ARM Getting Started Notes
- Learn ARM development(22)
- Learn ARM development(21)
- Learn ARM development(20)
- Learn ARM development(19)
- Learn ARM development(14)
- Classic Books on Signal Integrity Design
- How to avoid copper between the pins of AD18 chip
- [ST waterproof pressure sensor LPS27HHW evaluation 1] Hardware preparation and data collection
- Live broadcast at 10 am today [Design considerations for the precision acquisition signal chain of ADI vibration signals]
- How to implement data exchange between FPGA and HPS on DE1-SOC development board? Free. Help can be 200 RMB
- [National Technology N32G430] 6. Porting FreeRtos
- Anti-interference technology for industrial measurement and control systems
- Technical article: The magic of energy monitoring in DC systems
- Please give me some advice on the previous interview questions.
- cc2640 details display