Usually when we talk about DSP decoding and overall system performance , we mainly use traditional methods such as SNR, instantaneous error and phase error to make judgments. This article uses the method of psycho-acoustic compression design to examine the performance and performance of DSP decoding, introduces the concept of audio compression decoding based on psycho-acoustics, and gives a DSP performance analysis based on psycho-acoustics.
Since the 1990s, digital signal processing technology has gradually occupied an important position in the consumer audio market. Digital signal processors were originally used to process digitized analog audio signals, that is, PCM data. In the current era of system design, DSP system design solutions based on flexible software design features are an ideal alternative to traditional designs.
In the design of audio systems, the psycho-acoustic model is usually used to remove redundant data in the signal when the signal source is compressed and encoded. The performance of the system can be guaranteed by selecting a DSP with an appropriate number of bits. The selection of DSP in practical applications involves many factors, including accuracy (24-bit/32-bit), main frequency, cost, and memory capacity. This article analyzes the performance of DSP based on the psycho-acoustic model in audio decoding applications.
Relationship between DSP decibel and sound pressure decibel
The data described in the rest of this article are all measured at dBFS, i.e. decibels full scale. From the perspective of audibility, these values need to be related to dB SPL, i.e. converted to decibels of sound pressure intensity. The analog signal chain after the DSP includes DAC, preamplifier, power amplifier and speaker. Although the gain and performance of each component may vary significantly from system to system, it is still possible to relate dBFS to dB SPL with sufficient accuracy from the perspective of system configuration alone.
Typically, digital tracks are recorded at -20dBFS, which fully meets the amplitude required for the signal peak, while also having enough dynamic range to show the quiet part of the audio file, and will not be distorted in different formats such as CD, Dolby Digital and DTS. As we all know, the auditory configuration recommended by THX is to reproduce a -20dBFS sound signal at 85dB sound pressure intensity, which is usually very loud, and normal listening will be much lower than this intensity.
From the above facts, it can be concluded that there is a linear mapping between dBFS and dB SPL, with the following relationship: a signal at 0dBFS can be reproduced at 105dB SPL. It should be noted that the sound produced in this case is very high and is not suitable for long-term listening; 0dB SPL corresponds to -105dBFS.
Hearing and hearing threshold
Human hearing has its limits, and usually 0dB is set as the lowest audible level in the sound pressure intensity design. Most of the sound spectrum (below 300Hz and above 10KHz) can only be heard above 10dB sound pressure intensity. The highest sensitivity of sine waves is 3~4KHz, and such sounds can be perceived by people with excellent hearing at -3~-4 dB SPL.
From a physiological point of view, to reach the threshold of hearing, the energy of the sound needs to be large enough to generate a standing wave in the eardrum, causing the tiny hairs there to fluctuate. Without this fluctuation, the neurons connected to the auditory cortex cannot be triggered, and the sound cannot be perceived. From the above discussion, we get inspiration about audio system design, that is, when the noise level is lower than the hearing threshold of people, it is meaningless to blindly pursue high-precision DSP implementation solutions.
Using the previously obtained listening configuration relationship, the lowest audible sound pressure is -4dB SPL, or -109dBFS. Assuming that all other parts of the signal chain (DAC, preamplifier, etc.) are zero distortion, this means that any DSP that can produce a better than 109dB signal-to-noise ratio will not become a bottleneck in system performance, which is a very important issue in system design using DSP. In actual applications, the analog signal chain is the main source of noise in the system, and the contribution of DSP to noise is much lower than these analog devices.
DSP bit number that meets system performance
The above analysis is based on an average level of -20dBFS and a THX listening configuration. Although this is an extreme case, some performance margin should be left in the design to take into account the change in the dBFS/dB SPL conversion relationship. Therefore, a well-considered design should make the DSP bit number about two more than the theoretical bit number, that is, 121dB using a 6dB/bit configuration, corresponding to a 20-bit dynamic range of the PCM output.
The above analysis is consistent with the assumptions of Dolby's Dolby Digital design solution, which also uses 20-bit accuracy. At the same time, the actual ADC/DAC is also limited to 20-bit performance (<120dB), and even DAT recording uses 20-bit accuracy. All of this verifies the correctness of the above analysis.
The above data is based on the worst case, because in reality the amount of noise generated by the power amplifier, preamplifier and DAC has a greater impact on the overall system performance than the performance of the DSP. The best power amplifier can only achieve a signal-to-noise ratio of 109dB, because noise energy can be accumulated in the linear region, which means that a 20-bit DSP with an output of 121dB will only generate 6.66% of the noise of the amplifier. If the performance of the speaker is also taken into consideration, the noise generated by the DSP is 1/6 - speaker distortion, which can be completely ignored.
The above analysis does not even take into account coding distortion, ADC or microphone noise, all of which are critical. If the entire signal chain is considered, it is clear that a 20-bit DSP is more than enough. Only by testing with abnormal sine waves, artificially synthesizing signals with a precision greater than 20 bits can a measurable difference be obtained, and this difference is actually imperceptible to human hearing.
Lossy compression with "transparent" audio quality
Psycho-acoustic compression is designed to compress a given signal in a lossy way, and then understand to what extent different frequency/time domain signals are audible or inaudible, so as to adjust the encoding process accordingly and reduce the introduced noise below the hearing threshold. The basic phenomenon is that the strong sound part of the signal will mask the adjacent weak sound part. Ideally, such data reduction will not lead to a perceived loss of sound quality, which leads to the concept of "transparent" audio coding or compression.
This is fundamentally different from a simple SNR measurement, and more complex, because it requires accurate reproduction of the relevant audible portion of a particular signal. In other words, while SNR is a good criterion for determining encoding/decoding quality, it is not an appropriate criterion for judging that a DSP that can produce -140dB THD+N is necessarily better than -130dB THD+N. Since psycho-acoustic compression designs are based on the human hearing threshold curve, the above conclusion becomes very obvious, and signals below this threshold cannot be heard.
Question about getting "transparent" audio compression
The actual encoding/decoding output quality is determined by the following factors:
1. Algorithm used
2. Compression bit rate
3. Psycho-acoustic model used in analyzing input signals
4. Transient Analysis Architecture and Transformation Filter Bank
5. Bit allocation strategy
In the encoding/decoding process, the above factors are independent of the algorithm accuracy. Even if infinite precision is used, the above factors still have a decisive influence on the audio quality.
Based on the lossy compression systems discussed above, the following conclusions can be drawn: Traditional measurements such as SNR, THD+N, transient error, and phase error are no longer the ultimate metrics for comparing the performance of different implementations. They can only be used as a reference when identifying and verifying system performance, and cannot be used to rate systems that vary slightly from the psycho-acoustically validated threshold of approximately 120dB performance.
DSP value-added suggestions
Usually 20-bit DSP is enough for the system, and 16-bit is enough from a psycho-acoustic point of view. For a given compression scheme, once the decoder reaches a certain performance, simply increasing the accuracy of the DSP will not further improve the system performance. The actual DSP value-added solution becomes the post-processing of the decoded audio program and the system-level features it provides. In fact, the final consumer also needs the product to have more additional features, such as automatic detection, error shielding and a post-processor that provides virtual sound effects.
Judging from the market response alone, the use of 32-bit DSP is still very successful in concept, because the performance improvement can be intuitively felt from the numbers. In fact, it is an obvious misunderstanding to consider the overall performance and more functions. This misunderstanding is equivalent to comparing only the CPU frequency of PCs without considering the overall performance of the system.
32-bit DSP does not really help improve the final performance of the system, it requires more memory (about 33% more than 24-bit DSP). At the same time, because 32×32MAC is slower than 24×24MAC, a 32-bit DSP core will always be slower than the decoder of the corresponding 24-bit core. In terms of signal-to-noise ratio, actual application results have proved that through optimization, 24-bit DSP can get the same (or even higher) performance as 32-bit DSP.
It should be noted that the above discussion is only based on the basic decoder, and the 32-bit DSP still has certain advantages in post-processing. Therefore, if the front-end uses a highly optimized 24-bit DSP decoding engine and the back-end processor is a 32-bit DSP, such an optimized design combines the advantages of both and is a good choice. Such a system-level chip solution is already available.
Conclusion
In the real world, when using psycho-acoustic compression coding, the SNR test method is not a suitable standard for comparing system performance. The ITU PEAQ test platform method is more suitable for measuring the quality of perceptible audio. In addition, when we evaluate a DSP solution, we also need to consider the system's error shielding, automatic detection and post-processing characteristics, as well as the system's startup response time and batch delay.
The factors that affect the quality of AV receiver system decoders are not all about DSP accuracy. Through strong DSP algorithm skills and optimization, a 24-bit precision DSP can outperform a 32-bit DSP. However, with the continuous improvement of manufacturing technology, 32-bit DSP will eventually solve the above problems, making 32-bit decoders more cost-effective and technically competitive.
Previous article:Design of Ethernet Interface for TM1300 DSP System
Next article:Power supply noise problem in DSP
Recommended ReadingLatest update time:2024-11-16 20:29
- Popular Resources
- Popular amplifiers
- Huawei's Strategic Department Director Gai Gang: The cumulative installed base of open source Euler operating system exceeds 10 million sets
- Analysis of the application of several common contact parts in high-voltage connectors of new energy vehicles
- Wiring harness durability test and contact voltage drop test method
- Sn-doped CuO nanostructure-based ethanol gas sensor for real-time drunk driving detection in vehicles
- Design considerations for automotive battery wiring harness
- Do you know all the various motors commonly used in automotive electronics?
- What are the functions of the Internet of Vehicles? What are the uses and benefits of the Internet of Vehicles?
- Power Inverter - A critical safety system for electric vehicles
- Analysis of the information security mechanism of AUTOSAR, the automotive embedded software framework
Professor at Beihang University, dedicated to promoting microcontrollers and embedded systems for over 20 years.
- Innolux's intelligent steer-by-wire solution makes cars smarter and safer
- 8051 MCU - Parity Check
- How to efficiently balance the sensitivity of tactile sensing interfaces
- What should I do if the servo motor shakes? What causes the servo motor to shake quickly?
- 【Brushless Motor】Analysis of three-phase BLDC motor and sharing of two popular development boards
- Midea Industrial Technology's subsidiaries Clou Electronics and Hekang New Energy jointly appeared at the Munich Battery Energy Storage Exhibition and Solar Energy Exhibition
- Guoxin Sichen | Application of ferroelectric memory PB85RS2MC in power battery management, with a capacity of 2M
- Analysis of common faults of frequency converter
- In a head-on competition with Qualcomm, what kind of cockpit products has Intel come up with?
- Dalian Rongke's all-vanadium liquid flow battery energy storage equipment industrialization project has entered the sprint stage before production
- Allegro MicroSystems Introduces Advanced Magnetic and Inductive Position Sensing Solutions at Electronica 2024
- Car key in the left hand, liveness detection radar in the right hand, UWB is imperative for cars!
- After a decade of rapid development, domestic CIS has entered the market
- Aegis Dagger Battery + Thor EM-i Super Hybrid, Geely New Energy has thrown out two "king bombs"
- A brief discussion on functional safety - fault, error, and failure
- In the smart car 2.0 cycle, these core industry chains are facing major opportunities!
- The United States and Japan are developing new batteries. CATL faces challenges? How should China's new energy battery industry respond?
- Murata launches high-precision 6-axis inertial sensor for automobiles
- Ford patents pre-charge alarm to help save costs and respond to emergencies
- New real-time microcontroller system from Texas Instruments enables smarter processing in automotive and industrial applications
- What will happen if you connect the input and output of the power bank?
- [Silicon Labs Development Kit Review] TensorFlow Application Code Analysis Based on Deep Learning
- Direction of electrical energy transfer in a circuit
- Detailed explanation of the BQ25618/9 dedicated switch charging chip for TWS true wireless headset charging compartment
- TI DLP high-performance system for spectral analysis of liquids and solids
- AD package library
- Share a reference implementation of path planning, the key is that there are animated pictures!
- TMS320C6678 chip TI original board development
- How to configure practical Euler angles for sensortile box?
- Participate in the survey and get a 50-yuan JD card. The number of places is limited and will be given away while stocks last!