0 Preface
With the continuous development and maturity of speech signal processing technology, speech synthesis is gradually becoming a key technology for human-computer interface in information technology. DSP chip, that is, digital signal processor, is a microprocessor with a special structure specially designed to quickly implement various signal processing algorithms. Its processing speed is 10 to 50 times faster than the fastest CPU. This article introduces an implementation method of a Chinese speech synthesis system based on DSP.
1 Overall system plan
The biggest feature of speech synthesis is to synthesize continuous sentences with unlimited vocabulary from limited storage units [1]. In order to achieve this, this system is designed to have (1) a front-end preprocessing module that converts input text files into a standard format that the system can process; (2) a prosodic rule library that gives the prosodic feature parameters of each syllable in the current language environment; (3) The speech synthesizer adjusts the acoustic parameters of the corresponding speech units in the original speech library according to the given prosodic feature parameters; (4) Splicing the adjusted speech units together to obtain a continuous speech output corresponding to the input text It consists of 4 basic processes. The basic principle block diagram of the system is shown in Figure 1.
2 Hardware system design
*Fund project: Hunan Provincial Department of Education (03C025)
The purpose of the Chinese speech synthesis system is to output the input text file clearly, naturally and understandably in the form of continuous speech. ATMEL's AT89S52 microcontroller displays the text file input from the keyboard, then sends it to TMS320VC5402 for processing, and finally outputs the synthesis result. The hardware structure block diagram is shown in Figure 2.
The keyboard interface circuit of AT89S52 adopts the interrupt mode. When a key is pressed, an interrupt request is generated, the interrupt processing is entered, and then the corresponding processing is performed by querying the status of P1.0 and P1.1. Use resistors and capacitors to form an anti-bounce circuit to prevent incorrect operation of the circuit.
AT89S52 sends the text information input from the keyboard to the LCD display, and at the same time writes the data to the external memory CY7C133, and then TMS320 VC5402 reads the information on CY7C133 for processing. The LCD command format is shown in Table 1:
Table 1 LCD command format
Table 2 Register selection
AT89S52 and TMS320VC5402 work independently, and their information and data exchange are realized by sharing an external memory. The signal communication between them is realized through hard connection and software judgment [2].
The external memory uses CY7C133, which is a high-speed 2K X 16bit static asynchronous dual-port RAM with a storage speed of 25ns. It has two independent sets of address lines, data lines and control signal lines, allowing data in two control devices to be communicated through a commonly connected memory. This dual-port RAM allows two controllers to read any memory unit at the same time (including reading the same unit at the same time), but does not allow simultaneous writing or reading and writing of the same address unit.
For TMS320VC5402, the corresponding address of data memory CY7C133 is 4000H~47FFH.
For AT89S52, the corresponding address of data memory CY7C133 is 2000H~27FFH.
3 Software system design
As a tonal language, Chinese has very complex rhythmic features. In order to synthesize continuous sentences with unlimited vocabulary from limited storage units, the prosodic parameters of the phonetic library units must be adjusted under certain prosodic rules to obtain phonetic change units that conform to the current language flow environment [3].
According to different methods of obtaining sound change units, speech synthesizers can be divided into two types: (1) waveform splicing synthesis; (2) parameter synthesis (also known as source/filter synthesis). This system uses the method of waveform splicing and synthesis to directly adjust the time domain and frequency domain waveforms of the waveform to obtain the required sound change unit.
Simple waveform splicing method makes it difficult to adjust pitch and pitch (duration). Therefore, this system uses the direct splicing of neutral intonation syllables with the Pitch Synchronous Waveform Addition (PSOLA) algorithm, and uses the Code Excited Linear Prediction (CELP) coding method to encode and compress the original sampled sound library. The basic flow chart is shown in Figure 3.
? ? |
This system uses SCM to display input text files in real time, and can achieve comparative output of synthesized speech and input text files, which is highly intuitive; it has high clarity, intelligibility and naturalness; the synthesis algorithm has low computational complexity and can be used as small as possible sound library to meet the requirements for occupying limited storage space.
Previous article:Design of a Chinese speech synthesis system based on DSP
Next article:Design of wide-band digital phase-locked loop and implementation based on FPGA
- Popular Resources
- Popular amplifiers
- Mission-oriented wireless communications for cooperative sensing in intelligent unmanned systems
- Monocular semantic map localization for autonomous vehicles
- Introduction to Artificial Intelligence and Robotics (Murphy)
- Algorithm Notebooks Practical Guide for Computer Training (Edited by Hu Fan and Zeng Lei)
- Huawei's Strategic Department Director Gai Gang: The cumulative installed base of open source Euler operating system exceeds 10 million sets
- Analysis of the application of several common contact parts in high-voltage connectors of new energy vehicles
- Wiring harness durability test and contact voltage drop test method
- Sn-doped CuO nanostructure-based ethanol gas sensor for real-time drunk driving detection in vehicles
- Design considerations for automotive battery wiring harness
- Do you know all the various motors commonly used in automotive electronics?
- What are the functions of the Internet of Vehicles? What are the uses and benefits of the Internet of Vehicles?
- Power Inverter - A critical safety system for electric vehicles
- Analysis of the information security mechanism of AUTOSAR, the automotive embedded software framework
Professor at Beihang University, dedicated to promoting microcontrollers and embedded systems for over 20 years.
- Innolux's intelligent steer-by-wire solution makes cars smarter and safer
- 8051 MCU - Parity Check
- How to efficiently balance the sensitivity of tactile sensing interfaces
- What should I do if the servo motor shakes? What causes the servo motor to shake quickly?
- 【Brushless Motor】Analysis of three-phase BLDC motor and sharing of two popular development boards
- Midea Industrial Technology's subsidiaries Clou Electronics and Hekang New Energy jointly appeared at the Munich Battery Energy Storage Exhibition and Solar Energy Exhibition
- Guoxin Sichen | Application of ferroelectric memory PB85RS2MC in power battery management, with a capacity of 2M
- Analysis of common faults of frequency converter
- In a head-on competition with Qualcomm, what kind of cockpit products has Intel come up with?
- Dalian Rongke's all-vanadium liquid flow battery energy storage equipment industrialization project has entered the sprint stage before production
- Allegro MicroSystems Introduces Advanced Magnetic and Inductive Position Sensing Solutions at Electronica 2024
- Car key in the left hand, liveness detection radar in the right hand, UWB is imperative for cars!
- After a decade of rapid development, domestic CIS has entered the market
- Aegis Dagger Battery + Thor EM-i Super Hybrid, Geely New Energy has thrown out two "king bombs"
- A brief discussion on functional safety - fault, error, and failure
- In the smart car 2.0 cycle, these core industry chains are facing major opportunities!
- The United States and Japan are developing new batteries. CATL faces challenges? How should China's new energy battery industry respond?
- Murata launches high-precision 6-axis inertial sensor for automobiles
- Ford patents pre-charge alarm to help save costs and respond to emergencies
- New real-time microcontroller system from Texas Instruments enables smarter processing in automotive and industrial applications
- Watchdog for C2000-8027 series
- Some questions about magnetic field
- EEWORLD University Hall----Live Replay: Melexis consumer-grade ultra-low power position sensor simplifies design and reduces costs
- EEWORLD University - Multiphase Buck Regulator Design: Case Study
- I need help from the experts on writing MAX7219 dot matrix in microPython
- 【Top Micro Intelligent Display Module】IV: Serial port interaction and application of curve, drawing board and animation controls
- Last few hours: NXP i.MX answer questions and win AI-IoT series books, 10 books left, not all of them are mine~
- EEWORLD University Hall - Sharing of household heating and cooling inverter air conditioner application solutions based on Lingdong MM32SPIN series MCU
- Passband gain
- Steps and precautions for live load verification of relay protection devices