Embedded voice playback systems are in increasingly wide use. Chinese speech can be played back in several ways: with a recording chip, with a speech synthesis chip, or from a Chinese voice library. Because Flash memory capacities keep growing and Chinese voice files are easy to extract with the Windows Speech SDK, the voice library method has become widely used.
The Chinese character voice library method falls into two categories: dedicated playback and general playback. A dedicated system stores only a small, fixed set of voice resources, so it needs little storage, but its range of application is limited. A general system stores a single-character voice resource for every distinct pronunciation of Chinese characters, so it can play any combination of text, which suits applications whose content is updated frequently. The system described here builds a voice library from 1220 pronunciation files, one for each distinct Chinese character pronunciation, stores it in Flash memory, and implements an embedded general-purpose Chinese text voice playback system based on the SPCE061A. The hardware is simple, easy to use, and inexpensive, which makes the system suitable for applications such as bus stop announcers.
1 Hardware Principle
A voice library built directly from the wave pronunciation files of Chinese characters takes up a lot of storage. Compressing the voice files greatly reduces the space the library occupies in Flash memory. Lingyang (Sunplus) provides several compression algorithms for voice playback. This system uses SACM_S480, a code-excited linear prediction (CELP) codec with a compression ratio of 80:3. The compressed voice resource data fits in a 512 kB Flash memory.
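The storage claim can be sanity-checked with a little arithmetic. The sketch below assumes the source recordings are 8 kHz, 16-bit mono PCM; the article does not state the recording format, so treat that as an assumption. Under it, an 80:3 ratio gives 4.8 kbit/s, and a 512 kB Flash holds roughly 870 seconds of speech, or about 0.7 seconds per pronunciation file as an upper bound that ignores the index tables.

```c
#include <stdint.h>

/* Assumed source format (not stated in the article): 8 kHz, 16-bit mono PCM. */
#define PCM_SAMPLE_RATE_HZ   8000UL
#define PCM_BITS_PER_SAMPLE  16UL

/* Compression ratio 80:3 quoted for SACM_S480. */
#define RATIO_NUM  80UL
#define RATIO_DEN  3UL

#define FLASH_BYTES  (512UL * 1024UL)  /* capacity of the external Flash */
#define VOICE_FILES  1220UL            /* pronunciation files in the library */

/* Compressed bit rate in bits per second. */
uint32_t compressed_bps(void)
{
    uint32_t pcm_bps = (uint32_t)(PCM_SAMPLE_RATE_HZ * PCM_BITS_PER_SAMPLE);
    return pcm_bps * RATIO_DEN / RATIO_NUM;
}

/* Whole seconds of speech that fit if the entire Flash held voice data. */
uint32_t total_seconds(void)
{
    uint32_t bytes_per_second = compressed_bps() / 8UL;
    return (uint32_t)(FLASH_BYTES / bytes_per_second);
}

/* Upper bound on the average length of one pronunciation, in milliseconds. */
uint32_t avg_ms_per_file(void)
{
    uint64_t total_ms = (uint64_t)FLASH_BYTES * 1000u / (compressed_bps() / 8UL);
    return (uint32_t)(total_ms / VOICE_FILES);
}
```

The real budget is somewhat smaller, because the same Flash also holds the index tables.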
A voice playback system built around an ordinary single-chip microcomputer needs additional peripheral devices and circuits, such as external memory and a D/A converter, and its slow software decompression interferes with other operations. Building the system around an embedded microprocessor costs too much. Lingyang's 16-bit SPCE061A microcontroller is a better fit: it has DSP instructions, integrates the A/D, D/A, and other functional modules that an embedded voice system needs on one chip, and offers strong programmable audio processing. Lingyang also provides the SACM_LIB voice library (this system uses SACMV26e.lib), which implements recording, playback, and voice recognition.
Figure 1 shows the hardware principle of the system. It consists mainly of two chips: the SPCE061A microcontroller and the external SPR4096A Flash memory. The SPCE061A runs all the software, including voice decoding, playback, and other operations, and its D/A output drives the speaker through the SPY0030 audio amplifier. The SPR4096A stores all voice information: the Chinese character pronunciation number index table, the voice resource address index table, and the voice resource data.
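The two index tables suggest a two-step lookup: character to pronunciation number, then pronunciation number to the address of the compressed data. The following is a minimal sketch of that idea, not the article's actual layout. It assumes the text arrives as two-byte GB2312 codes, and the table base addresses, entry widths, byte order, and the `flash_read_fn` reader are all hypothetical. A RAM image stands in for the Flash so the sketch can run on a PC.

```c
#include <stdint.h>

/* Hypothetical layout of the two index tables in the external Flash. */
#define CHAR_TABLE_BASE  0x00000UL  /* character -> pronunciation number, 2 bytes each */
#define ADDR_TABLE_BASE  0x04000UL  /* pronunciation number -> start address, 4 bytes each */

/* Reads one byte from Flash; supplied by the driver (hypothetical signature). */
typedef uint8_t (*flash_read_fn)(uint32_t addr);

/* Map a GB2312 hanzi (lead byte 0xB0..0xF7, trail byte 0xA1..0xFE)
 * to a linear index, or -1 if the code is not a hanzi. */
int32_t gb2312_index(uint8_t hi, uint8_t lo)
{
    if (hi < 0xB0 || hi > 0xF7 || lo < 0xA1 || lo > 0xFE)
        return -1;
    return (int32_t)(hi - 0xB0) * 94 + (lo - 0xA1);
}

/* Step 1: character -> pronunciation number (little-endian 16-bit entry). */
int32_t pronunciation_number(flash_read_fn rd, uint8_t hi, uint8_t lo)
{
    int32_t idx = gb2312_index(hi, lo);
    if (idx < 0)
        return -1;
    uint32_t a = CHAR_TABLE_BASE + (uint32_t)idx * 2u;
    return (int32_t)(rd(a) | ((uint16_t)rd(a + 1u) << 8));
}

/* Step 2: pronunciation number -> start address (little-endian 32-bit entry). */
uint32_t voice_address(flash_read_fn rd, uint16_t num)
{
    uint32_t a = ADDR_TABLE_BASE + (uint32_t)num * 4u;
    uint32_t v = 0;
    for (int i = 3; i >= 0; --i)
        v = (v << 8) | rd(a + (uint32_t)i);
    return v;
}

/* RAM image standing in for the Flash so the sketch runs on a PC. */
uint8_t demo_image[0x6000];

uint8_t demo_read(uint32_t addr)
{
    return addr < sizeof demo_image ? demo_image[addr] : 0xFF;
}
```

Characters that share a pronunciation map to the same number, which is how a library of 1220 files can cover every character in the table.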
General text voice playback has three working modes: playback of pre-stored fixed text, playback after the text has been received, and real-time text playback. Real-time playback places the highest demands on the system's software processing and is the best test of how well voice playback coordinates with other operations. This system therefore uses real-time playback to debug the program: Chinese characters arrive over the serial port while text is being played. In hardware, the UART of the SPCE061A is connected to the RS-232 port of the PC.
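Receiving characters while audio is playing usually means decoupling the serial input from the playback loop. One common way to do this, shown here as a sketch rather than as the article's implementation, is a small ring buffer that the UART receive interrupt fills and the main loop drains between decoder service calls. The function names, the buffer size, and the two-byte character assumption (for example GB2312) are illustrative.

```c
#include <stdint.h>

#define RX_BUF_SIZE 64u  /* power of two so the index wraps with a mask */

static volatile uint8_t rx_buf[RX_BUF_SIZE];
static volatile uint8_t rx_head;  /* written only by the receive interrupt */
static volatile uint8_t rx_tail;  /* written only by the main loop */

/* Call from the UART receive interrupt with the byte just read
 * (hypothetical hook; the real interrupt handler is device specific). */
void rx_put(uint8_t byte)
{
    uint8_t next = (uint8_t)((rx_head + 1u) & (RX_BUF_SIZE - 1u));
    if (next != rx_tail) {  /* drop the byte if the buffer is full */
        rx_buf[rx_head] = byte;
        rx_head = next;
    }
}

/* Call from the main loop. Returns 1 and stores a two-byte character
 * code once both bytes have arrived, otherwise returns 0. */
int rx_get_char(uint8_t *hi, uint8_t *lo)
{
    uint8_t avail = (uint8_t)((rx_head - rx_tail) & (RX_BUF_SIZE - 1u));
    if (avail < 2u)
        return 0;
    *hi = rx_buf[rx_tail];
    *lo = rx_buf[(rx_tail + 1u) & (RX_BUF_SIZE - 1u)];
    rx_tail = (uint8_t)((rx_tail + 2u) & (RX_BUF_SIZE - 1u));
    return 1;
}
```

Because each index has a single writer, the interrupt and the main loop never modify the same variable, so no locking is needed for this simple case.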
The SPCE061A has a built-in ICE (in-circuit emulator) debugging interface that supports online emulation, debugging, and program download. This makes debugging convenient and is one of the main reasons for choosing the SPCE061A. In hardware, the ICE debugging interface is connected to the parallel port of the PC. Besides programmable audio processing, the SPCE061A offers plentiful I/O ports, clock sources, and interrupt sources, 32 k words of Flash, and 2 k words of RAM, so it can carry out complex control operations while playing voice.
The SPR4096A has a capacity of 512 k × 8 bits, divided into 256 sectors of 2 kB each, and can be configured for either a BMI (bus memory interface) or an SIF (serial interface). To save I/O ports, this system uses the serial interface. It is an SIO interface defined by Lingyang that also uses two signal lines, SCK and SDA, but its data format differs slightly from I2C and SPI. On the SPCE061A, IOB0 provides the clock signal SCK and IOB1 the data signal SDA.
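The sector geometry follows directly from these figures: 512 kB divided into 256 sectors gives 2048 bytes per sector. The helpers below are a sketch of the address arithmetic that is useful when a voice library update has to be aligned to sectors. The function names are illustrative, and the SIO transfer itself is left out because its data format is not given here.

```c
#include <stdint.h>

#define FLASH_BYTES   (512UL * 1024UL)              /* 512 k x 8 bits */
#define SECTOR_COUNT  256UL
#define SECTOR_BYTES  (FLASH_BYTES / SECTOR_COUNT)  /* 2048 bytes */

/* Index of the sector that contains a byte address. */
uint32_t sector_of(uint32_t addr)
{
    return (uint32_t)(addr / SECTOR_BYTES);
}

/* First byte address of the sector that contains addr. */
uint32_t sector_base(uint32_t addr)
{
    return (uint32_t)(addr & ~(SECTOR_BYTES - 1UL));
}

/* Number of sectors touched by a block of len bytes starting at start. */
uint32_t sectors_spanned(uint32_t start, uint32_t len)
{
    if (len == 0u)
        return 0u;
    return sector_of(start + len - 1u) - sector_of(start) + 1u;
}
```

For example, a 16-byte block starting at address 2040 crosses the boundary at 2048 and therefore touches two sectors.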