MSM6679A-110 Voice Recognition Processor
¡ Semiconductor
Contents
Description ..........................................................................................................................................
Features ................................................................................................................................................
Functional and I/O Diagrams ..........................................................................................................
Pin Descriptions .................................................................................................................................
Electrical Specifications .....................................................................................................................
Absolute Maximum Ratings ............................................................................................
Operating Conditions .......................................................................................................
DC Characteristics (VDD = 4.5 to 5.5 V, Ta = -40 to 80˚C) ..........................................
AC Characteristics ............................................................................................................
Timing Diagrams ..............................................................................................................
Functional Description ......................................................................................................................
Voice Recognition .............................................................................................................
SI Recognition ...................................................................................................
SD Recognition .................................................................................................
Name Tag Recording ........................................................................................................
Audio Input Interface .......................................................................................................
Audio Output Interface ....................................................................................................
Memory Interface ..............................................................................................................
External Voice Synthesis Control ...................................................................................
Serial Interface ...................................................................................................................
MSM6679A-110 Slave-Mode API ....................................................................................................
Command Summary ........................................................................................................
Command Descriptions ...................................................................................................
Asynchronous Serial Protocol Example ........................................................................
1
1
2
8
12
12
12
13
14
15
17
17
17
20
20
21
21
21
24
25
26
27
31
44
2
¡ Semiconductor
MSM6679A-110 VRP
¡ Semiconductor
MSM6679A-110 Voice Recognition Processor
SI/SD Voice Recognizer, Recorder/Player, and Speech Synthesizer
DESCRIPTION
The MSM6679A-110 Voice Recognition Processor (VRP) is a slave-mode device that performs
five functions: speaker-independent (SI) voice recognition, speaker-dependent (SD) voice
recognition, solid-state sound recording, sound playback, and speech synthesis. The highly
integrated device also provides an on-chip memory controller, Flash memory interface, analog
data conversion, Oki speech synthesizer interface, and pulse width modulation (PWM) sound
output.
For SI recognition, the MSM6679A-110 contains a vocabulary template in external memory.
Pretrained SI vocabularies eliminate the need for laborious training, as usually required by SD
products. The memory requirements are dependent on the size of the vocabulary. The MSM6679A-
110 can tolerate background noise, while providing high recognition accuracy. In its designated
operating environment, the device achieves a typical recognition accuracy of >95% (using an
Oki-defined test procedure).
For SD recognition, the MSM6679A-110 stores SD vocabulary templates, as defined by the user,
in external SRAM. The MSM6679A-110 can create SD vocabularies of up to 61 words each, with
each word using approximately 50 bytes.
In addition to providing voice recognition capabilities, the MSM6679A-110 integrates a solid-
state recorder/player, speech synthesis functions, and a tone generator. ADPCM recording/
playback provides high quality sound and efficient memory utilization. The MSM6679A-110 can
respond to spoken commands, verbally or with tones, via an on-chip speech synthesizer and tone
generator. For larger speech-synthesis requirements, the MSM6679A-110 also provides a glueless
MSM665x control interface for off-chip speech synthesis.
The MSM6679A-110 can interface to any application or personal computer via a parallel or serial
interface through an open, device-independent serial mode API (SMAPI). To accelerate code
development, Oki supplies an evaluation kit, and assembly and C language programs for this
product.
FEATURES
• SI recognition
- Up to 20 - 25 words in each vocabulary
- Multiple vocabulary support
• SD recognition
- Up to 61 words in each vocabulary
- Multiple vocabulary support
• Speech synthesis
- Up to 2.3-sec internal and 27.6-sec external
speech synthesis on-chip; sample looping
and concatenation allows even longer
phrases.
- On-chip controller for MSM665x speech
synthesizer
- Standard beep tone outputs
- Pulse code modualation (PCM) and
adaptive differential pulse code
modualation (ADPCM) voice or sound-
effect output
• Speech capture and playback
- 28-kbps ADPCM speech compression
• Serial ASCII command interface
• 6944-Hz audio input sample rate for record
andplayback
• 10-kHz sample rate for voice recognition
• 200-msec recognition latency
• Flexible memory mapping for EPROM,
FLASH, and SRAM
• 32-MHz operation
• Packages: 84-pin PLCC (QFJ84-P-S115) or
100-pin TQFP (TQFP100-P-1414-0.50-K)
1