Geely releases a new-generation speech synthesis model with voice cloning capabilities - EEWORLD

Geely recently announced a new technical milestone for its Star Rui AI large model: the official release of the new-generation HAM-TTS speech synthesis large model. According to Geely, the model introduces a text acoustic information prediction module and can synthesize natural, fluent, and emotionally expressive speech from a given text. It also offers strong voice cloning: a reference sample only a few seconds long is enough for it to reproduce a realistic voice, giving users a more lifelike voice interaction experience.

According to Geely, the new-generation HAM-TTS model took the lead in overcoming the data collection problem, expanding its training data to more than 650,000 hours and its parameter count to 800 million. Geely also adopted a data augmentation strategy: it deliberately introduces "noise" into the training data through splicing and replacement, which improves the model's ability to recognize timbre and makes the timbre of the synthesized audio more stable, more coherent, and closer to a human voice.
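The article does not describe how the splicing and replacement are actually carried out, so the following is only a minimal sketch of the general idea, not Geely's implementation. It assumes each utterance is a plain list of audio samples; the function names `splice` and `replace_segment` and the `seg_len` parameter are hypothetical and chosen purely for illustration.

```python
import random


def splice(a, b, rng):
    """Join a random-length prefix of utterance a to a random-length suffix of b.

    Hypothetical helper: both inputs must contain at least two samples.
    """
    cut_a = rng.randint(1, len(a) - 1)  # keep between 1 and len(a)-1 samples of a
    cut_b = rng.randint(1, len(b) - 1)  # drop between 1 and len(b)-1 samples of b
    return a[:cut_a] + b[cut_b:]


def replace_segment(a, b, seg_len, rng):
    """Overwrite one seg_len-sample window of a with a window taken from b.

    Hypothetical helper: seg_len must not exceed the length of either input.
    The output has the same length as a.
    """
    start_a = rng.randint(0, len(a) - seg_len)
    start_b = rng.randint(0, len(b) - seg_len)
    return a[:start_a] + b[start_b:start_b + seg_len] + a[start_a + seg_len:]


# Toy data: utterance_a is all zeros, utterance_b is all ones,
# so the injected "noise" is easy to spot in the output.
rng = random.Random(0)
utterance_a = [0.0] * 10
utterance_b = [1.0] * 10

noisy = replace_segment(utterance_a, utterance_b, seg_len=3, rng=rng)
joined = splice(utterance_a, utterance_b, rng)
```

In a real pipeline the inputs would be waveforms or acoustic feature frames from different recordings, and the mismatched segments would serve as the artificial noise the model learns to handle.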

The new-generation HAM-TTS model also supports cross-language switching, and it can adjust multiple parameters such as tone, intonation, pauses, and emotion to suit the requirements of a specific scenario.

Geely officially released the Star Rui AI large model on January 11, 2024. The model uses the Star Rui Intelligent Computing Center as its computing base and deeply integrates Geely's self-developed foundation model with its NPDS R&D system and a massive scenario database covering the full car-making chain. According to Geely, it will become a large model for the automotive industry with rich application scenarios, strong computing power, a complete body of automotive domain knowledge, and secure, reliable data and models.

