Article count:16439 Read by:87952319

Hottest Technical Articles
Exclusive: A senior executive of NetEase Games was taken away for investigation due to corruption
OPPO is going global, and moving forward
It is reported that Xiaohongshu is testing to directly direct traffic to personal WeChat; Luckin Coffee is reported to enter the US and hit Starbucks with $2, but the official declined to comment; It is reported that JD Pay will be connected to Taobao and Tmall丨E-commerce Morning News
Yu Kai of Horizon Robotics stands at the historical crossroads of China's intelligent driving
Lei Jun: Don't be superstitious about BBA, domestic brands are rising in an all-round way; Big V angrily criticized Porsche 4S store recall "sexy operation": brainless and illegal; Renault returns to China and is building a research and development team
A single sentence from an overseas blogger caused an overseas product to become scrapped instantly. This is a painful lesson. Amazon, Walmart, etc. began to implement a no-return and refund policy. A "civil war" broke out between Temu's semi-hosted and fully-hosted services.
Tmall 3C home appliances double 11 explosion: brands and platforms rush to
Shareholders reveal the inside story of Huayun Data fraud: thousands of official seals were forged, and more than 3 billion yuan was defrauded; Musk was exposed to want 14 mothers and children to live in a secret family estate; Yang Yuanqing said that Lenovo had difficulty recruiting employees when it went overseas in the early days
The app is coming! Robin Li will give a keynote speech on November 12, and the poster reveals a huge amount of information
It is said that Zhong Shanshan asked the packaged water department to sign a "military order" and the entire department would be dismissed if the performance did not meet the standard; Ren Zhengfei said that it is still impossible to say that Huawei has survived; Bilibili reported that employees manipulated the lottery丨Leifeng Morning News
Account Entry

Listen to your own beautiful voice to navigate, this is the new function of Baidu Maps

Latest update time:2019-09-20
    Reads:

▲Click above Leifeng.com Follow


When you think that the map software only has Zhiling's voice, it can actually also record your own voice.

Text | Leifeng.com

A wise artist once said: I have been in many cars, including some very bad ones. I have experienced this deeply. The driver took the wrong road, played disco music in the car, smoked cigarettes in the car, etc. What's worse, not only is it equipped with the above slots, but the mobile phone navigation voice also uses a very unpleasant voice. Whenever this happens, I want to help him replace it with a pleasant voice, such as my own.

I believe that many drivers’ navigation voice packages are either Zhiling or Guo Degang. Although they sound nice and interesting, they will inevitably become boring over time. The new voice customization function of Baidu Maps not only solves this problem, but also increases the author’s stickiness to this app.

The traditional speech synthesis process (such as AutoNavi's World of Warcraft voice package) usually requires recording more than 1,000 sentences, and the synthesis also takes a very long time (the production cycle of AutoNavi's World of Warcraft voice package is as long as 3 months). However, Baidu Map's voice customization can be completed in 20 sentences, and the synthesis time is also very short.

When you upgrade Baidu Maps to the latest version, you can see a "Voice Customization" icon in the upper left corner. After clicking it, you can choose the voices of stars such as Hua Chenyu, Qin Lan, Meng Dan, etc. In the recording page, you can choose female voice, male voice, female child voice or male child voice, as well as 4 types of recording texts according to the attributes of different users.

However, after starting to record, you need to find a quiet environment. There are a total of 20 recorded texts. It takes about 5 minutes to complete the process. The entire recognition process is very smooth.

After recording, the voice will enter the synthesis stage, which takes a total of 15-20 minutes and is done entirely in the cloud, so even if you exit the app it will not have any impact . After completion, just click to download. The author's voice package capacity is 10MB, which takes up almost no storage space.

However, due to factors such as the recording environment, sound characteristics, and the number of people recording, there may be a certain delay in the production of voice packages for some users.

After the download is complete, you can hear your own AI voice , and at this point, all you need to do is start navigating. When I first heard the navigation voice, I was really shocked. Although it was not realistic, the similarity was quite OK, and there was a sense of familiarity that came over me, forming an illusion that there was another me next to me.

And you can use your own voice not only when navigating, but even when interacting with Xiaodu Assistant, such as asking about weather conditions, nearby food, navigation to the Eiffel Tower, etc., it also uses synthesized voice for feedback instead of the default voice. This adds a sense of intimacy and makes me more willing to use this feature, or even the entire map app.

But listening to it yourself and to others is a completely different experience, so I also let people around me listen to the voice synthesized by Baidu Maps. Everyone's first impression was the same: This is indeed my voice. In terms of similarity, everyone said it was around 50%-70%.

However, I think this function is not for "yourself" use, because listening to your own voice for a long time is also quite boring. Personally, I think it is more for you to record the voices of important people around you. If you are a husband, you can listen to your wife's voice on the long commute to work in traffic jams; if you are a parent, you can listen to your children's voices. In addition, there are many more interesting usage scenarios for users to explore.

Furthermore, if your voice is nice enough, you can even publish the synthesized voice to the Baidu Map platform for others to download and use, so that you can better interact with other users. I browsed around and found that many people would imitate celebrities or Crayon Shin-chan, which is really interesting. After making a voice package, you can also share it with friends around you who are also using Baidu Maps. You don’t need to use other people’s phones to do it. You can quietly change it for your relatives and friends, so that it can also be a small surprise.

In the past, listening to your own voice while navigating seemed like a distant thing. As the world's first map app that supports voice customization, Baidu Maps can make it happen. After using it for two days, I still like this feature of Baidu Maps very much. The whole process takes less than 30 minutes , while traditional speech synthesis technology usually requires recording up to 1,000 sentences, and the synthesis time is also very long. The reason why Baidu's voice customization is so fast is that Baidu uses the original style transfer technology Meitron model, which has good performance in many aspects such as timbre, emotion and rhythm, and is currently the industry's leading speech synthesis technology.

This further positions Baidu Maps as an AI map, and more importantly, this feature does not require you to record in a professional studio, a quiet environment is enough. No matter what product it is, ease of use is very important, and throughout the whole process, I did not feel any threshold, from recording to use, it was done in one go, very simple.

New arrival! "AI Investment Research" has now launched the complete video of the CCF GAIR 2019 summit and white papers on major theme sessions, including the Robotics Frontier Session, Intelligent Transportation Session, Smart City Session, AI Chip Session, AI Finance Session, AI Healthcare Session, Smart Education Session, etc. "AI Investment Research" members can watch the annual summit videos and research reports for free, scan the QR code to enter the member page to learn more, or send a private message to teaching assistant Xiao Mu (WeChat: moocmm) for consultation.

Are you still watching?

Latest articles about

Xiaomi air conditioners are selling like hot cakes. Lu Weibing: A competitor's product that costs 3,000 yuan is sold for 20,000 yuan. Dong Mingzhu is caught in the crossfire. Royole Technology declares bankruptcy. Employees' claims may not be repaid. Zhong Shanshan says he looks down on entrepreneurs who sell goods through live streaming. 
Baidu: Making big model applications more practical 
Dahua Technology joins hands with Hongmeng, is it the direction of the tide or the collision of wisdom? 
Leading the westward expansion of e-commerce, the 150 billionth package will be delivered on Pinduoduo in 2024 
Exclusive: Vipshop Senior Operations Director Fan Li resigns 
Performance exploded! Xiaomi Motors' quarterly revenue sprinted to 10 billion yuan, Lu Weibing said there is no upper limit on the investment in intelligent driving; the widow of the founder of Shanshan Holdings took over from her eldest son as chairman; Zeekr executives called for vigilance against pig-killing scams 
Alibaba Cloud returns to growth track 
Scolding employees and being criticized for being overbearing, Dong Mingzhu: You are so funny, I am the boss; Hycan Auto was exposed to have defaulted on compensation for laid-off employees; Chairman of a state-owned enterprise responded to the high school education of the operations director丨Leifeng Morning News 
1688 is an OEM brand, not following the old path of strict selection 
The Double 11 changes in online retail: Who is driving the direction of the tide? 

 
EEWorld WeChat Subscription

 
EEWorld WeChat Service Number

 
AutoDevelopers

About Us Customer Service Contact Information Datasheet Sitemap LatestNews

Room 1530, Zhongguancun MOOC Times Building,Block B, 18 Zhongguancun Street, Haidian District,Beijing, China Tel:(010)82350740 Postcode:100190

Copyright © 2005-2024 EEWORLD.com.cn, Inc. All rights reserved 京ICP证060456号 京ICP备10001474号-1 电信业务审批[2006]字第258号函 京公网安备 11010802033920号