Microsoft 2020 International Day of Persons with Disabilities special event: AI makes audiobooks more vivid

Publisher:EnchantedMagicLatest update time:2020-12-03 Source: 新浪数码Keywords:AI Reading articles on mobile phones Scan QR code
Read articles on your mobile phone anytime, anywhere

      On the afternoon of December 2, Microsoft held a special event in Beijing to celebrate the 2020 International Day of Persons with Disabilities, showcasing the latest progress in Microsoft's AI voice technology - neural network voice intelligence. Neural network voice intelligence has the ability to produce multiple timbres and emotions, and can be quickly produced through the creative platform. At the same time, a donation ceremony for the audio content of Hong Dandan's "Library in the Heart" was also held at the event.

Microsoft Global Senior Vice President Hong Xiaowen

  During the event, Dr. Hsiao-Wen Hong, Microsoft's global senior vice president, chairman of Microsoft Asia Pacific R&D Group and director of Microsoft Research Asia, delivered a keynote speech. Hsiao-Wen Hong first emphasized Microsoft's mission of "empowering every person and every organization in the world to achieve extraordinary results". In 2020, the technology sector contributed about 5% to the global GDP, and it is expected to reach 10% by 2030. Microsoft will also continue to be committed to using inclusive technology to achieve everyone and bring products and services to everyone. Artificial intelligence will continue to make the world a better place from six aspects: Earth Plan, Technology Accessibility, Humanitarian Action Plan, Cultural Heritage Protection Technology, and Health and Medical Technology.

Microsoft Global Technical Fellow Huang Xuedong

  Subsequently, Dr. Xuedong Huang, Microsoft Global Technology Fellow and Chief Technology Officer of Microsoft Azure AI, also shared through a video: Thanks to the efforts of Microsoft Research Asia, Microsoft's AI voice technology has been integrated into an intelligent audio content creation platform that has both practical and promotional value, allowing people who have not been exposed to AI technology to participate in the creation of audio content, bringing richer audio content.

Hong Dandan's "Library in Mind" Audio Content Donation Ceremony

  A donation ceremony for the audio content of Hongdandan's "Library of Mind" was also held at the event. Hongdandan's "Library of Mind" was established by Beijing Hongdandan Cultural Exchange Center (hereinafter referred to as Hongdandan) to provide audio book lending services for the blind. Zheng Xiaojie, the founder of Hongdandan, said that Hongdandan found in many blind schools that the existing books and audio content for the blind are generally old and cannot meet the reading needs of the blind. Traditional manually recorded audio content also has the disadvantages of being time-consuming and small in quantity. Cooperation with Microsoft can bring rich choices to the blind, so that books can accompany the blind throughout their lives.

Ding Binggong, Chief Product Director of Microsoft Cloud Computing and Artificial Intelligence Division

  How to achieve vivid and rich speech synthesis? Ding Binggong, Chief Product Director of Microsoft Cloud Computing and Artificial Intelligence Division, explained the relevant technologies: Microsoft has four major advantages in speech synthesis: the most intelligent speech synthesis, the most extensive global speech coverage, flexible cloud and terminal calls, and powerful speech customization capabilities. On this basis, Microsoft launched neural network speech intelligence, which conducts neural network acoustic learning on the input text, and outputs natural audio after neural network acoustic decoding.

Neural network voice intelligence has multiple timbres and multiple emotions

  Compared with traditional intelligent voice, neural network voice intelligence has multiple timbres and multiple emotions, making the voice content no longer monotonous. For example, neural network voice intelligence can simulate the speaking styles of various scenarios such as news broadcasts, customer service, and chats, and can add emotions such as happiness, disdain, and anger, and can achieve emotional grading to make emotions more delicate. In addition to platform voices, neural network voice intelligence can also provide voice customization services, design voices that are in line with corporate, organizational or personal brand strategies, and optimize emotions according to the scene to create a unique personality and achieve natural human-computer interaction.

Intelligent audio content creation platform

  In actual use, the intelligent audio content creation platform created by Microsoft, through two parts: intelligent fully automatic generation mode and customized free creation mode, allows volunteers who are not familiar with AI technology to create audio content through simple operations.

"AI Voice + Public Welfare" Roundtable Dialogue

       At the end of the event, Microsoft organized two roundtable discussions on "AI Voice + Charity" and "AI Voice + Industry" to share more stories behind Microsoft's AI voice technology and Hong Dandan's charity activities.


Keywords:AI Reference address:Microsoft 2020 International Day of Persons with Disabilities special event: AI makes audiobooks more vivid

Previous article:Apple may use liquid-filled lenses to improve users' visual experience
Next article:Does OFILM respond to being excluded from Apple's camera module supply chain?

Recommended ReadingLatest update time:2024-11-16 23:42

AI+5G era: data generation and storage demand enter a stage of explosive growth
In the AI+5G era, we are accelerating into a stage of explosive growth in data generation and storage demand. "With the enhancement of edge computing capabilities, we will find that more and more sensing and analysis can be realized at the edge. However, these data are not fully stored at the edge." Zhang Dan introduc
[Embedded]
AI+5G era: data generation and storage demand enter a stage of explosive growth
Infineon acquires Imagimob, leader in micro-machine learning, further enhancing and expanding its embedded AI solutions
Infineon acquires Imagimob, leader in micro-machine learning, further enhancing and expanding its embedded AI solutions Infineon Technologies AG announced that it has acquired Stockholm-based start-up Imagimob Ltd., a leading platform provider dedicated to machine learning (ML) on edge devices. ) to provide assist
[Semiconductor design/manufacturing]
Mouser now sells Seeed Studio reComputer Jetson development kit to help build AI applications
June 27, 2022 – Mouser Electronics, an authorized global semiconductor and electronic component distributor focused on introducing new products, is now stocking Seeed Studio’s reComputer Jetson 20-1 Xavier NX and reComputer Jetson 10-1 Nano development kits. Based on advanced NVIDIA cores, the development kits enab
[Embedded]
Mouser now sells Seeed Studio reComputer Jetson development kit to help build AI applications
Understanding the development of AI perception technology for autonomous driving
Autonomous driving is a technology that integrates perception, decision-making, and interaction. Environmental perception is the first step in autonomous driving and is the link between the vehicle and the environment. The means of sensing the environment are becoming increasingly diversified through
[Embedded]
Understanding the development of AI perception technology for autonomous driving
Intel AI technology innovation shines at Paris Olympics
Intel Technology Supports the Paris Olympics: Creating a New Chapter for Sports Events The 2024 Paris Olympic Games are full of exciting events, and athletes, spectators, and sports fans have already experienced many new experiences. As the official global artificial intelligence platform partner of t
[Network Communication]
Intel AI technology innovation shines at Paris Olympics
Intel and Boston Consulting Group Launch Enterprise-Grade Generative AI Solution
This groundbreaking solution, powered by Intel AI supercomputers, unlocks business value through customized data sets while ensuring high security and data privacy. Today, Boston Consulting Group (BCG) and Intel announced a strategic collaboration to implement generative AI (GenAI) using end-to-end In
[Industrial Control]
Infineon Technologies and Reality AI develop advanced sensing solutions to give vehicles the ability to hear
Currently, most ADAS are based on cameras, radars or lidars, and the target object must be within the system's line of sight. However, for emergency vehicles, although their alarms can be heard in advance, they can only be discovered when they enter the ADAS's field of view. (Image source: Infineon) According to
[Automotive Electronics]
Infineon Technologies and Reality AI develop advanced sensing solutions to give vehicles the ability to hear
Renesas Electronics to Demonstrate First AI Solution Based on Helium Technology for Arm® Cortex®-M85 Processor
Renesas Electronics to Exhibit at Embedded World The first AI solution based on Helium technology of Arm® Cortex®-M85 processor Renesas, a leading supplier of microcontrollers, will exhibit at Embedded World 2023 Demonstrating new processor’s ideal performance for demanding AI applicatio
[Industrial Control]
Renesas Electronics to Demonstrate First AI Solution Based on Helium Technology for Arm® Cortex®-M85 Processor
Latest Mobile phone portable Articles
Change More Related Popular Components

EEWorld
subscription
account

EEWorld
service
account

Automotive
development
circle

About Us Customer Service Contact Information Datasheet Sitemap LatestNews


Room 1530, 15th Floor, Building B, No.18 Zhongguancun Street, Haidian District, Beijing, Postal Code: 100190 China Telephone: 008610 8235 0740

Copyright © 2005-2024 EEWORLD.com.cn, Inc. All rights reserved 京ICP证060456号 京ICP备10001474号-1 电信业务审批[2006]字第258号函 京公网安备 11010802033920号