What you need to know
- A recent tech demo uses Azure Custom Neural Voice to imitate a presenter's actual voice.
- You can compare the presenter and the AI-generated voice within the video.
- The demo explains how to configure Dapr to communicate over gRPC.
Microsoft's Donavan Brown recently shared a video that utilizes Azure Custom Neural Voice to imitate his real voice. Brown is a partner program manager of Azure Incubations at Microsoft. His recent video illustrates the power of Azure when used to replicate human speech.
The video itself is about how to configure Dapr to communicate over gRPC. It's a clear video that explains the process well, but everyday tech enthusiasts are probably more interested in the technology that went into creating the presentation than the contents of the video.
Azure Custom Neural Voice is a text-to-speech feature in Azure Cognitive Service. It lets organizations create a synthetic voice, such as the Flo virtual chatbot for the insurance company Progressive. When Custom Neural Voice came out of preview in February 2021, Microsoft explained (opens in new tab) how it could be used for chatbots, voice assistants, online learning, and in other areas.
Traditional methods of creating text-to-speech voices require around 10,000 lines of voice data. In contrast, Azure Custom Neural Voice can create a realistic voice with much less voice data.
Brown starts the video by speaking to the camera. The video then transitions to a tech demo that uses a synthetic voice based on Brown's real voice. Having both Brown's actual voice and the synthetic voice makes it easy to compare the two.
On Twitter, Brown explained that he played some sentences out loud with his wife and that neither of them could determine if the clips were of Brown's actual voice or from the synthetic voice.
When I first started playing with it there were some sentences I shared with my wife and we could not tell if it was me or not. It is unbelievable what we can do with Azure.When I first started playing with it there were some sentences I shared with my wife and we could not tell if it was me or not. It is unbelievable what we can do with Azure.— Donovan Brown #BlackLivesMatter (@DonovanBrown) September 14, 2021September 14, 2021
Brown also explains that creating a synthetic voice based on a person requires consent.
Sean Endicott is the news writer for Windows Central. If it runs Windows, is made by Microsoft, or has anything to do with either, he's on it. Sean's been with Windows Central since 2017 and is also our resident app expert. If you have a news tip or an app to review, hit him up at email@example.com.
Get the best of Windows Central in in your inbox, every day!
Thank you for signing up to Windows Central. You will receive a verification email shortly.
There was a problem. Please refresh the page and try again.