A start-up says its voice recognition tech beats Google and Amazon at reducing racial bias

A young woman using voice recognition software on a smartphone.

Luis Alvarez | DigitalVision | Getty Images

LONDON — A British speech recognition firm says its software performed better than that of major tech companies like Google and Amazon in understanding Black voices.

Speechmatics, which is based in Cambridge, England, said Tuesday that its system had an overall accuracy rate of 83% for African American voices.

That’s higher than Microsoft (73%), Amazon (69%), Google (69%), IBM (62%) and Apple (55%), according to research published by Stanford University in 2020 which compared results from major tech companies on how accurately their speech recognition programs understood African Americans.

Systems from Amazon, IBM, Google, Microsoft and Apple made nearly twice as many errors when interpreting words spoken by African Americans than white people, according to researchers at Stanford.

Speechmatics says its system misidentified words from Black voices 17% of the time, versus 31% for Google and Amazon.

“It’s critical to study and improve fairness in speech-to-text systems given the potential for disparate harm to individuals through downstream sectors ranging from healthcare to criminal justice,” said Allison Koenecke, lead author of the Stanford study.

Bias in A.I.

Voice recognition tech has rapidly become embedded in everyday life, thanks to the prevalence of virtual assistants on smart devices like phones and speakers.

Apple pioneered the use of voice-activated software on mobile devices with its digital assistant Siri, while Amazon was one of the first to bring speech recognition to the home with its Echo speakers and Alexa assistant.

Researchers have become increasingly concerned about bias in the algorithms powering these speech recognition services. Specifically, experts say many voice recognition programs are trained on limited sets of data, making them less effective.

While speech recognition applications have little trouble transcribing, a white, male, East Coast news presenter, “they don’t have the same level of accuracy” with underrepresented voices, according to Will Williams, Speechmatics’ vice president of machine learning.

“As with all these things, it’s about the quality of data in the training sets,” Stephanie Hare, an AI ethics researcher, told CNBC. “There has been racial bias, gender bias, and regional accent bias in speech recognition technology for a long time.”

“This technology does not work the same for everyone, yet,” Hare added. “It could, eventually, with refinement.”

Unlabeled data

Source link

Related Articles

Back to top button