Google AI researchers working with the ALS Remedy Improvement Institute at the moment shared particulars about Venture Euphonia, a speech-to-text transcription service for folks with talking impairments. Additionally they say their strategy can enhance automated speech recognition for folks with non-native English accents as nicely.
Folks with amyotrophic lateral sclerosis (ALS) typically have slurred speech, however current AI methods are sometimes skilled on voice information with none affliction or accent.
The brand new strategy is profitable primarily because of the introduction of small quantities of knowledge that represents folks with accents and ALS.
“We present that 71% of the development comes from solely 5 minutes of coaching information,” in response to a paper revealed on arXiv July 31 titled “Personalizing ASR for Dysarthric and Accented Speech with Restricted Information.”
Personalised fashions had been in a position to obtain 62% and 35% relative phrase error fee (WER) enchancment for ALS and accents respectively.
The ALS speech information set consists of 36 hours of audio from 67 folks with ALS, working with the ALS Remedy Improvement Institute.
The non-native English speaker information set is known as L2 Arctic and has 20 recordings of utterances that final one hour every.
Venture Euphonia additionally makes use of methods from Parrotron, an AI device for folks with speech impediments launched in July, in addition to fine-tuning methods.
Written by 12 coauthors, the work is being offered at Worldwide Speech Communication Affiliation, or Interspeech 2019, which takes place September 15-19 in Graz, Austria.
“This paper’s strategy overcomes information shortage by starting with a base mannequin skilled on 1000’s of hours of ordinary speech. It will get round sub-group heterogeneity by coaching personalised fashions,” the paper reads.
The analysis, which a Google AI weblog submit highlighted at the moment, follows the introduction of Venture Euphonia and different initiatives in Might, resembling Reside Relay, a function to make cellphone calls simpler for deaf folks, and Venture Diva, an effort to make Google Assistant accessible for nonverbal folks.
Google is soliciting information from folks with ALS to enhance its mannequin’s accuracy and is engaged on subsequent steps for Venture Euphonia, resembling utilizing phoneme errors to scale back phrase error charges.