Speech recognition machine demonstrates superhuman ability
USC biomedical engineers have created the world's first machine system that can recognize spoken words better than humans can.
The system might soon facilitate voice control of computers and other machines, help the deaf and instantly produce clean transcripts of conversations. The U.S. Navy, which listens for the sounds of submarines in the hubbub of the open seas, is another possible user.
The system's novel underlying principles could have applications in such medical areas as patient monitoring and the reading of electrocardiograms.
In benchmark testing using just a few spoken words, USC's Berger-Liaw Neural Network Speaker Independent Speech Recognition System not only bested all existing computer speech recognition systems but outperformed the keenest human ears.
Neural nets are computing devices that mimic the way brains process information. Rather than being programmed, neural nets learn to do tasks through a training regimen in which desired responses to stimuli are reinforced and unwanted ones are not.
Speaker-independent systems can recognize a word no matter who or what pronounces it.
No previous speaker-independent computer system has ever outperformed humans in recognizing spoken language, says system co-designer Theodore W. Berger, a professor of biomedical engineering.
The system can distinguish words in vast amounts of random "white noise" and can pluck words from the background clutter of other voices-the hubbub heard in bus stations and cocktail parties, for example.
Even the best existing systems fail completely when as little as 10 percent of hubbub masks a speaker's voice. At slightly higher noise levels, the likelihood that a human listener can identify spoken test words is mere chance. By contrast, Berger and Liaw's system functions at 60 percent recognition with a hubbub level 560 times the strength of the target stimulus.
With just a minor adjustment, the system can identify different speakers of the same word with superhuman acuity. Berger and system co-designer Jim-Shih Liaw achieved this improved performance by paying closer attention to the signal characteristics used by real flesh-and-blood brains in processing information.
"It has been difficult for artificial neural networks even to approach the power of biological systems," said Liaw, director of the Laboratory for Neural Dynamics and a research assistant professor of biomedical engineering at the USC School of Engineering. "Deficiencies were often laid to the fact that even 1,000-neuron networks are tiny, compared with the millions or billions of neurons in biological systems."
Remarkably, USC's neural net system uses an architecture consisting of just 11 neurons connected by only 30 links.
A demonstration of the Berger-Liaw Neural Network Speaker-Independent Speech Recognition System can be found online at: http://www. usc. edu/ext-relations/news_ service/real/real_video.html
Latest stories
- USC Price School Celebrates Naming Gift February 9, 2012 2:45 PM
- George Will Shares His Perspective on Politics February 9, 2012 1:10 PM
- Life on the Rez February 9, 2012 12:10 PM
-
For Journalists »
-
USC in the News
for 2/8/2012 »-
The Chronicle of Higher Education mentioned USC’s $6 billion fundraising campaign. The story noted that USC had already raised $1 billion in a “quiet phase,” including the $200 million naming gift from USC Trustee and alumnus David Dornsife and wife Dana Dornsife to the USC Dornsife College.
The Guardian (U.K.) highlighted two major gifts to USC in a list of the 10 biggest philanthropic benefactors in America. The list included the $200 million naming gift from USC Trustee and alumnus David Dornsife and wife Dana Dornsife to the USC Dornsife College, and the $110 million gift from USC Trustee and USC Viterbi School alumnus John Mork and wife Julie to create the USC Mork Family Scholars Program.
The New York Times featured the USC U.S.-China Institute documentary “Assignment: China — The Week that Changed the World.” The documentary, part of a series, examines media coverage of the 1972 Nixon trip that reshaped U.S.-China relations after a quarter century of isolation and hostility. “People look back now and take it for granted that the outcome was preordained,” said the institute’s Mike Chinoy, who produced the documentary. Voice of America also featured the story.
Los Angeles Times featured the Oscar Senti-meter, a tool developed by the USC Annenberg School, Los Angeles Times and IBM that analyzes thousands of tweets about the Academy Awards nominees. The story noted that Mexican actor Demian Bechir received an enormous boost on Twitter the day of the nominations, with a total of 6,893 tweets mentioning him, a 47-fold increase from the day before. The story noted the tool uses language-recognition technology developed in collaboration with USC Viterbi School’s Signal Analysis and Interpretation Lab.
The Times of India (India) featured a three-day medical emergency training workshop organized in association with USC. At the workshop, held at GCS Medical College in India, 50 doctors and more than 100 paramedics learned how to improve emergency support systems. William Mallon of the Keck School of USC said that discussion topics included the use of portable ultrasonic devices to scan patients. “The ultrasound applications help physicians make accurate and timely decisions,” he noted. Daily News & Analysis (India) also featured the workshop.
-
-
Campus News
- Capital Connections
- USC faculty, staff and alumni in Washington, D.C., and Sacramento
- In Print
- New and recent books written or edited by USC faculty and staff
- Family Matters
- Achievements and awards
- Obituaries
