Deep Speech 2: Mandarin and English Recognized

End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper reports that the same approach can substantially improve transcription performance for two very different languages.

We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.

As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!

One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...

Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.

(Story submitted 12/20/2015)



 


 

 

 

 

 



Copyright© Technovelgy LLC; all rights reserved.