Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
Scroll down for more stories in the same category. (Story submitted 12/20/2015)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
you like to contribute a story tip?
Get the URL of the story, and the related sf author, and add
Comment/Join discussion ( 0 )
Related News Stories -
Ubiquiti FrontRow Camera Records Your Life
Why be choosy? Just upload your whole life to the Internet, and be done with it.
SmileCloud Bubloons Are Custom Clouds
'Spurgle kicked at the letter G... It was a monstrous white thing, ten feet thick, half a city block long...' - Alan Nelson, 1953.
Fog Computing (AKA Edge Computing) Ad Hoc Networks
'The tiny devices chirped their impulse codes at one another...' - Vernor Vinge, 1999.
Biggest HiSeas 'Mars Mission' Problem? No Internet
I think sf writers have this covered!
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
A 'Genuine Nanorobotic Production Factory'
'Microscopic machinery, smaller than ants, smaller than pins, working energetically, purposefully - constructing something...'
Neuromorphic Computer Offers Non-von Neumann Architecture
Fires faster than brain at 1/10K energy.
Evorus Your Crowd-Powered Conversational Assistant
'...the DS [Daily Schedule] was suddenly transformed into a valued confidante.'
Mealworms Food Of The Future
Get your grubs on.
Alibaba's AI May Read Better Than You
'Mike ... could accept other languages and was doing technical translating - and reading endlessly.'
Musk's Boring Flamethrower
'Skeletons in tatters. Burned by a flesh gun'
Humanity Star LEO Advertisement?
'Everyone has noticed those enormous advertisements...'
Nissan ProPILOT Slippers Are Self-Parking, Autonomous
Beyond science and fiction.
Atomristors - Atomic Memristors - Using Thin Nanomaterials
'I could almost feel those little tunnel junction neuristors working, forming their own interconnections as I operated it.'
Bigelow Prepares Inflatable Lunar Hotel
'Suddenly, hitherto unheard-of sums of money became available for investment in civilian orbital stations.'
Drunk Driver Of Tesla Claims Autopilot Was In Charge
'Mr. Garden, you are in no condition to drive.'
Medical Exoskeleton From Cyberdyne Gets FDA Approval
It's been a long road for HAL-5; I started writing about it in 2005.
Fungi-Infused Concrete Repairs Itself
'I noticed that curious mottled knots were forming, indicating where the room had been strained and healed faultily.'
Shiftwear Display Shoes
'He unlaced her shoe and glanced at its readout.'
NASA SEXTANT First With X-Ray Nav In Space
'You need at least four beacons for an accurate fix.'
GM Introduces Cruise AV With No Steering Wheel
'How about the steering wheel?' ... 'I do not need one.'
More SF in the News Stories
More Beyond Technovelgy science news stories