Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper shows that the same approach can vastly improve transcription performance for two very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
(Story submitted 12/20/2015)