Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper shows that the same approach can substantially improve transcription performance for two very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
(Story submitted 12/20/2015)