Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
Scroll down for more stories in the same category. (Story submitted 12/20/2015)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
you like to contribute a story tip?
Get the URL of the story, and the related sf author, and add
Comment/Join discussion ( 0 )
Related News Stories -
Burner Generates Temporary Phone Numbers
'Interesting phone system he's got, by the way...' - John Varley, 1984.
HushMe Bluetooth Device Reinvents The Hush-A-Phone
'Talking into a hush-a-phone which he had plugged into the telephone jack...' - Robert Heinlein, 1940.
Ubiquiti FrontRow Camera Records Your Life
Why be choosy? Just upload your whole life to the Internet, and be done with it.
SmileCloud Bubloons Are Custom Clouds
'Spurgle kicked at the letter G... It was a monstrous white thing, ten feet thick, half a city block long...' - Alan Nelson, 1953.
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
IBM's Grain Of Sand Computer
'Our ancestors... thought to make the very sand beneath their feet intelligent...'
Liquid Metal Shape-Changing 'Soft Robotics'
'A mimetic poly-alloy... 'What the hell does that mean?''
The Hammock Caravan And Italo Calvino's Octavia
'Now I will tell you how Octavia, the spider-web city, is made.'
Super-Resolution Microscopy Provides '4D' Views
View the magnified interior of living cells.
Have I Seen The Tesla Roadster Story Before?
'Only it wasn't a vessel. It was an automobile...'
Watch 'Do You Trust This Computer' For Free Today
Thanks for making this available, Elon.
Self-Driving Car Ticketed
This just missed making my day.
Elon Musk Tweets Versions Of Clarke's Operation Cleanup
'Fortunately, the old orbital forts were superbly equipped for this task.'
Burner Generates Temporary Phone Numbers
'Interesting phone system he's got, by the way...'
Walmart’s Autonomous Robot Bees
Everyone loves bees.
EA Created AI That Taught Itself To Play Battlefield
Harmless fun for computer scientists.
Is Teleportation A Death Sentence?
'A long trail of dead, he thought, left across the stars...'
New Brain Scanner Lets You Move Around
'In Bob Arctor's living room his thousand dollar custom-quality cephscope crafted by Altec...'
Can An Entire Brain Be Simulated In A Computer?
'The miles of relays and photocells had given way to the spongy globe of platinum iridium about the size of the human brain.'
Physicists Try To Turn Light Into Matter
If E=mc squared, then... m=E/c squared!
Save Your Brain's Connectome, Upload Yourself Elsewhere
'You've got remote storage. How regular is the update?'
More SF in the News Stories
More Beyond Technovelgy science news stories