Deep Speech 2: Mandarin and English Recognized

End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.

We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.

As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!

One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...

Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.

Scroll down for more stories in the same category. (Story submitted 12/20/2015)

Follow this kind of news @Technovelgy.

| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |

Would you like to contribute a story tip? It's easy:
Get the URL of the story, and the related sf author, and add it here.

Comment/Join discussion ( 0 )

Related News Stories - (" Communication ")

Biggest HiSeas 'Mars Mission' Problem? No Internet
I think sf writers have this covered!

Sansar Social Virtual Reality Platform In 2017?
'And just as a daydreamer forgets his actual surroundings, and sees other realities...' - Vernor VInge, 1981.

Publishing Technologies In Science Fiction
In response to a reader question, a set of links related to publishing technologies in science fiction

Hurdl PIXL Wearable Helps Fans Connect With Stars
Like Macross Plus!

 

Google
  Web TechNovelgy.com   

Technovelgy (that's tech-novel-gee!) is devoted to the creative science inventions and ideas of sf authors. Look for the Invention Category that interests you, the Glossary, the Invention Timeline, or see what's New.

 

 

 

 

 

Current News

Biggest HiSeas 'Mars Mission' Problem? No Internet
I think sf writers have this covered!

Clever Electric Truck Generates More Power Than It Uses
Better than a fictional electrotruck!

Eden-ISS, Greenhouse In Antarctica
'With this kind of light we could get the gardens going again."

Make Space Tools On The Spot (Like Moties)
'A moment ago it was squeezing silver toothpaste in a ribbon...'

Will Robots Be Moral If We Raise Them Like Our Children?
'The birth of Machine, my robot child...'

Foldable Galaxy Phones, I Swear They're Coming (Maybe)
How hard can it be?

Bacteria Behave Differently In Space
'The Republic struggled to control its Sours...'

Brain Connected To Internet - ‘Brainternet'
Fascinating!

Artificial Spider Silk
You can also use it to make a roof - on an asteroid.

MIT Tunes Ions For Frictionless Surface - Superlubricity!
'My telelubricator here neutralizes the interatomic bonds the surface of any solid...'

Seiko Astron Always Knows Your Time Zone
'Harrington glanced at his wrist watch - a bulky affair - and whistled.'

Robot Buddhist Priest Chants, Drums
'He crossed the waiting room to the Padre booth...'

Koniku Kore, Mouse Brain-Based Chip, Detects Explosives
'As a matter of fact, this mouse is going to keep on thinking forever.'

CNH Industrial Autonomous Tractor Concept Video
'...the tiny red glints of self-guided tractors.'

The Neuroon Open Sleep Tracker For Lucid Dreaming
'Leads trail away from insertion points on her face and wrist... to a lucid dreamer on the bedside shelf.'

Siri Now Smoother, Perkier (Thanks, Deep Learning!)
'Good morning, Dr. Chandra. This is Hal.'

More SF in the News Stories

More Beyond Technovelgy science news stories

Home | Glossary | Invention Timeline | Category | New | Contact Us | FAQ | Advertise |
Technovelgy.com - where science meets fiction™

Copyright© Technovelgy LLC; all rights reserved.