Science Fiction Dictionary
A  B  C  D  E  F  G  H  I  J  K  L  M  N  O  P  Q  R  S  T  U  V  W  X  Y  Z

 

Visual Speech Recognition - When Will HAL Read Lips For Real?

Visual Speech Recognition, also known as automated lip reading, is a field with a special meaning for science fiction fans. In the film 2001:A Space Odyssey, the HAL 9000 computer was able to read lips.


(HAL 9000 [background] eavesdrops on astronauts Poole and Bowman)

In the film, HAL's increasingly erratic behavior becomes a matter of concern for the astronauts. Since HAL can effectively monitor every part of the ship, the astronauts retire to a small pod to discuss the matter. Unfortunately, it turns out that somebody did research on computer lip-reading, and so HAL was on to them, with very unfortunate results for Poole.

In a recent paper, Ahmad Hassanat at Mu’tah University in Jordan provides a review of existing approaches, and suggestions for moving forward with VSR. He also outlines some of the challenges in actually creating a computer able to read lips, like the fictional HAL 9000.


(From Visual Speech Recognition chart)

The fundamental process of lip reading is to recognize a sequence of shapes formed by the mouth and then match it to a specific word or sequence of words.

There is a significant challenge here. During speech, the mouth forms between 10 and 14 different shapes, known as visemes. By contrast, speech contains around 50 individual sounds known as phonemes. So a single viseme can represent several different phonemes.

And therein lies the problem. A sequence of visemes cannot usually be associated with a unique word or sequence of words. Instead, a sequence of visemes can have several different solutions.

The first problem for automated lip reading is face and lip recognition. This has improved in leaps and bounds in recent years. A more difficult challenge is in recognizing, extracting and categorizing the geometric features of the lips during speech.

This is done by measuring the height and width of the lips as well as other features such as the shape of the ellipse bounding the lips, the amount of teeth on view and the redness of the image, which determines the amount of tongue that is visible.

Determining the exact contour of the lips is hard because of the relatively small difference between pixels showing face and lips.

Another problem is that some people are more expressive with their lips than others so it easier to interpret what they are saying from lip movements alone. Indeed, some people hardly move their lips at all and these so-called “visual-speechless persons” are almost impossible to interpret.

Hassanat’s own visual speech recognition system is remarkably good. His experiments achieve an average success rate of 76 percent, albeit in carefully controlled conditions. The success rate is even higher for women because of the absence of beards and mustaches.

Technovelgy readers may want to recall that, even in the surveillance classic 1984, the telescreen was always on, but whether or someone was watching was not clear.

There was of course no way of knowing whether you were being watched at any given moment. How often, or on what system, the Thought Police plugged in on any individual wire was guesswork.

With Visual Speech Recognition, thought, your conversation with others could be surveilled by machines even if people are not watching.

Via Technology Review and Visual Speech Recognition

Scroll down for more stories in the same category. (Story submitted 9/17/2014)

Follow this kind of news @Technovelgy.

| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |

Would you like to contribute a story tip? It's easy:
Get the URL of the story, and the related sf author, and add it here.

Comment/Join discussion ( 0 )

Related News Stories - (" Artificial Intelligence ")

Harmonia Making Generative Audio Tools For Everyone
'A cacophony of stentorious metal sounds.' - Ray Cummings, 1941.

Artists Replaced By Robots? Everyone's a Patron Now
'The results would not be happy; a schizoid painting was bound to ensue.' - FL Wallace, 1953

Mem, The All-Your-Memories, Super Note-Taking App
'Life experience is linearly additive, but the correlation of memory impressions is an unlimited expansion.' - Robert Heinlein, 1941.

Copilot Software AI Training Sued By Involuntary Contributors
'...we've promised him a generous pension from the royalties.' - Anthony Boucher, 1943.

 

Google
  Web TechNovelgy.com   

Technovelgy (that's tech-novel-gee!) is devoted to the creative science inventions and ideas of sf authors. Look for the Invention Category that interests you, the Glossary, the Invention Timeline, or see what's New.

 

 

 

 

Science Fiction Timeline
1600-1899
1900-1939
1940's   1950's
1960's   1970's
1980's   1990's
2000's   2010's

Current News

'Courier Commons' By Tomorrow Lab, From Karl Schroeder (and Bruce Sterling?)
'The pokkecon rang again. *The coffee’s for him?* Tsuyoshi said.'

Terrifying Robotic Apple Harvester
'... little machines, that went from plant to plant.'

Jetson-Style Clockwork Robot Nail Salon Coming To Target Near You
The Jetsons imagined so much future.

Mechanical Horse Sculpture Gallops In Place
'Rod placed the brain inside the panel... the horse raised its head, wiggled its ears, blinked twice, gave a tentative whinny.'

'Make Sunsets' Tweaks Climate By Atmospheric Alteration
'Pina2bo would have to operate full blast for many years to put as much SO2 into the stratosphere as its namesake had done in a few minutes.'

Eviation Alice Electric Plane First Flight
'A white electric plane approached at great speed...'

Hotels Turn To Robots As Human Workers Regroup
'Chain of hotels that specialized in non-human service.'

Changesite Mineral To Be Mined On Moon By China
'But then... not every bulldozer operator works on the Moon.'

Tongue-Controlled Tong Wearable Mouth Computer
'Griff found the white and pink map distracting and switched it off using his tongue mouse.'

Is It Better To Be Short?
'He was one of the smaller, energy-saving new breed...'

Taikonaut Tai Chi Foot Loops
'Jimmy Cardigan and Harlowe, staring through the darkside port, had their feet in the foot-loops...'

Space Billboards Would Ruin Our View Of The Cosmos
'But the rising sign, as it had been designed to do, held his eyes. A vast circle of scarlet stars came up into the greenish desert dusk.'

Orion's 'Skip-to-M'Lou' Entry
'A lightning pilot possibly could land that tin toy without power and still walk away from it provided he had the skill to play Skip-to-M’Lou in and out of the atmosphere...'

MarsCat and MetaCat, Your Robot Cat Companions
'It was you who betrayed me — you and your robot cat.'

Mars Mission Using Nuclear Thermal Propulsion
'with its atomic engine as noiseless as a dancing sunbeam...'

Physiotherapists Get Help From Robots
'Most of the Members went into cold-rest; the others tended them...'

More SF in the News Stories

More Beyond Technovelgy science news stories

Home | Glossary | Invention Timeline | Category | New | Contact Us | FAQ | Advertise |
Technovelgy.com - where science meets fiction™

Copyright© Technovelgy LLC; all rights reserved.