 |
Science Fiction
Dictionary
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
|
 |
Visual Speech Recognition - When Will HAL Read Lips For Real?
Visual Speech Recognition, also known as automated lip reading, is a field with a special meaning for science fiction fans. In the film 2001:A Space Odyssey, the HAL 9000 computer was able to read lips.

(HAL 9000 [background] eavesdrops on astronauts Poole and Bowman)
In the film, HAL's increasingly erratic behavior becomes a matter of concern for the astronauts. Since HAL can effectively monitor every part of the ship, the astronauts retire to a small pod to discuss the matter. Unfortunately, it turns out that somebody did research on computer lip-reading, and so HAL was on to them, with very unfortunate results for Poole.
In a recent paper, Ahmad Hassanat at Mu’tah University in Jordan provides a review of existing approaches, and suggestions for moving forward with VSR. He also outlines some of the challenges in actually creating a computer able to read lips, like the fictional HAL 9000.

(From Visual Speech Recognition chart)
The fundamental process of lip reading is to recognize a sequence of shapes formed by the mouth and then match it to a specific word or sequence of words.
There is a significant challenge here. During speech, the mouth forms between 10 and 14 different shapes, known as visemes. By contrast, speech contains around 50 individual sounds known as phonemes. So a single viseme can represent several different phonemes.
And therein lies the problem. A sequence of visemes cannot usually be associated with a unique word or sequence of words. Instead, a sequence of visemes can have several different solutions.
The first problem for automated lip reading is face and lip recognition. This has improved in leaps and bounds in recent years. A more difficult challenge is in recognizing, extracting and categorizing the geometric features of the lips during speech.
This is done by measuring the height and width of the lips as well as other features such as the shape of the ellipse bounding the lips, the amount of teeth on view and the redness of the image, which determines the amount of tongue that is visible.
Determining the exact contour of the lips is hard because of the relatively small difference between pixels showing face and lips.
Another problem is that some people are more expressive with their lips than others so it easier to interpret what they are saying from lip movements alone. Indeed, some people hardly move their lips at all and these so-called “visual-speechless persons” are almost impossible to interpret.
Hassanat’s own visual speech recognition system is remarkably good. His experiments achieve an average success rate of 76 percent, albeit in carefully controlled conditions. The success rate is even higher for women because of the absence of beards and mustaches.
Technovelgy readers may want to recall that, even in the surveillance classic 1984, the telescreen was always on, but whether or someone was watching was not clear.
There was of course no way of knowing whether you were being watched at any given moment. How often, or on what system, the Thought Police plugged in on any individual wire was guesswork.
With Visual Speech Recognition, thought, your conversation with others could be surveilled by machines even if people are not watching.
Via Technology Review and Visual Speech Recognition
Scroll down for more stories in the same category. (Story submitted 9/17/2014)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
Would
you like to contribute a story tip?
It's easy:
Get the URL of the story, and the related sf author, and add
it here.
Comment/Join discussion ( 0 )
Related News Stories -
("
Artificial Intelligence
")
LLM 'Cognitive Core' Now Evolving
'Their only check on the growth and development of Vulcan 3 lay in two clues: the amount of rock thrown up to the surface... and the amount of the raw materials and tools and parts which the computer requested.' - Philip K. Dick, 1960.
When Your Child's Best Friend Is An AI
'Figments of his mind in one sense, of course, for he had shaped them...' - Clifford Simak, 1963.
Australian Authors Reject AI Training Of Llama
'It's done with a flip of the third joint of the tentacle on the down beat.' - Anthony Boucher, 1943.
Does AI Provide A Way Forward For Talk Therapy
'And there in the next room by the sofa sat a familiar suitcase, that of his psychiatrist Dr. Smile.' - Philip K. Dick, 1965.
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
|
 |
Science Fiction
Timeline
1600-1899
1900-1939
1940's 1950's
1960's 1970's
1980's 1990's
2000's 2010's
Current News
LLM 'Cognitive Core' Now Evolving
'Their only check on the growth and development of Vulcan 3 lay in two clues: the amount of rock thrown up to the surface... and the amount of the raw materials and tools and parts which the computer requested.'
Has Elon Musk Given Up On Mars?
'There ain't no such thing as a free lunch.'
Bacteria Turns Plastic Into Pain Relief? That Gives Me An Idea.
'I guess there's nobody round this table who doesn't have a Crosswell [tapeworm] working for him in the small intestine.'
When Your Child's Best Friend Is An AI
'Figments of his mind in one sense, of course, for he had shaped them...'
China's Drone Mothership Can Carry 100 Drones
'So the parent drone carries a spotter that it launches...'
Drones Recharge In Mid-Air Like Jets Refuel!
'...nurse drones that would cruise around dumping large amounts of power into randomly selected pods.'
Australian Authors Reject AI Training Of Llama
'It's done with a flip of the third joint of the tentacle on the down beat.'
Is China Mining Helium-3 On The Moon's Farside?
'...for months Grantline bores had dug into the cliff.'
Maybe It's Too Soon To Require Autonomous Mode
'I hope all those other cars are on automatic,' he said anxiously.
Is Agentic AI The Wrong Kind Of Smartness?
'It’s smart enough to go wrong in very complicated ways, but not smart enough to help us find out what’s wrong.'
Heat Waver - The First Ever Combo Solar Collector And Wind Turbine
'...like a spray of tulips mounted fanwise.'
Tesla 'Fleet Response Agents' Bolster FSD Autonomy
'You hate the whole idea that some bored drone pusher in a remote driving centre has got your life... in his hands.'
Mori3 Autonomous Shapeshifting Robot
'My homeland is being threatened by the Replicators. Thus far all attempts to stop them have failed.'
Tesla Seeks 'Tesla Robotaxi' And 'Robobus' Trademarks Ignoring Prior Art
'A robobus had just rolled up to the curb.'
Scary Grid Safety Robots
'The ultimate horror for our paranoid culture...'
Does AI Provide A Way Forward For Talk Therapy
'And there in the next room by the sofa sat a familiar suitcase, that of his psychiatrist Dr. Smile.'
More SF in the News Stories
More Beyond Technovelgy science news stories
|
 |