LipNet: lip-reading AI uses machine learning

Andrea James 4:00 am Mon Nov 28, 2016

Lip-reading algorithms have all sorts of real-world applications, and LipNet, a machine-learning model, shows great promise at reading constructed sentences from the GRID sentence corpus.

From the paper "LipNet: Sentence-level Lipreading":

Lipreading is the task of decoding text from the movement of a speaker's mouth. Traditional approaches separated the problem into two stages: designing or learning visual features, and prediction. More recent deep lipreading approaches are end-to-end trainable (Wand et al., 2016; Chung & Zisserman, 2016a). All existing works, however, perform only word classification, not sentence-level sequence prediction. Studies have shown that human lipreading performance increases for longer words (Easton & Basala, 1982), indicating the importance of features capturing temporal context in an ambiguous communication channel. Motivated by this observation, we present LipNet, a model that maps a variable-length sequence of video frames to text, making use of spatiotemporal convolutions, an LSTM recurrent network, and the connectionist temporal classification loss, trained entirely end-to-end. To the best of our knowledge, LipNet is the first lipreading model to operate at sentence-level using a single end-to-end speaker-independent deep model to simultaneously learn spatiotemporal visual features and a sequence model. On the GRID corpus, LipNet achieves 93.4% accuracy, outperforming experienced human lipreaders and the previous 79.6% state-of-the-art accuracy.
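The abstract names the connectionist temporal classification (CTC) loss as the piece that lets a frame-by-frame model produce text that is shorter than the video. As a rough illustration only, and not the authors' code, here is a minimal sketch of the simplest way to read out a CTC-style prediction: pick the best symbol for each frame, merge consecutive repeats, then drop the special "blank" symbol. The tiny alphabet, the one-hot "scores", and the names `ctc_greedy_decode` and `one_hot` are all assumptions made up for this example; the paper's own decoding may differ.

```python
# Toy illustration of CTC greedy decoding (hypothetical names and data).
BLANK = "_"                        # assumed blank symbol
ALPHABET = [BLANK, "s", "o", "n"]  # assumed toy alphabet, not the paper's


def ctc_greedy_decode(frame_scores, alphabet=ALPHABET, blank=BLANK):
    """Collapse per-frame scores into text: argmax, merge repeats, drop blanks."""
    # Best symbol for each video frame.
    best_path = [
        alphabet[max(range(len(scores)), key=scores.__getitem__)]
        for scores in frame_scores
    ]
    decoded = []
    previous = None
    for symbol in best_path:
        # Keep a symbol only when it differs from the previous frame
        # and is not the blank.
        if symbol != previous and symbol != blank:
            decoded.append(symbol)
        previous = symbol
    return "".join(decoded)


def one_hot(symbol, alphabet=ALPHABET):
    """Fake 'network output' for one frame: full score on a single symbol."""
    return [1.0 if candidate == symbol else 0.0 for candidate in alphabet]


# Seven frames decode to a four-letter word; the blank between the two
# "o" frames is what keeps the double letter from being merged.
frames = [one_hot(symbol) for symbol in "sso_onn"]
print(ctc_greedy_decode(frames))  # soon
```

Without the blank frame, the same frames would collapse to "son", which is why CTC needs a blank symbol to represent doubled letters.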

• LipNet: How easy do you think lipreading is? (YouTube / Yannis Assael)
