Lip Reading Detection using Deep Learning Model

Journal: GRENZE International Journal of Engineering and Technology
Authors: J. Manoranjini, Naga Lalith Ryakam, Chinmay Ram Rangisetty
Volume: 10 Issue: 2
Grenze ID: 01.GIJET.10.2.304 Pages: 4774-4779

Abstract

Lip reading, the practice of interpreting spoken language from visual lip movements, has attracted considerable attention because of its potential uses in noisy environments, in improving accessibility, and in supporting multimodal communication. The lip reading system presented here combines 3D convolutional layers (Conv3D), bidirectional LSTM networks, and the softmax activation function. Its primary objective is to exploit both the temporal dynamics of lip movements and the spatial characteristics of lip shapes to transcribe spoken language precisely. Modeling the spatial and temporal features together improves the model's ability to identify intricate lip shapes and the dynamic changes that occur during speech. By accurately transcribing the spoken language conveyed by visual lip cues, the system has the potential to enhance multimodal communication, increase accessibility for people with hearing impairments, and improve communication in noisy environments.
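
As a rough illustration of how the three components named in the abstract fit together, the following sketch traces tensor shapes through a Conv3D, bidirectional LSTM, and softmax pipeline and shows how softmax turns per-frame scores into probabilities. It is not the authors' model: the input size (75 frames of 46x140 grayscale lip crops), the filter counts, the spatial pooling between convolutions, the LSTM width, and the 41-symbol vocabulary are all assumed values chosen only for illustration, and the function names are hypothetical.

```python
import math

def conv3d_same_shape(shape, filters):
    """Shape after a Conv3D layer with 'same' padding and stride 1."""
    frames, height, width, _channels = shape
    return (frames, height, width, filters)

def spatial_pool_shape(shape, pool=2):
    """Shape after max pooling over height and width only (time axis kept)."""
    frames, height, width, channels = shape
    return (frames, height // pool, width // pool, channels)

def flatten_per_frame(shape):
    """Collapse the spatial and channel axes so each frame is one feature vector."""
    frames, height, width, channels = shape
    return (frames, height * width * channels)

def bilstm_shape(shape, units):
    """A bidirectional LSTM concatenates forward and backward states per frame."""
    frames, _features = shape
    return (frames, 2 * units)

def softmax(logits):
    """Numerically stable softmax over one frame's class scores."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def trace_shapes(frames=75, height=46, width=140, channels=1,
                 conv_filters=(128, 256, 75), lstm_units=128, vocab_size=41):
    """Return (stage name, shape) pairs; every default here is an assumption."""
    shape = (frames, height, width, channels)
    stages = [("input", shape)]
    for i, filters in enumerate(conv_filters, start=1):
        shape = spatial_pool_shape(conv3d_same_shape(shape, filters))
        stages.append((f"conv_block_{i}", shape))
    shape = flatten_per_frame(shape)
    stages.append(("flatten", shape))
    shape = bilstm_shape(shape, lstm_units)
    stages.append(("bilstm", shape))
    # A dense layer with softmax yields one distribution over symbols per frame.
    stages.append(("softmax", (shape[0], vocab_size)))
    return stages

if __name__ == "__main__":
    for name, shape in trace_shapes():
        print(f"{name:>12}: {shape}")
    print(softmax([1.0, 2.0, 3.0]))
```

The point of the trace is that the time axis survives every stage: the convolutional blocks shrink only the spatial dimensions, so the recurrent layers still see one feature vector per video frame, and the softmax produces one symbol distribution per frame.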

