Enhancing User Sentiment Analysis in Video
Interviews: Leveraging NLP and Facial Emotion
Recognition for Comprehensive Analysis
Journal:
GRENZE International Journal of Engineering and Technology
Authors:
Nikhila M R, Sushma B S, Lakshmaiah L, Geetha M, Suresha S
Volume:
10
Issue:
2
Grenze ID:
01.GIJET.10.2.438
Pages:
5425-5431
Abstract
In today's digital age, video interviews are prevalent, enabling remote interactions and
necessitating user sentiment analysis for insights in recruitment, market research, and
psychology. This paper outlines an advanced approach using NLP and facial emotion recognition
for enhanced sentiment analysis in video interviews. It employs multimodal emotion detection
models: audio-based emotion prediction via CNN for tone recognition, CNN-based facial emotion
recognition (with optional media pipe face landmarks), and an LSTM network for speaker
emotion prediction from video clip audio and image sequences. Hyperparameter tuning ensures
optimal accuracy across datasets. OpenCV detects faces, extracting expressions for emotion
analysis, while speech-to-text analysis via the Google Cloud Speech-to-Text API aids in accurate
transcription. Text sentiment analysis with Scikit-learn's Naive Bayes classifier further enhances
understanding. This integrated method captures verbal and non-verbal cues, facilitating nuanced
sentiment analysis for informed decision-making in diverse domains.