Robust Audio-Visual Speech Recognition Using Noise-Resilient Attention and Dynamic Feature Fusion

Likely AI

www.researchsquare.comSubmitted April 30, 2026

Speech recognition systems are significantly degraded in noisy environments, where audio-only methods struggle with background noise and overlapping speech. Audio-visual speech recognition (AVSR) leverages the strengths of audio and visual modalities to enhance performance. This paper presents...

The Verdict

ClassificationLikely AI

ConfidenceMedium confidence

Community Verdict

Be the first to vote on this assessment.

Embed Badge

Add this badge to your site to show the AI classification for this content.

[![Real Press](https://real.press/api/badge/2c78d8b7-bbbf-4271-a245-4c58f5b183f1)](https://real.press/content/2c78d8b7-bbbf-4271-a245-4c58f5b183f1)