Real-Time Audio-Visual ... |
Real-Time Audio-Visual Automatic Speech Recognition DemonstratorLeader: Alexanderos Potamianos, TSI-TUC
Objective of the showcase project: One of the most promising approaches to improve the performance and extend the applicability of Automatic Speech Recognition (ASR) systems is to integrate visual information into the recognition process. Towards practically deployable AV-ASR, we build a proof-of-concept laptop-based AV-ASR prototype which: (i) uses consumer microphone and camera to capture the speaker; (ii) performs visual/audio feature extraction, as well as speech recognition on the laptop in real-time; (iii) is robust to failures of a single modality, such as visual occlusion of the speaker's face; and (iv) automatically adapts to changing acoustic noise levels. Ongoing Work: Back-End, Fusion Module, Graphical User Interface, Integration Video of the Real-Time Audio-Visual Automatic Speech Recognition Demonstrator Showcase |