Abstract
Speech recognition systems are crucial for enabling human-computer interactions, particularly for the hearing- impaired, who use them to communicate with the listening community. However, conventional speech recognition techniques faced new challenges during the Coronavirus outbreak, as protective masks obscured lip movements. To address this, a new Radio Frequency (RF) based lip-reading framework has been proposed that can read lips under face masks. This framework uses radar technology to enable RF-sensing-based lipreading, presenting the Doppler changes in the received signal due to lip movements in the form of time-frequency diagrams.To classify phonetic characters, the system uses a multi-layered deep learning neural network called MobileNet-v2, which has been optimized to achieve high accuracy on a validation dataset while avoiding overfitting the training data. The optimization process involved conducting thorough research on 12 versions of the network, each with its unique hyperparameter combination, and evaluating their performance on the validation dataset to identify the version that showed the most promising results in generalizing to new data. The selected MobileNet-v2 version achieved an impressive classification performance of 99.16% accuracy when applied to classify the considered phonetic characters.The system’s performance and testing metrics include the confusion matrix, accuracy calculation, and the ROC Curve graphic. Overall, this RF-based lip-reading framework has significant potential in enhancing speech recognition systems’ performance, particularly during situations where conventional techniques face limitations, such as mask-wearing during pandemics.
Original language | English |
---|---|
Title of host publication | 2023 IEEE International Radar Conference (RADAR) |
Publisher | IEEE |
Number of pages | 6 |
ISBN (Electronic) | 9781665482783 |
ISBN (Print) | 9781665482790 |
DOIs | |
Publication status | Published - 28 Dec 2023 |
Externally published | Yes |
Event | 2023 IEEE International Radar Conference - Sydney, Australia Duration: 6 Nov 2023 → 10 Nov 2023 https://www.radar2023.org/ (Link to conference website) |
Publication series
Name | |
---|---|
ISSN (Print) | None |
Conference
Conference | 2023 IEEE International Radar Conference |
---|---|
Abbreviated title | RADAR 2023 |
Country/Territory | Australia |
City | Sydney |
Period | 6/11/23 → 10/11/23 |
Internet address |
|
Keywords
- Radio frequency
- Time-frequency analysis
- Lips
- Face recognition
- Radar detection
- Training data
- Speech recognition
- COVID-19
- deep learning
- RF sensing
- radar waves
ASJC Scopus subject areas
- Signal Processing
- Instrumentation
- Computer Networks and Communications