Page 741 - AI for Good Innovate for Impact


Use Case 3: From Silence to Speech: A Novel AI System for Speech Development for the Deaf Through Lip-Reading

4.9: Accessibility

Organization: Huazhong University of Science and Technology

Country: China
Contact Person(s):

Ran Wang, rex_wang@hust.edu.cn
Yang Xiao, Yang_Xiao@hust.edu.cn

1 Use Case Summary Table

Item: Details

Category: Accessibility

Problem Addressed: By leveraging Artificial Intelligence (AI) technologies to enable individuals with hearing impairments to regain vocal communication capabilities, this initiative aims to mitigate educational resource inequities stemming from the scarcity of human and equipment resources.

Key Aspects of Solution: Develop an AI-powered lip-reading recognition system that converts visual facial signals into text. This system, integrated with standardized video materials, will serve as a self-learning guide for deaf individuals studying lip reading. The platform will be distributed online so that sufficient educational resources are available.

Technology Keywords: Deep Learning, Vision-language model

Data Availability: Public. Data source for lip reading: CAS-VSR-W1k [1]

Metadata (Type of Data): Videos

Model Training and Fine-Tuning: Our vision-language model was trained and tested on public video datasets.

Testbeds or Pilot Deployments: The AI algorithms are deployed on cloud-based servers, and the product is delivered to deaf users through client-side applications. Link: [2]
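
As an illustration of how recognized text could feed back into self-study, the following minimal Python sketch compares the text returned by a lip-reading model with the target word or phrase from a standardized video. It is an assumption made for illustration only and is not the system described above: the function names `edit_distance` and `practice_score`, and the choice of a normalized Levenshtein distance as the score, are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum inserts, deletes, and substitutions."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def practice_score(target: str, recognized: str) -> float:
    """Hypothetical score in [0, 1]; 1.0 means recognized text equals target."""
    if not target and not recognized:
        return 1.0
    longest = max(len(target), len(recognized))
    return 1.0 - edit_distance(target, recognized) / longest


if __name__ == "__main__":
    # Hypothetical example: the learner practised the target "你好" and the
    # recognizer returned the same text.
    print(practice_score("你好", "你好"))
```

A client application could show such a score after each practice attempt. How the deployed product actually evaluates a learner is not stated in this summary.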
