Page 741 - AI for Good Innovate for Impact
P. 741

AI for Good Innovate for Impact



                   Use Case 3: From Silence to Speech: A Novel AI System for Speech

               Development for the Deaf Through Lip-Reading                                                         4.9: Accessibility














               Organization: Huazhong University of Science and Technology

               Country: China
               Contact Person(s):

                    Ran Wang, rex _wang@ hust .edu .cn
                    Yang Xiao, Yang _Xiao@ hust .edu .cn


               1      Use Case Summary Table

                Item               Details

                Category           Accessibility

                Problem Addressed By leveraging Artificial Intelligence(AI) technologies to enable individuals
                                   with hearing impairments to regain vocal communication capabilities, this
                                   initiative aims to mitigate educational resource inequities stemming from
                                   the scarcity of human and equipment resources.
                Key Aspects of Solu- Develop an AI-powered lip-reading recognition system that converts
                tion               visual facial signals into textual language information through system
                                   processing. This system, integrated with standardized video materials, will
                                   serve as a self-learning guide for deaf individuals to study lip language.
                                   The platform will be disseminated through online networks to ensure
                                   enough educational resources.
                Technology         Deep Learning, Vision-language model
                Keywords
                Data Availability  Public Data source of Lip Reading: CAS-VSR-W1k [1]

                Metadata (Type of  Videos
                Data)

                Model Training and  Our vision-language model was trained and tested on public video data-
                Fine-Tuning        sets.

                Testbeds or Pilot  The AI algorithms are deployed on cloud-based servers, and the product
                Deployments        is delivered to deaf users through client-side applications.
                                   Link: [2]












                                                                                                    705
   736   737   738   739   740   741   742   743   744   745   746