Page 382 - AI for Good Innovate for Impact



                      Use Case 19: Bandwidth-efficient live interaction with virtual 3D demonstrator using semantic communication and GenAI











                      Organization: Tata Consultancy Services

                      Country: India

                      Contact Person(s): 
                           Abhijan Bhattacharyya, Abhijan.bhattacharyya@tcs.com
                           Ashis Sau, ashis.sau@tcs.com
                           Suraj Mahato, surajkumar.mahato@tcs.com


                      1      Use Case Summary Table

                       Item              Details

                       Category          5G
                       Problem
                       Addressed         Clearly describe the primary issue or challenge

                       Key Aspects of    1)  3D remote education experience with very low bandwidth: It enables
                       Solution              trainees or students to interact in real time with a live 3D virtual
                                             representation of a distant teacher/demonstrator while consuming very
                                             little bandwidth. The bandwidth savings are significant compared with
                                             3D and even 2D visual transmission, which require on the order of
                                             several Gbps and Mbps respectively.
                                         2)  Democratized, scalable solution with no need for specialized
                                             infrastructure: Unlike holographic 3D video transfer, it does not
                                             require any specialized sensor, camera, or studio setup at either
                                             endpoint. The teacher/demonstrator only needs an RGB camera attached
                                             to a computer. The solution can be generalized as cost-effective,
                                             scalable 3D telepresence with acceptable realism.
                                         3)  AI-native semantic communication with optional GenAI integration: It
                                             uses AI to predict the body pose and position of the distant
                                             teacher/demonstrator in real time and transmits the predicted
                                             information to the trainee/student end, where it drives a pre-stored
                                             3D body model. The resulting figure is displayed in situ using
                                             augmented reality. GenAI may be used to virtually recreate some of
                                             the objects associated with the teacher at the trainee/student's end.

                       Technology        Semantic Communication, human pose estimation, natural language
                       Keywords          processing, augmented reality.

                       Data Availability For body modelling we used the datasets used in [2]. For motion prediction
                                         we used [3].
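
To illustrate why transmitting pose information instead of pixels saves bandwidth, the following is a minimal sketch only, not the authors' implementation. The joint count (33), the frame rate (30 fps), the wire format (a 32-bit frame id followed by x, y, z as 32-bit floats per joint) and the function names are all hypothetical assumptions made for this example; the use case description does not specify them. The video figure is for raw, uncompressed 1080p RGB and serves purely as a reference point.

```python
import struct

# All constants below are illustrative assumptions, not values from the use case.
NUM_JOINTS = 33        # hypothetical number of body joints per pose
FRAME_RATE = 30        # hypothetical update rate in frames per second


def pack_pose(frame_id, joints):
    """Serialize one pose: uint32 frame id, then x, y, z as float32 per joint."""
    flat = [coord for joint in joints for coord in joint]
    return struct.pack(f"<I{len(flat)}f", frame_id, *flat)


def unpack_pose(payload):
    """Inverse of pack_pose: returns (frame_id, list of (x, y, z) tuples)."""
    n_floats = (len(payload) - 4) // 4
    frame_id, *flat = struct.unpack(f"<I{n_floats}f", payload)
    joints = [tuple(flat[i:i + 3]) for i in range(0, n_floats, 3)]
    return frame_id, joints


def bitrate_bps(bytes_per_frame, fps):
    """Bits per second needed to send bytes_per_frame at fps frames per second."""
    return bytes_per_frame * 8 * fps


# A dummy pose; in a real system these values would come from a pose estimator.
pose = [(0.5, 0.25, 1.0)] * NUM_JOINTS
payload = pack_pose(1, pose)

pose_bps = bitrate_bps(len(payload), FRAME_RATE)
raw_video_bps = bitrate_bps(1920 * 1080 * 3, FRAME_RATE)  # raw 1080p RGB frames

print(f"pose stream:     {pose_bps / 1e3:.0f} kbit/s")
print(f"raw 1080p video: {raw_video_bps / 1e9:.2f} Gbit/s")
print(f"ratio:           {raw_video_bps / pose_bps:.0f}x")
```

Under these assumptions each pose occupies 400 bytes, so the pose stream needs 96 kbit/s, against roughly 1.49 Gbit/s for the raw video frames. A deployed system would add transport overhead and might compress or quantize the pose data, so the actual figures will differ.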



