Page 222 - AI for Good-Innovate for Impact Final Report 2024
P. 222

AI for Good-Innovate for Impact



                      Use case – 52: Computer Network Fusion Video Brain









                      Country: China

                      Organization: China Mobile Communications Corporation Co., Ltd

                      Contact person: Zhanmei, Zhang;  13802881237@139.com


                      52�1� Use case Summary Table


                       Domain                       Industry, Innovation, and Infrastructure
                       The Problem to be addressed  •  Massive video data is mainly monitored through manual
                                                       viewing, which requires a lot of human resources and
                                                       cannot monitor video content in real time.
                                                    •  The analysis of massive video data in the central node
                                                       consumes a lot of network bandwidth resources.
                                                    •  For intelligent video analysis algorithm training, the lack
                                                       of sample data makes it difficult to improve the accuracy.
                                                    •  Video analysis only through the traditional target detec-
                                                       tion small model is prone to produce a large number of
                                                       false positives, requiring a lot of manpower to audit.

                       Key aspects of the solution  •  In order to meet the business expansion needs of
                                                       massive video access, large models combined with small
                                                       models will be introduced to complete video analysis:
                                                    1.  Based on the combination of high reference quantity
                                                       and strong feature capture ability of large model and
                                                       high flexibility of small model, it can effectively realize
                                                       efficient analysis of video.
                                                    2.  In order to use large models as feature extractors,
                                                       perform preliminary analysis of videos, infer image
                                                       events and behaviors, and extract useful feature informa-
                                                       tion from videos, such as color, texture, shape, motion,
                                                       etc. The feature information extracted from the large
                                                       model is further analyzed and predicted by using the
                                                       small target detection model

                       Technology keywords          Cloud edge collaboration; Large and small model collabo-
                                                    ration

                       Data availability            https:// huggingface .co/ datasets; https:// public .roboflow
                                                    .com/

                       Metadata (type of data)      Structured and unstructured data














                  206
   217   218   219   220   221   222   223   224   225   226   227