Page 222 - AI for Good-Innovate for Impact Final Report 2024


Use case – 52: Computer Network Fusion Video Brain

Country: China

Organization: China Mobile Communications Corporation Co., Ltd

Contact person: Zhanmei, Zhang; 13802881237@139.com

52.1 Use case Summary Table

Domain: Industry, Innovation, and Infrastructure
The Problem to be addressed:
• Massive volumes of video are monitored mainly by manual viewing, which requires substantial human resources and cannot cover video content in real time.
• Analyzing massive video data at the central node consumes a large amount of network bandwidth.
• Training intelligent video analysis algorithms is hampered by a lack of sample data, which makes it difficult to improve accuracy.
• Video analysis that relies only on a traditional small target detection model tends to produce many false positives, which then require considerable manpower to audit.

Key aspects of the solution:
• To meet the business expansion needs of massive video access, large models are combined with small models to perform video analysis:
1. The approach combines the large parameter count and strong feature-capture ability of the large model with the high flexibility of the small model to analyze video efficiently.
2. The large model serves as a feature extractor: it performs a preliminary analysis of the video, infers image events and behaviors, and extracts useful feature information such as color, texture, shape, and motion. The small target detection model then further analyzes the extracted features and makes predictions.

Technology keywords: Cloud-edge collaboration; large and small model collaboration

Data availability: https://huggingface.co/datasets; https://public.roboflow.com/

Metadata (type of data): Structured and unstructured data
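
The large and small model collaboration described under "Key aspects of the solution" can be illustrated with a minimal sketch. It is not the organization's implementation: the names `Features`, `large_model_extract`, `small_model_detect`, and `analyze` are hypothetical, the "large model" is replaced by simple pixel statistics, and the "small model" by a single threshold (the value 0.2 is an assumption). The sketch only shows the two-stage data flow, in which one stage extracts features and a second stage makes the final decision on those features.

```python
from dataclasses import dataclass
from typing import List, Sequence

# A grayscale frame as rows of pixel intensities in [0, 1] (assumed format).
Frame = Sequence[Sequence[float]]


@dataclass
class Features:
    """Feature summary produced by the stand-in large model."""
    brightness: float  # mean pixel intensity of the current frame
    motion: float      # mean absolute pixel change from the previous frame


def large_model_extract(prev: Frame, curr: Frame) -> Features:
    """Hypothetical stand-in for the large model used as a feature extractor."""
    curr_pixels = [p for row in curr for p in row]
    prev_pixels = [p for row in prev for p in row]
    brightness = sum(curr_pixels) / len(curr_pixels)
    motion = sum(abs(a - b) for a, b in zip(curr_pixels, prev_pixels)) / len(curr_pixels)
    return Features(brightness=brightness, motion=motion)


def small_model_detect(features: Features, motion_threshold: float = 0.2) -> bool:
    """Hypothetical stand-in for the small target detection model.

    It decides on the extracted features only, not on the raw frame.
    """
    return features.motion >= motion_threshold


def analyze(frames: List[Frame]) -> List[int]:
    """Return indices of frames flagged as events by the two-stage pipeline."""
    flagged = []
    for i in range(1, len(frames)):
        features = large_model_extract(frames[i - 1], frames[i])
        if small_model_detect(features):
            flagged.append(i)
    return flagged
```

For example, with a sequence of two static frames followed by a frame in which half of the pixels change, `analyze` flags only the frame where the change occurs. In a real deployment both stand-ins would be replaced by trained models.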
