|
Work item:
|
F.748.61 (ex F.AICP-IO)
|
|
Subject/title:
|
General technical requirements and framework for artificial intelligence cloud platform - inference optimization
|
|
Status:
|
Consented on 2025-10-17
|
|
Approval process:
|
AAP
|
|
Type of work item:
|
Recommendation
|
|
Version:
|
New
|
|
Equivalent number:
|
-
|
|
Timing:
|
2025-10 (Medium priority)
|
|
Liaison:
|
ITU-T SG13, SG17
|
|
Supporting members:
|
Beijing Baidu Netcom Science Technology Co., Ltd., China Mobile Communications Co. Ltd., China Telecommunications Corporation, China Unicom, Huawei Technologies Co., Ltd., Alibaba China Co. Ltd., Hangzhou Hikvision Digital Technology Co., Ltd., Tencent Technology (Shenzhen) Company Limited
|
|
Summary:
|
With the rapid proliferation of foundation models, inference optimization is receiving increasing attention, as it enables foundation models to be deployed in cloud, edge, and endpoint scenarios, to provide extensive services efficiently, and to significantly improve the efficiency of multimedia application development. The technical requirements for inference optimization are therefore crucial. This Recommendation provides general technical requirements, including data processing, model processing, model deployment and model inference. It specifies a framework and requirements that meet the needs of foundation model inference optimization and promote the in-depth application and development of foundation models in various industries.
|
|
Comment:
|
-
|
|
Reference(s):
|
|
|
Historic references:
|
|
Contact(s):
|
|
ITU-T A.5 justification(s):
|
|
|
|
First registration in the WP:
2025-02-25 14:37:36
|
|
Last update:
2025-11-06 15:10:28
|