AAP Recommendation

F.748.20: Technical framework for deep neural network model partition and collaborative execution

Study Group
16

Study Period
2022-2024

Consent Date
2022-10-28

Approval Date
2022-12-14

Provisional Name
F.AI-DMPC

Input used for Consent
SG16-TD88-R1/PLEN (2022-10)

Status
A

IPR
Site

Deep neural network (DNN) model inference process usually requires a large amount of computing resources and memory. Therefore, it is difficult for end devices to perform DNN models independently. It is an effective way to implement end-edge collaborative DNN execution through DNN model partition, which can reduce latency and improve resource utilization at the same time. This recommendation aims to specify the technical framework of DNN model partition and collaborative execution. First, it is necessary to predict the overall inference latency under the current system state according to different DNN partition strategies in advance. Then, choose the appropriate partition locations and collaborative execution strategy based on the equipment computation capabilities, network status and DNN model properties. Finally, implement the model collaborative execution and optimize the resource allocation in the meanwhile.

AAP Current Status
Step # Action
Start / End
Status Announcement Related documents Comments / Resolution logs