AAP Recommendation

P.565: Framework for creation and performance testing of machine learning based models for the assessment of transmission network impact on speech quality for mobile packet-switched voice services

Study Group
12

Study Period
2017-2020

Consent Date
2019-12-05

Approval Date
2020-01-13

Provisional Name
P.VSQMTF

Input used for Consent
TD 1020R2-GEN
TD 1035-GEN (A5 TD)

Status
A

IPR
Site

The output of the framework is a machine learning based speech quality prediction model. The model predicts the impact on speech quality of the IP transport and the underlying transport, as well as of the jitter buffer in the end client, and thus provides a network-centric view of the speech quality delivered on mobile packet-switched networks. The prediction is expressed as a MOS-LQO under the assumption of an otherwise clean transmission, without background noise, automatic gain control, voice enhancement devices, transcoding, bridging, frequency response, clock drift or any other impairment not caused by the IP transport and underlying transport.
Models according to this framework use information on the temporal structure of the reference signal to identify the importance of individual sections of the bitstream with regard to speech quality. These models do not perform any perceptual analysis of the recorded speech signal.
The framework specifies three modules required for the development of this kind of metric: the database generator module, the machine learning module, and the validation module for the trained model. It also describes the database content and the features used by the machine learning algorithm, and provides a large set of test vectors in the form of error pattern files (jitter and packet loss) for learning and validation. The Recommendation specifies the minimum required performance, the conditions and requirements for an additional independent validation of models developed based on the framework, and implementation requirements.
Models developed based on the framework enable the assessment of transmission network impact on speech quality for mobile packet-switched voice services. They therefore give operators and regulators alike a fast and easy means of speech quality trend monitoring, benchmarking and troubleshooting.
In addition, if predictors according to this framework are used together with perceptual speech quality metrics such as P.863, it is possible to identify whether the source of a problem lies inside or outside the transport network observed by the predictor. This allows a more detailed analysis of the situation and enables troubleshooting of less obvious degradations, such as those arising outside the transport network (e.g., from automatic gain control, voice enhancement devices, transcoding or analog processing).
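The combined use of a network-centric predictor and a perceptual metric can be illustrated with a minimal sketch. It is not part of the Recommendation: the function name `locate_degradation`, the labels it returns, and the threshold of 3.0 are all hypothetical choices made for illustration, and the summary does not define any such threshold.

```python
# Illustrative only: the threshold and the labels are assumptions,
# not values taken from Recommendation P.565 or P.863.
DEGRADED_BELOW = 3.0  # hypothetical MOS-LQO level treated as "degraded"


def locate_degradation(network_mos: float, perceptual_mos: float,
                       degraded_below: float = DEGRADED_BELOW) -> str:
    """Compare a network-centric MOS-LQO with a perceptual MOS-LQO.

    network_mos    -- score from a predictor built per this framework
                      (reflects IP transport and jitter buffer only)
    perceptual_mos -- score from a perceptual metric such as P.863
                      (reflects the complete end-to-end signal)
    """
    if perceptual_mos >= degraded_below:
        # The end-to-end signal is acceptable, so nothing to localize.
        return "none"
    if network_mos < degraded_below:
        # The transport network alone already explains a low score.
        return "transport network"
    # End-to-end quality is poor although the transport looks clean,
    # e.g. gain control, voice enhancement, transcoding, analog parts.
    return "outside transport network"


if __name__ == "__main__":
    print(locate_degradation(network_mos=4.2, perceptual_mos=2.1))
```

In this sketch a clean network score combined with a poor perceptual score points to a cause outside the observed transport network, which is the case the summary highlights as otherwise hard to troubleshoot.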

For the purpose of the AAP Last Call the electronic attachment to this Recommendation is located at https://www.itu.int/ifa/t/2017/sg12/exchange/plenary/AAP/P.565-attachment.zip

AAP Current Status
Step # | Action | Start / End | Status | Announcement | Related documents | Comments / Resolution logs