Description:
|
1 Motivation
Terminal and network equipment increasingly includes complex signal processing techniques, including artificial intelligence (AI). In addition, super-wideband and fullband systems are more and more established in the market. Most devices cannot be regarded as linear, time-invariant systems anymore. The subjectively relevant transmission characteristics of such equipment need to be correctly determined using adequate measurement methods. There is a need of having reproducible and well-defined measurement methods, which can be used for certification labs as well as for developers, and which ideally should be combined to one quality value.
Test signals and analysis techniques for use in telephonometry have been continuously collected in previous study periods. New test signals allow evaluating many different parameters more realistically and are no longer limited to narrowband and wideband signals. However, there is still lack of analysis methods for mixed content such as speech and music. Modern speech codecs allow the transmission of signals of any kind. Existing test methods and to some extent also signals need to be adapted, since they may no longer be appropriate for latest signal processing methods. In addition, the interaction of consecutive signal processing blocks in an end-to-end connection that may degrade overall quality need to be investigated more in detail.
The evaluation methodologies for speech and audio processing are still incomplete and need further improvement. New technologies in hands-free, conference systems, and speech processing require the adaptation of existing testing methodologies and the study of new procedures. There is a need to produce new or update existing product-/application-oriented Recommendations for future hands-free communication terminals that support e.g., conferencing, audio-visual aspects, or immersive audio services.
The following major deliverables, in force at the time of approval of this Question, fall under its responsibility: P.50, P.59, P.300, P.310, P.311, P.313, P.330, P.340, P.341, P.342, P.381, P.382, P.383, P.501, P.502, P.505.
2 Question
The following items are to be considered within the study of the Question, special consideration should be given to super-wideband/fullband systems, signal processing of circuit-switched and packet-based terminals, and maintenance of existing Recommendations:
- What kind of new complex signal processing used in terminals, systems and networks may influence speech and audio transmission quality and what objective testing methodology can be used?
- What kind of techniques can be used to simulate time-variant use and time-variant behaviour of telecommunication equipment?
- What additional type of test signals and testing techniques are needed for wideband, super-wideband and fullband transmission systems?
- Which type of test signals and analysis procedures can be used for spatial audio?
- Which type of test signals can be used for AI-based signal processing in terminals?
- Which test signals other than speech and noise are needed and how can they be defined?
- Which test signals can be used for the simulation of noisy environments?
- What methods are suitable for the objective assessment of background noise transmission and to what extent can the background noise transmission be assessed without the knowledge of the near-end background noise signal?
- What testing methods/signals can be used to optimize background noise transmission in combination with VAD and comfort noise insertion techniques?
- What testing methods are needed for speech and audio enhancement devices and what are the limits for the different quality determining parameters identified?
- What are the consequences on the speech quality of speech processing implemented in hands-free terminals and new types of conferencing devices for, e.g., Smart Home? What characteristics and limits can apply?
- What characteristics and limits can apply to other speech processing techniques such as speech recognition systems?
- What are the implications of the interaction between terminal signal processing and network signal processing on speech quality?
- How can existing and/or new speech quality parameters be combined to a single speech quality representation covering all conversational aspects?
3 Tasks
Tasks include, but are not limited to:
- improve/adapt existing test signals and objective speech quality testing methodologies;
- identify and study new basic objective testing methodologies in telecommunications;
- identify and study new basic objective testing methodologies for audio;
- identify and study new basic objective testing methodologies for spatial audio;
- identify and study new testing methodologies for real-time signal processing techniques used e.g., in ICC (in-car communication);
- identify and study new testing methodologies for background noise transmission quality;
- identify and study the impact of time-variant user behaviour and time-variant signal processing by defining new test methods and setups;
- improve testing methods for speech enhancement devices;
- add new and improve existing testing methodologies for modern hands-free and conference terminals;
- study applications to multichannel sound pick-up (microphone arrays) and multichannel/multi-device sound reproduction (including stereo and immersive audio).
- maintenance of Recommendations: P.50, P.59, P.300, P.310, P.311, P.313, P.330, P.340, P.341, P.342, P.381, P.382, P.383, P.501, P.502, P.505.
An up-to-date status of work under this Question is contained in the SG12 work programme at https://itu.int/ITU-T/workprog/wp_search.aspx?sp=18&q=6/12.
4 Relationships
Recommendations:
- P.57, P.58, P.64, P.79, G.161, G.168, G.169, P.1100, P.1110, P.1120, P.1130, P.1140, P.1150, P.370, P.380, P.570, P.581, P.700
Questions:
- 4/12, 5/12, 7/12, 9/12, 10/12
Study groups:
- ITU-T SG21
Other bodies:
- ETSI TC STQ, 3GPP SA4, TIA, IEEE, IEC
WSIS Action Lines:
- C2
Sustainable Development Goals:
- 9
|