ITU-T Study Group 16 - Question 10/16 (Study Period 2009-2012)

عربي | 中文 | Español | Français | Русский

Advanced Search

Home : ITU-T Home : Study Groups : Study Group 16

Question 10/16 – Speech and audio coding and related software tools

(Continuation of Question 10/16 and part of Question 23/16)

Motivation

The goal of this Question is to produce speech, audio and sound coding Recommendations for conversational (e.g. telephony, audioconferencing, videoconferencing and video telephony) and non-conversational (e.g., multimedia streaming, broadcast TV, IPTV, file download, media storage/playback, or digital cinema) audio/visual services. The speech and audio coding scope includes:

Narrowband (or telephony band) speech & audio coding (300-3400 Hz)
Wideband speech and audio coding (50-7000 Hz)
Superwideband speech and audio coding (50-14000 Hz)
Fullband speech and audio coding (20-20000 Hz)
Mono-to-multichannel coding

These Recommendations will be either new Recommendations or extensions of existing ITU-T speech and audio coding Recommendations, for example using advanced techniques to significantly improve the trade-offs between bit rate, quality, delay, and algorithm complexity. This Question will also be responsible for the maintenance of the existing ITU-T speech and audio coding Recommendations.

The standards developed by this Question will have sufficient flexibility to accommodate transport in a wide range of applications over a variety of transport technologies, including telephony and audio-visual services over NGN/IMS, mobile radio access networks, public and private WANs and LANs. Other applications include circuit multiplication equipment and simultaneous voice and data services.

Additionally, the question will continue the development of the G.191 software tools library (STL). G.191 provides a common set of tools for use in ITU-T standardization activities on speech and audio coding, including a library of portable, inter-workable and reliable software routines. It has been substantially improved over successive releases, and requirements for further extensions and tools have already been identified to process wider audio bandwidth signals.

Study items

Study items to be considered include, but are not limited to:

Speech and audio coding algorithms to extend existing ITU-T speech and audio coding recommendations or to create new ones in order to achieve the following objectives:
- enhancements in quality at a given audio bandwidth (including pre- and post-processing functions such as noise suppression techniques)
- enhancements in quality obtained by increasing the audio bandwidth and/or the number of channels
- improvements in compression efficiency and flexibility as provided by scalability in bandwidth and bit rate or by discontinuous transmission and comfort noise generation algorithms
- robust operation (e.g. with packet loss concealment methods) in error/loss-prone environments such as non-guaranteed-bandwidth packet networks or mobile wireless communication
- reduction of real-time delay with the purpose of reducing quality degradation due to end to end latency in conversational applications
- reduction of complexity
- lossless data compression for existing ITU-T speech and audio coding Recommendations
Maintenance of existing ITU-T speech and audio coding standards and of ITU-T software tool library through collection of defect reports, assessment on their merit, and identification of the appropriate course of action
Extensions to ITU-T software tool library for signal processing standardization activities
Compressed data formats to support packetization and streaming
Development of supplemental enhancement information to accompany speech and audio data for enabling enhanced functionality in application environments (e.g. metadata, spatialisation information)
Methods to allow streams to be easily mixed by MCUs or terminals
Techniques to permit networks or terminals to adjust the bit rate of speech and audio streams efficiently (e.g. scalability feature)
Techniques for efficient compressed-digital to compressed-digital processing (including transcoding)
The impact of quality control requirements on speech and audio codec development
Security aspects that directly affect speech and audio coding including watermarking techniques
Parameter extraction from audio, in support of applications such as speech recognition, speaker verification, biometrics applications, etc.
Study and specification of data for speech/audio annotation, indexing, and searching
Considerations on how to help measure and mitigate climate changes

Tasks

Tasks include, but are not limited to:

Extensions of existing G-series speech and audio coding Recommendations, including G.711, G.711.1, G.718 (in coordination with Q9/16), G.719, G.722, G.722.1, G.722.2, G.726, G.727, G.728, G.723.1, G.729, G.729.1
Maintenance of existing G-series regarding speech / audio coding and signal processing Recommendations including G.191, G.192, G.711, G.726, G.727, G.728, G.723.1, G.729, G.722, G.722.1, G.722.2, G.729.1
Development of new speech and audio coding recommendations
Upgrade of ITU-T G.191 Software Tool Library to support to ITU-T signal processing activities, e.g.:
- superwideband and fullband audio processing
- channel models, error patterns and statistics for packet-based networks (including IP and Internet), wireless networks and mobile-satellite systems
- identify techniques for verification of the correct implementation of algorithms

An up-to-date status of work under this Question is found in the SG 16 work programme (http://itu.int/ITU-T/workprog/wp_search.aspx?isn_sg=554).

Relationships

Recommendations:

G.718 (interoperable mode extensions planned in 2009-2012)

G.16X-series speech enhancement Recommendations

G.76X-series circuit multiplication Recommendations

G.799.X-series voice over IP gateway Recommendations

H.300-series system Recommendations

P.800-series

Questions:

1, 2, 4, 6, 7, 8, 9, 14, 15, 16, 18, 22, 26, 28/16

Study Groups:

ITU-T SG 9 on speech and audio coding aspects of digital cable systems and IPTV

ITU-T SG 12 for speech and audio coding quality performance assessment and software tools matters

ITU-R SG 6 on terrestrial and satellite broadcasting services

Other Bodies:

3GPP and 3GPP2

ETSI DECT and TISPAN

IETF for speech and audio packetization issues

IMTC

IP/MPLS Forum

ISO/MPEG

TIA