(Continuation of Question 10/16 and part of Question 23/16)
Motivation
The goal of this Question is to produce speech, audio and sound coding
Recommendations for conversational (e.g. telephony, audioconferencing,
videoconferencing and video telephony) and non-conversational (e.g. multimedia
streaming, broadcast TV, IPTV, file download, media storage/playback, or digital
cinema) audio/visual services. The speech and audio coding scope includes:
- Narrowband (or telephony band) speech and audio coding (300-3400 Hz)
- Wideband speech and audio coding (50-7000 Hz)
- Superwideband speech and audio coding (50-14000 Hz)
- Fullband speech and audio coding (20-20000 Hz)
- Mono-to-multichannel coding
These Recommendations will be either new Recommendations or extensions of
existing ITU-T speech and audio coding Recommendations, for example using
advanced techniques to significantly improve the trade-offs between bit rate,
quality, delay, and algorithm complexity. This Question will also be responsible
for the maintenance of the existing ITU-T speech and audio coding
Recommendations.
The standards developed by this Question will have sufficient flexibility to
accommodate transport in a wide range of applications over a variety of
transport technologies, including telephony and audio-visual services over NGN/IMS,
mobile radio access networks, and public and private WANs and LANs. Other
applications include circuit multiplication equipment and simultaneous voice and
data services.
Additionally, this Question will continue the development of the G.191
software tools library (STL). G.191 provides a common set of tools for use in
ITU-T standardization activities on speech and audio coding, including a library
of portable, inter-workable and reliable software routines. It has been
substantially improved over successive releases, and requirements for further
extensions and tools to process wider-bandwidth audio signals have already been
identified.
Study items
Study items to be considered include, but are not limited to:
- Speech and audio coding algorithms to extend existing ITU-T speech and
audio coding Recommendations or to create new ones in order to achieve the
following objectives:
  - enhancements in quality at a given audio bandwidth (including pre- and
    post-processing functions such as noise suppression techniques)
  - enhancements in quality obtained by increasing the audio bandwidth and/or the
    number of channels
  - improvements in compression efficiency and flexibility as provided by
    scalability in bandwidth and bit rate or by discontinuous transmission and
    comfort noise generation algorithms
  - robust operation (e.g. with packet loss concealment methods) in
    error/loss-prone environments such as non-guaranteed-bandwidth packet networks
    or mobile wireless communication
  - reduction of real-time delay with the purpose of reducing quality degradation
    due to end-to-end latency in conversational applications
  - reduction of complexity
  - lossless data compression for existing ITU-T speech and audio coding
    Recommendations
- Maintenance of existing ITU-T speech and audio coding standards and of the
ITU-T software tool library through collection of defect reports, assessment of
their merit, and identification of the appropriate course of action
- Extensions to the ITU-T software tool library for signal processing
standardization activities
- Compressed data formats to support packetization and streaming
- Development of supplemental enhancement information to accompany speech and
audio data for enabling enhanced functionality in application environments (e.g.
metadata, spatialisation information)
- Methods to allow streams to be easily mixed by MCUs or terminals
- Techniques to permit networks or terminals to adjust the bit rate of speech
and audio streams efficiently (e.g. scalability feature)
- Techniques for efficient compressed-digital to compressed-digital processing
(including transcoding)
- The impact of quality control requirements on speech and audio codec
development
- Security aspects that directly affect speech and audio coding including
watermarking techniques
- Parameter extraction from audio, in support of applications such as speech
recognition, speaker verification, biometrics applications, etc.
- Study and specification of data for speech/audio annotation, indexing, and
searching
- Considerations on how to help measure and mitigate climate change
Tasks
Tasks include, but are not limited to:
- Extensions of existing G-series speech and audio coding Recommendations,
including G.711, G.711.1, G.718 (in coordination with Q9/16), G.719, G.722,
G.722.1, G.722.2, G.726, G.727, G.728, G.723.1, G.729, G.729.1
- Maintenance of existing G-series Recommendations on speech and audio coding
and signal processing, including G.191, G.192, G.711, G.726, G.727, G.728,
G.723.1, G.729, G.722, G.722.1, G.722.2, G.729.1
- Development of new speech and audio coding Recommendations
- Upgrade of the ITU-T G.191 Software Tool Library to support ITU-T signal
processing activities, e.g.:
  - superwideband and fullband audio processing
  - channel models, error patterns and statistics for packet-based networks
    (including IP and Internet), wireless networks and mobile-satellite systems
  - identify techniques for verification of the correct implementation of
    algorithms
An up-to-date status of work under this Question is found in the SG 16 work
programme (http://itu.int/ITU-T/workprog/wp_search.aspx?isn_sg=554).
Relationships
Recommendations:
- G.718 (interoperable mode extensions planned in 2009-2012)
- G.16X-series speech enhancement Recommendations
- G.76X-series circuit multiplication Recommendations
- G.799.X-series voice over IP gateway Recommendations
- H.300-series system Recommendations
- P.800-series
Questions:
- 1, 2, 4, 6, 7, 8, 9, 14, 15, 16, 18, 22, 26, 28/16
Study Groups:
- ITU-T SG 9 on speech and audio coding aspects of digital cable systems and
IPTV
- ITU-T SG 12 for speech and audio coding quality performance assessment and
software tools matters
- ITU-R SG 6 on terrestrial and satellite broadcasting services
Other Bodies:
- 3GPP and 3GPP2
- ETSI DECT and TISPAN
- IETF for speech and audio packetization issues
- IMTC
- IP/MPLS Forum
- ISO/MPEG
- TIA