Rec. ITU-T H.222.0 (08/2018) - Information technology – Generic coding of moving pictures and associated audio information: Systems
Summary
History
FOREWORD
Table of Contents
Introduction
Figure Intro. 1 – Simplified overview of the scope of this Recommendation | International Standard
Intro. 1  Transport stream
Figure Intro. 2 – Prototypical transport demultiplexing and decoding example
Figure Intro. 3 – Prototypical transport multiplexing example
Figure Intro. 4 – Prototypical transport stream to program stream conversion
Figure Intro. 5 – Prototypical decoder for program streams
Intro. 3  Conversion between transport stream and program stream
Intro. 4  Packetized elementary stream
Intro. 5  Timing model
Intro. 6  Conditional access
Intro. 7  Multiplex-wide operations
Intro. 8.2 Synchronization
Intro. 8.3 Relation to compression layer
Intro. 9  System reference decoder
Intro. 10  Applications
INTERNATIONAL STANDARD
ITU-T RECOMMENDATION
SECTION 1 – GENERAL
     1.1 Scope
     1.2 Normative references
          1.2.1 Identical Recommendations | International Standards
          1.2.2 Paired Recommendations | International Standards equivalent in technical content
          1.2.3 Additional references
SECTION 2 – TECHNICAL ELEMENTS
     2.1 Definitions
NOTE – Usage of J2K blocks requires J2K block mode (defined in 2.1.72
2.1.100 pack (system): A pack consists of a pack header followed by zero or more packets. It is a layer in the system coding syntax described in 2.5.3.3.
2.1.101 packet data (system): Contiguous bytes of data from an elementary stream present in a packet.
     2.2 Symbols and abbreviations
          2.2.1 Arithmetic operators
          2.2.2 Assignment
          2.2.3 Bitwise operators
          2.2.4 Constants
          2.2.5 Logical operators
          2.2.6 Mnemonics
          2.2.7 Range operator
          2.2.8 Relational operators
     2.3 Method of describing bit stream syntax
     2.4 Transport stream bitstream requirements
          2.4.1 Transport stream coding structure and parameters
          2.4.2 Transport stream system target decoder
               2.4.2.1 General
Figure 2-1 – Transport stream system target decoder notation
     2.4.2.2 System clock frequency
     2.4.2.3 Input to the transport stream system target decoder
Transport streams with multiple programs and variable rate
     2.4.2.4 Buffering
Buffer BSn
Audio
Systems
Video
Leak method
Vbv_delay method
Removal of access units
System data
Low delay
Trick mode
     2.4.2.5 Decoding
     2.4.2.6 Presentation
     2.4.2.7 Buffer management
Definition of overflow and underflow
     2.4.2.8 T-STD extensions for carriage of ISO/IEC 14496 data
     2.4.2.9 T-STD extensions for carriage of Rec. ITU-T H.264 | ISO/IEC 14496-10 video
     2.4.2.10 T-STD extensions for carriage of ISO/IEC 14496-17 text streams
     2.4.2.11 T-STD extensions for carriage of J2K video elementary streams
     2.4.2.12 T-STD extensions for carriage of HEVC
     2.4.2.13 T-STD extensions for carriage of MVCD video sub-bitstream
     2.4.2.14 T-STD extensions for carriage of MV HEVC and SHVC
     2.4.3 Specification of the transport stream syntax and semantics
          2.4.3.1 Transport stream
Table 2-1 – Transport stream
     2.4.3.2 Transport stream packet layer
Table 2-2 – Transport packet of this Recommendation | International Standard
     2.4.3.3 Semantic definition of fields in transport stream packet layer
Table 2-3 – PID table
Table 2-4 – Scrambling control values
Table 2-5 – Adaptation field control values
     2.4.3.4 Adaptation field
     2.4.3.5 Semantic definition of fields in adaptation field
     2.4.3.6 PES packet
     2.4.3.7 Semantic definition of fields in PES packet
Table 2-24 – Trick mode control values
Table 2-25 – Field_id field control values
Table 2-26 – Coefficient selection values
     2.4.3.8 Carriage of program streams and ISO/IEC 11172-1 Systems streams in the transport stream
     2.4.4 Program-specific information
          2.4.4.1 General
Table 2-28 – Program-specific information
     2.4.4.2 Pointer
Table 2-29 – Program-specific information pointer
     2.4.4.3 Semantics definition of fields in pointer syntax
     2.4.4.4 Program association table
     2.4.4.5 Table_id assignments
     2.4.4.6 Semantic definition of fields in program association section
     2.4.4.7 Conditional access table
Table 2-32 – Conditional access section
     2.4.4.8 Semantic definition of fields in conditional access section
     2.4.4.9 Program map table
Table 2-33 – Transport stream program map section
     2.4.4.10 Semantic definition of fields in transport stream program map section
     2.4.4.11 Syntax of the private section
Table 2-35 – Private section
     2.4.4.12 Semantic definition of fields in private section
     2.4.4.13 Syntax of the transport stream section
Table 2-36 – The transport stream description table
     2.4.4.14 Semantic definition of fields in the transport stream section
     2.5 Program stream bitstream requirements
          2.5.1 Program stream coding structure and parameters
          2.5.2 Program stream system target decoder
Figure 2-2 – Program stream system target decoder notation
     2.5.2.1 System clock frequency
     2.5.2.2 Input to the program stream system target decoder
     2.5.2.3 Buffering
     2.5.2.4 PES streams
     2.5.2.5 Decoding and presentation
     2.5.2.6 P-STD extensions for carriage of ISO/IEC 14496 data
     2.5.2.7 P-STD extensions for carriage of Rec. ITU-T H.264 | ISO/IEC 14496-10 video
     2.5.2.8 P-STD extensions for carriage of ISO/IEC 14496-17 text streams
     2.5.3 Specification of the program stream syntax and semantics
          2.5.3.1 Program stream
Table 2-37 – Program stream
     2.5.3.2 Semantic definition of fields in program stream
     2.5.3.3 Pack layer of program stream
Table 2-38 – Program stream pack
     2.5.3.4 Semantic definition of fields in program stream pack
     2.5.3.5 System header
Table 2-40 – Program stream system header
     2.5.3.6 Semantic definition of fields in system header
     2.5.3.7 Packet layer of program stream
     2.5.4 Program stream map
          2.5.4.1 Syntax of program stream map
Table 2-41 – Program stream map
     2.5.4.2 Semantic definition of fields in program stream map
     2.5.5 Program stream directory
          2.5.5.1 Syntax of program stream directory packet
Table 2-42 – Program stream directory packet
     2.5.5.2 Semantic definition of fields in program stream directory
Table 2-43 – Intra_coded indicator
Table 2-44 – Coding_parameters indicator
     2.6 Program and program element descriptors
          2.6.1 Semantic definition of fields in program and program element descriptors
          2.6.2 Video stream descriptor
Table 2-46 – Video stream descriptor
     2.6.3 Semantic definitions of fields in video stream descriptor
Table 2-47 – Frame rate code
     2.6.4 Audio stream descriptor
     2.6.5 Semantic definition of fields in audio stream descriptor
     2.6.6 Hierarchy descriptor
     2.6.7 Semantic definition of fields in hierarchy descriptor
     2.6.8 Registration descriptor
Table 2-51 – Registration descriptor
     2.6.9 Semantic definition of fields in registration descriptor
     2.6.10 Data stream alignment descriptor
Table 2-52 – Data stream alignment descriptor
     2.6.11 Semantics of fields in data stream alignment descriptor
Table 2-53 – Video stream alignment values
Table 2-54 – AVC video stream alignment values
Table 2-55 – HEVC video stream alignment values
Table 2-56 – Audio stream alignment values
     2.6.12 Target background grid descriptor
Figure 2-3 – Target background grid descriptor display area
     2.6.13 Semantics of fields in target background grid descriptor
Table 2-57 – Target background grid descriptor
     2.6.14 Video window descriptor
Table 2-58 – Video window descriptor
     2.6.15 Semantic definition of fields in video window descriptor
     2.6.16 Conditional access descriptor
Table 2-59 – Conditional access descriptor
     2.6.17 Semantic definition of fields in conditional access descriptor
     2.6.18 ISO 639 language descriptor
Table 2-60 – ISO 639 language descriptor
     2.6.19 Semantic definition of fields in ISO 639 language descriptor
Table 2-61 – Audio type values
     2.6.20 System clock descriptor
Table 2-62 – System clock descriptor
     2.6.21 Semantic definition of fields in system clock descriptor
     2.6.22 Multiplex buffer utilization descriptor
Table 2-63 – Multiplex buffer utilization descriptor
     2.6.23 Semantic definition of fields in multiplex buffer utilization descriptor
     2.6.24 Copyright descriptor
Table 2-64 – Copyright descriptor
     2.6.25 Semantic definition of fields in copyright descriptor
     2.6.26 Maximum bitrate descriptor
Table 2-65 – Maximum bitrate descriptor
     2.6.27 Semantic definition of fields in maximum bitrate descriptor
     2.6.28 Private data indicator descriptor
Table 2-66 – Private data indicator descriptor
     2.6.29 Semantic definition of fields in Private data indicator descriptor
     2.6.30 Smoothing buffer descriptor
Table 2-67 – Smoothing buffer descriptor
     2.6.31 Semantic definition of fields in smoothing buffer descriptor
     2.6.32 STD descriptor
Table 2-68 – STD descriptor
     2.6.33 Semantic definition of fields in STD descriptor
     2.6.34 IBP descriptor
Table 2-69 – IBP descriptor
     2.6.35 Semantic definition of fields in IBP descriptor
     2.6.36 MPEG-4 video descriptor
Table 2-70 – MPEG-4 video descriptor
     2.6.37 Semantic definition of fields in MPEG-4 video descriptor
     2.6.38 MPEG-4 audio descriptor
Table 2-71 – MPEG-4 audio descriptor
     2.6.39 Semantic definition of fields in MPEG-4 audio descriptor
     2.6.40 IOD descriptor
Table 2-73 – IOD descriptor
     2.6.41 Semantic definition of fields in IOD descriptor
     2.6.42 SL descriptor
Table 2-74 – SL descriptor
     2.6.43 Semantic definition of fields in SL descriptor
     2.6.44 FMC descriptor
Table 2-75 – FMC descriptor
     2.6.45 Semantic definition of fields in FMC descriptor
     2.6.46 External_ES_ID descriptor
Table 2-76 – External_ES_ID descriptor
     2.6.47 Semantic definition of fields in External_ES_ID descriptor
     2.6.48 Muxcode descriptor
Table 2-77 – Muxcode descriptor
     2.6.49 Semantic definition of fields in Muxcode descriptor
     2.6.50 FmxBufferSize descriptor
Table 2-78 – FmxBufferSize descriptor
     2.6.51 Semantic definition of fields in FmxBufferSize descriptor
     2.6.52 MultiplexBuffer descriptor
Table 2-79 – MultiplexBuffer descriptor
     2.6.53 Semantic definition of fields in MultiplexBuffer descriptor
     2.6.54 FlexMuxTiming descriptor
Table 2-80 – FlexMuxTiming descriptor
     2.6.55 Semantic definition of fields in FlexMuxTiming descriptor
     2.6.56 Content labelling descriptor
Table 2-81 – Content labelling descriptor
     2.6.57 Semantic definition of fields in content labelling descriptor
Table 2-83 – Content_time_base_indicator values
     2.6.58 Metadata pointer descriptor
Table 2-84 – Metadata pointer descriptor
     2.6.59 Semantic definition of fields in metadata pointer descriptor
Table 2-85 – Metadata format values
Table 2-86 – MPEG_carriage_flags
     2.6.60 Metadata descriptor
     2.6.61 Semantic definition of fields in metadata descriptor
Table 2-88 – decoder_config_flags
     2.6.62 Metadata STD descriptor
Table 2-89 – Metadata STD descriptor
     2.6.63 Semantic definition of fields in metadata STD descriptor
     2.6.64 AVC video descriptor
Table 2-90 – AVC video descriptor
     2.6.65 Semantic definition of fields in AVC video descriptor
     2.6.66 AVC timing and HRD descriptor
Table 2-91 – AVC timing and HRD descriptor
     2.6.67 Semantic definition of fields in AVC timing and HRD descriptor
     2.6.68 MPEG-2 AAC audio descriptor
Table 2-92 – MPEG-2 AAC_audio_descriptor
     2.6.69 Semantic definition of fields in MPEG-2 AAC audio descriptor
Table 2-93 – MPEG-2_AAC_additional_information field values
     2.6.70 MPEG-4 text descriptor
Table 2-94 – MPEG-4 text descriptor
     2.6.71 Semantic definition of fields in MPEG-4 text descriptor
     2.6.72 MPEG-4 audio extension descriptor
Table 2-95 – MPEG-4 audio extension descriptor
     2.6.73 Semantic definition of fields in MPEG-4 audio extension descriptor
     2.6.74 Auxiliary video stream descriptor
Table 2-96 – Auxiliary video stream descriptor
     2.6.75 Semantic definition of fields in auxiliary video stream descriptor
     2.6.76 SVC extension descriptor
Table 2-97 – SVC extension descriptor
     2.6.77 Semantic definition of fields in SVC extension descriptor
     2.6.78 MVC extension descriptor
Table 2-98 – MVC extension descriptor
     2.6.79 Semantics of fields in MVC extension descriptor
     2.6.80 J2K video descriptor
     2.6.81 Semantics of fields in J2K video descriptor
DEN_frame_rate – This field shall be coded the same as frat_denominator field specified in Table S.1 (see Annex S).
NUM_frame_rate – This field shall be coded the same as frat_numerator field specified in Table S.1 (see Annex S).
Table 2-100 – Example frame rates based on DEN_frame_rate and NUM_frame_rate values
If for some reason the MaxCLL and/or MaxFALL values are unknown, the value 0x0000 shall be used.
     2.6.82 MVC operation point descriptor
Table 2-101 – MVC operation point descriptor
     2.6.83 Semantic definition of fields in MVC operation point descriptor
     2.6.84 MPEG2_stereoscopic_video_ format_ descriptor
Table 2-102 – MPEG2_stereoscopic_video_format_descriptor syntax
     2.6.85 Semantic definition of fields in the MPEG2_stereoscopic_video_format_descriptor
     2.6.86 Stereoscopic_program_info_descriptor
Table 2-103 – Stereoscopic_program_info_descriptor syntax
     2.6.87 Semantic definition of fields in the stereoscopic_program_info_descriptor
Table 2-104 – Stereoscopic_service_type values
     2.6.88 Stereoscopic_video_info_descriptor
Table 2-105 – Stereoscopic_video_info_descriptor syntax
     2.6.89 Semantic definition of fields in the stereoscopic_video_info_descriptor
Table 2-106 – Upsampling factor values
     2.6.90 Extension descriptor
     2.6.91 Semantic definition of fields in the extension descriptor
HEVC_tile_substream_descriptor() – This structure is defined in 2.6.122 and 2.6.123.
HEVC_subregion_descriptor() – This structure is defined in 2.6.125 and 2.6.126.
Table 2-108 – Extension descriptor tag values
     2.6.92 ODUpdate_descriptor
     2.6.93 Transport_profile_descriptor
Table 2-109 – Transport_profile_descriptor syntax
     2.6.94 Semantic definition of fields in the Transport_profile_descriptor
Table 2-110 – Transport_profile values
     2.6.95 HEVC video descriptor
Table 2-111 – HEVC video descriptor
     2.6.96 Semantic definition of fields in HEVC video descriptor
Table 2-112 – Semantics of HDR_WGC_idc
     2.6.97 HEVC timing and HRD descriptor
Table 2-113 – HEVC timing and HRD descriptor
     2.6.98 Semantic definition of fields in HEVC timing and HRD descriptor
     2.6.99 AF extensions descriptor
Table 2-114 – Adaptation field extension descriptor
     2.6.100 HEVC operation point descriptor
Table 2-115 – HEVC operation point descriptor
     2.6.101 Semantic definition of fields in HEVC operation point descriptor
     2.6.102 HEVC hierarchy extension descriptor
Table 2-116 – HEVC hierarchy extension descriptor
     2.6.103 Semantic definition of fields in HEVC hierarchy extension descriptor
Table 2-117 – Semantics of extension dimension bits
     2.6.104 Green extension descriptor
Table 2-118 – Green extension descriptor
     2.6.105 Semantics for green extension descriptor
     2.6.106 MPEG-H 3D audio descriptor
Table 2-119 – MPEG-H 3D audio descriptor
     2.6.107 Semantics for MPEG-H 3D audio descriptor
     2.6.108 MPEG-H 3D audio config descriptor
Table 2-120 – MPEG-H 3D audio config descriptor
     2.6.109 Semantics for MPEG-H 3D audio config descriptor
     2.6.110 MPEG-H 3D audio scene descriptor
     2.6.111 Semantic definition of fields in MPEG-H 3D audio scene descriptor
     2.6.112 MPEG-H 3D audio text label descriptor
Table 2-122 – MPEG-H 3D audio text label descriptor
     2.6.113 Semantic definition of fields in MPEG-H 3D audio text label descriptor
     2.6.114 MPEG-H 3D audio multi-stream descriptor
     2.6.115 Semantic definition of fields in MPEG-H 3D audio multi-stream descriptor
     2.6.116 MPEG-H 3D audio DRC and Loudness descriptor
     2.6.117 Semantic definition of fields in MPEG-H 3D audio DRC and Loudness descriptor
     2.6.118 MPEG-H 3D audio command descriptor
Table 2-125 – MPEG-H 3D audio command descriptor
     2.6.119 Quality extension descriptor
Table 2-126 – Quality extension descriptor
     2.6.120 Virtual segmentation descriptor
Table 2-127 – Virtual segmentation descriptor
     2.6.121 Semantic definition of fields in virtual segmentation descriptor
     2.6.122  HEVC tile substream descriptor
Table 2-128 – HEVC tile substream descriptor
     2.6.123  Semantic definition of the fields in the HEVC tile substream descriptor
NOTE – If ReferenceFlag is set to '0', Flag[0] has the same semantics as PreambleFlag for the case that ReferenceFlag is set to '1'.
     2.6.124 HEVC tile substream af_descriptor
     2.6.125 HEVC subregion descriptor
Table 2-129 — HEVC subregion descriptor
     2.6.126 Semantic definition of the fields in the HEVC subregion descriptor
SubstreamCountMinus1[i] – The value of this 8-bit field in the range of 0 to (TotalSubstreamIDs – 1) indicates
PictureSizeHor[i] – This 16-bit field indicates the horizontal subregion dimension, measured in pixels.
PictureSizeVert[i] – This 16-bit field indicates the vertical subregion dimension, measured in pixels.
PatternCount[i] – This 7-bit field indicates the number of different subregion layouts.
SubstreamOffset[k][j][i] – This array of 7 bit fields indicates the offset that has to be added
     2.7 Restrictions on the multiplexed stream semantics
          2.7.1 Frequency of coding the system clock reference
          2.7.2 Frequency of coding the program clock reference
          2.7.3 Frequency of coding the elementary stream clock reference
          2.7.4 Frequency of presentation timestamp coding
          2.7.5 Conditional coding of timestamps
          2.7.6 Timing constraints for scalable coding
          2.7.7 Frequency of coding P-STD_buffer_size in PES packet headers
          2.7.8 Coding of system header in the program stream
          2.7.9 Constrained system parameter program stream
Packet rate
Decoder buffer size
BSn ≤ 8976 bytes
     2.7.10 Transport stream
          Sample rate locking in transport streams
     2.8 Compatibility with ISO/IEC 11172
     2.9 Registration of copyright identifiers
          2.9.1 General
          2.9.2 Implementation of a Registration Authority (RA)
     2.10 Registration of private data format
          2.10.1 General
          2.10.2 Implementation of a Registration Authority (RA)
     2.11 Carriage of ISO/IEC 14496 data
          2.11.1 Introduction
          2.11.2 Carriage of individual ISO/IEC 14496-2 and 14496-3 Elementary Streams in PES packets
               2.11.2.1 Introduction
Table 2-130 – Carriage of individual ISO/IEC 14496 streams in Rec. ITU-T H.222.0 | ISO/IEC 13818-1
     2.11.2.2 STD extensions for individual ISO/IEC 14496 elementary streams
Figure 2-4 – T-STD model extensions for individual ISO/IEC 14496 elementary streams
     2.11.3 Carriage of audiovisual ISO/IEC 14496-1 scenes and associated ISO/IEC 14496 streams
          2.11.3.1 Introduction
          2.11.3.2 Assignment of ES_ID values
          2.11.3.3 Timing of ISO/IEC 14496 scenes and associated streams
If X(t) = fstc(t)/fobject(t)
     2.11.3.4 Delivery timing of SL-packetized streams
     2.11.3.5 Delivery timing of FlexMux streams
     2.11.3.6 Carriage of SL-packetized streams in PES packets
     2.11.3.7 Carriage of FlexMux streams in PES packets
     2.11.3.8 Carriage of SL packets and FlexMux packets in sections
Table 2-131 – Section syntax for transport of ISO/IEC 14496 stream
     2.11.3.9 T-STD extensions
          2.11.3.9.1 T-STD Model for 14496 content
Figure 2-5 – T-STD model for ISO/IEC 14496 content
     2.11.3.9.2   Processing of FlexMux streams
     2.11.3.9.3   Definition of FlexMux Buffer, FBnp
     2.11.3.9.4   Processing of SL-packetized streams
     2.11.3.9.5 Buffer management
     2.11.3.10  Carriage within a transport stream
          2.11.3.10.1 Overview
          2.11.3.10.2 Initial Object Descriptor
     2.11.3.11  P-STD Model for 14496 content
Figure 2-6 – P-STD model for ISO/IEC 14496 Systems stream
     2.11.3.11.1 Processing of FlexMux streams
     2.11.3.11.2 Definition of FlexMux Buffer, FBnp
     2.11.3.11.3 Processing of SL-packetized streams
     2.11.3.11.4 Buffer management
     2.11.3.12  Carriage within a program stream
          2.11.3.12.1 Overview
          2.11.3.12.2 Initial object descriptor
     2.12 Carriage of metadata
          2.12.1 Introduction
          2.12.2 Metadata time-line model
Figure 2-7 – Timing model for delivery of content and metadata
Figure 2-8 – Delivery of metadata in PES packets
     2.12.3 Options for transport of metadata
     2.12.4 Use of PES packets to transport metadata
          2.12.4.1 General
          2.12.4.2 Metadata Access Unit Wrapper
Table 2-133 – Metadata Access Unit Wrapper
Table 2-135 – Cell fragment indication
     2.12.5 Use of the DSM-CC synchronized download protocol to transport metadata
     2.12.6 Use of metadata sections to transport metadata
Table 2-136 – Section syntax for transport of metadata
Table 2-137 – Section fragment indication
     2.12.7 Use of the DSM-CC data carousel to transport metadata
     2.12.8 Use of the DSM-CC object carousel to transport metadata
     2.12.9 Metadata-related signalling
          2.12.9.1 General
          2.12.9.2 Signalling of metadata services and streams
          2.12.9.3 Signalling of content for use by a metadata system
          2.12.9.4 Association of metadata to content
          2.12.9.5 Signalling decoder configuration data
          2.12.9.6 Overview of metadata signalling
Figure 2-9 – Metadata signalling and referencing
     2.12.10 STD model for metadata
Figure 2-10 – Metadata decoding in the STD
     2.13 Carriage of ISO 15938 data
          2.13.1 Introduction
          2.13.2 ISO 15938 decoder configuration data
     2.14 Carriage of Rec. ITU-T H.264 | ISO/IEC 14496-10 video
          2.14.1 Introduction
          2.14.2 Carriage in PES packets
          2.14.3 STD extensions
               2.14.3.1 T-STD extensions
Figure 2-11 – T-STD model extensions for Rec. ITU-T H.264 | ISO/IEC 14496-10 video
DPBn buffer management
TBn, MBn and EBn buffer management
Transfer between MBn and EBn
STD delay
Buffer management conditions
     2.14.3.2 P-STD extensions
Figure 2-12 – P-STD model extensions for Rec. ITU-T H.264 | ISO/IEC 14496-10 video
DPBn buffer management
Bn buffer management
STD delay
Buffer management conditions
     2.14.3.3 View and dependency representation delimiter NAL unit
Table 2-138 – View and dependency representation delimiter NAL unit
     2.14.3.4 Semantics of view and dependency representation delimiter NAL unit
     2.14.3.5 T-STD extensions for SVC
Carriage in PES packets
DPB buffer management
TBn, MBn, EBn buffer management
Rxn = bit_rate
Rbxn = 1200 × MaxBR[level]n
Access unit re-assembling and EB removal
STD delay
Buffer management conditions
     2.14.3.6 P-STD extensions for SVC
Carriage in PES packets
DPB buffer management
Bn buffer management
Access unit re-assembling and B removal
STD delay
Buffer management conditions
     2.14.3.7 T-STD extensions for MVC and MVCD
Carriage in PES packets
DPB buffer management
TBn, MBn, EBn buffer management
Rbxn = 1200 × MaxBR[level]n
Access unit re-assembling and EB removal
STD delay
Buffer management conditions
Carriage of MVCD sub-bitstream in PES packets
     2.14.3.8 P-STD extensions for MVC
Carriage in PES packets
DPB buffer management
Bn buffer management
Access unit re-assembling and B removal
STD delay
Buffer management conditions
     2.15 Carriage of ISO/IEC 14496-17 text streams
          2.15.1 Introduction
          2.15.2 Carriage in PES packets
          2.15.3 STD extensions
               2.15.3.1 T-STD extensions
Figure 2-17 – T-STD model extensions for ISO/IEC 14496-17 text streams
Buffer management
     2.15.3.2 P-STD extensions
     2.16 Carriage of auxiliary video streams
     2.17 Carriage of HEVC
          2.17.1 Constraints for the transport of HEVC
Carriage in PES packets
DPB buffer management
     2.17.2 T-STD Extensions for single layer HEVC
Figure 2-18 – T-STD model extensions for single layer HEVC
TBn, MBn, EBn buffer management
BSoh = (1/750) seconds ( max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s}
BSmux = 0.004 seconds ( max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s}
EBSn = cpb_size
Rxn = bit_rate
Rbxn = BrNalFactor × MaxBR[tier, level]
STD delay
Buffer management conditions
     2.17.3 T-STD extensions for layered transport of HEVC temporal video subsets
Figure 2-19 – T-STD model extensions for layered transport of HEVC temporal video subsets
TBn,k, MBn,k, EBn buffer management
MBSn,k = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size
EBSn = cpb_size
Rxn,k = bit_rate
Rbxn,k = BrNalFactor × MaxBR[tier, level]
STD delay
Buffer management conditions
     2.17.4 T-STD extensions for layered transport of HEVC sub-partitions with bitstream-partition-specific CPB operation
Figure 2-20 – T-STD model extensions for bitstream-partition-specific CPB operation
TBl,k, MBl,k, EBl buffer management
MBSl,k = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size
Aggregation of elementary streams
Table 2-139 – Implied hierarchy_layer_index if no hierarchy descriptors are used
Carriage in PES packets
STD delay
Buffer management conditions
     2.17.5 T-STD Extensions for the carriage of HEVC tiles
          2.17.5.1 Carriage of HEVC motion-constrained tile sets
There are alternative options to carry the HEVC tile substreams:
     2.17.5.2 Carriage of HEVC motion-constrained tile sets as separate elementary streams
Figure 2-21 – T-STD model extensions for transport of HEVC tiles through individual ESs
Process to access a subregion:
The HEVC subregion descriptor and all HEVC tile substream descriptors shall be read.
Next, it checks if the applicable HEVC tile substream descriptor provides a PatternReference value.
With that PatternReference, it selects the applicable SubstreamOffset values:
SubstreamOffset[k][PatternReference][i] with 0 < k < SubstreamCountMinus1[i]
TBn, MBn, EBn buffer management
The following additional notations are used to describe the T-STD extensions and are illustrated in Figure 2-21.
MBSn = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size (measured in bytes)
BSoh = (1/750) seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s}
BSmux = 0.004 seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s}
SBS = cpb_sizeH
SBS = ∑k (SBSk) with n ≤ k ≤ n+m
     2.17.5.3 Carriage of HEVC motion-constrained tile sets in a common ES using AF descriptors
Figure 2-22 – T-STD model extensions for transport of HEVC tiles in a common ES using AF descriptors
Next, it selects the SubstreamIDx that relates to the upper left HEVC tile substream of the region to be displayed.
The receiver sets up an empty list of SubstreamIDs that belong to the subregion and adds
Next, it checks if the applicable HEVC tile substream af_descriptor provides a PatternReference value. If there is no explicit PatternReference value, which means that either descriptor_length equals 1 or ReferenceFlag equals 0, the index j defaults t...
With that PatternReference, it selects the applicable SubstreamOffset values:
SubstreamOffset[k][PatternReference][i] with 0 < k < SubstreamCountMinus1[i]
TB, MB, RB buffer management
MBS = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size (measured in bytes)
BSoh, packet overhead buffering, is defined as:
BSoh = (1/750) seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s}
BSmux = 0.004 seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s}
RBS = cpb_sizeH
     2.17.5.4 Carriage of HEVC motion-constrained tile sets in a common ES ignoring AF descriptors
Figure 2-23 – T-STD model extension for transport of HEVC tiles in a common ES ignoring AF descriptors
TB, MB, EB buffer management
MBS = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size (measured in bytes)
BSoh = (1/750) seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s}
BSmux = 0.004 seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s}
EBS = cpb_size
     2.18 Carriage of green access units
          2.18.1 Carriage of green access units in MPEG-2 sections
          2.18.2 Semantics of green access unit section
          2.18.3 Green access unit
Table 2-141 – Green access unit
     2.18.4 Timing relationship between green access unit and media access unit
     2.18.5 Buffer model for processing green access units
Figure 2-24 – T-STD model extension for green access units
     2.19 Carriage of ISO/IEC 23008-3 MPEG-H 3D audio data
          2.19.1 Introduction
          2.19.2 Carriage in PES packets
          2.19.3 STD extensions for ISO/IEC 23008-3 elementary streams
          2.19.4 STD extensions for multiple ISO/IEC 23008-3 elementary streams
Figure 2-25 – Transport stream system target decoder for multiple audio elementary streams
     2.19.5 MPEG-2 Transport stream random access constraints and signalling
     2.20 Carriage of Quality Access Units in MPEG-2 sections
          2.20.1 General description
Table 2-142 – Quality Access Unit
     2.20.2 Buffer model for processing Quality Access Units
Figure 2-26 – Quality Access Unit decoder processing model
     2.21 Carriage of sample variants
          2.21.1 Introduction
          2.21.2 Constraints
Sample variants shall be carried in an MPEG-2 TS metadata stream that complies with the following constraints:
     2.21.3 Sample Variants decoder configuration data
     2.22 Carriage of Media Orchestration Access Units
     A.1 CRC decoder model
Figure A.1 – 32-bit CRC decoder model
     B.1 Introduction
          B.1.1 Purpose
          B.1.2 Future applications
Video on demand
Interactive video services
Video networks
     B.1.3 Benefits
     B.1.4 Basic functions
          B.1.4.1 Stream selection
          B.1.4.2 Retrieval
          B.1.4.3 Storage
     B.2 General elements
          B.2.1 Scope
          B.2.2 Overview of the DSM-CC application
Figure B.1 – Configuration of DSM-CC application
     B.2.3 The transmission of DSM-CC commands and acknowledgements
Figure B.2 – BSM-CC bitstream decoded as a stand-alone bitstream
Figure B.3 – DSM-CC bitstream decoded as part of the system bitstream
     B.3 Technical elements
          B.3.1 Definitions
          B.3.2 Specification of DSM-CC syntax
Table B.1 – DSM-CC syntax
     B.3.3 Semantics of fields in specification of DSM-CC syntax
Table B.2 – Command_id assigned values
     B.3.4 Control layer
          Constraints on setting flags in DSM-CC control
Table B.3 – DSM-CC control
     B.3.5 Semantics of fields in control layer
Table B.4 – Select mode assigned values
     B.3.6 Acknowledgement layer
          Constraints on setting flags in DSM-CC control
Table B.5 – DSM-CC Acknowledgement
     B.3.7 Semantics of fields in acknowledgement layer
     B.3.8 Time code
          Constraints on time code
Table B.6 – Time code
     B.3.9 Semantics of fields in time code
     C.1 Explanation of program-specific information in transport streams
     C.2 Introduction
     C.3 Functional mechanism
i) table_id
ii) table_id_extension
iii) section_number
iv) version_number
v) current_next_indicator
     C.4 The mapping of sections into transport stream packets
     C.5 Repetition rates and random access
     C.6 What is a program?
     C.7 Allocation of program_number
i) multilingual transmissions into separate markets
ii) Very large program definitions
     C.8 Usage of PSI in a typical system
     C.9 The relationships of PSI structures
Figure C.1 – Program and network mapping relationships
     C.9.1 Program Association Table
     C.9.2 Program map table
     C.9.3 Conditional access table
     C.9.4 Network information table
     C.9.5 Private_section()
     C.9.6 Descriptors
Table C.1 – Composite_descriptor
     C.10 Bandwidth utilization and signal acquisition time
6-MHz CATV channel
OC-3 fibre channel (155 Mbit/s)
C-band satellite transponder
     D.1 Introduction
          D.1.1 Timing model
Figure D.1 – Constant delay model
     D.1.2 Audio and video presentation synchronization
     D.1.3 System time clock recovery in the decoder
Figure D.2 – STC recovery using PLL
     D.1.4 SCR and PCR jitter
Table D.1 – Re-multiplexing strategy
     D.1.5 Clock recovery in the presence of network jitter
     D.1.6 System clock used for chroma sub-carrier generation
     D.1.7 Component video and audio reconstruction
     D.1.8 Frame slipping
     D.1.9 Smoothing of network jitter
     E.1 General considerations
     E.2 Suggestion
Table E.1 – PES packet header example
     F.1 Introduction
          F.1.1 Transport stream syntax
Figure F.1 – Transport stream syntax diagram
     F.1.2 PES packet
Figure F.2 – PES packet syntax diagram
     F.1.3 Program Association section
Figure F.3 – Program association section diagram
     F.1.4 CA section
Figure F.4 – Conditional access section diagram
     F.1.5 TS program map section
Figure F.5 – TS program map section diagram
     F.1.6 Private section
Figure F.6 – Private section diagram
     F.1.7 Program stream
Figure F.7 – Program stream diagram
     F.1.8 Program stream map
Figure F.8 – Program stream map diagram
     G.1 General information
          G.1.1 Sync byte emulation
          G.1.2 Skipped picture status and decoding process
          G.1.3 Selection of PID values
          G.1.4 PES start_code emulation
     H.1 Private data
1) Transport stream packet Table 2-2
2) Transport stream adaptation field Table 2-6
3) PES packet Table 2-21
4) Descriptors
5) Private Section
     I.1 Systems conformance and real-time interface
     J.1 Introduction
     J.2 Network compliance models
Figure J.1 – Sending system streams over a jitter-inducing network
     J.3 Network specification for jitter smoothing
Figure J.2 – Jitter-smoothing using network-layer timestamps
Bdj = JRmax
     J.4 Example decoder implementations
          J.4.1 Network adapter followed by an MPEG-2 decoder
          J.4.2 Integrated decoder
Figure J.3 – Integrated dejittering and MPEG-2 decoding
     K.1 Introduction
     K.2 The different types of splicing point
          K.2.1 Ordinary splicing points
          K.2.2 Seamless splicing points
     K.3 Decoder behaviour on splices
          K.3.1 On non-seamless splices
          K.3.2 On seamless splices
          K.3.3 Buffer overflow
     L.1 Procedure for the request of a Registered Identifier (RID)
     L.2 Responsibilities of the Registration Authority
          L.2.1 Contact information of the Registration Authority
     L.3 Responsibilities of parties requesting an RID
     L.4 Appeal procedure for denied applications
     M.1 Contact information of organization requesting a Registered Identifier (RID)
     M.2 Statement of an intention to apply the assigned RID
     M.3 Date of intended implementation of the RID
     M.4 Authorized representative
     M.5 For official use only of the Registration Authority
Examples
     O.1 Procedure for the request of an RID
     O.2 Responsibilities of the Registration Authority
     O.3 Contact information for the Registration Authority
     O.4 Responsibilities of parties requesting an RID
     O.5 Appeal procedure for denied applications
     P.1 Contact information of organization requesting an RID
     P.2 Request for a specific RID
     P.3 Short description of RID that is in use and date system that was implemented
     P.4 Statement of an intention to apply the assigned RID
     P.5 Date of intended implementation of the RID
     P.6 Authorized representative
     P.7 For official use of the Registration Authority
     Q.1 Introduction
     Q.2 Leak rate from transport buffer
Rxn = 1.2 × Rmax × N bits per second
     Q.3 Buffer size
BSn = BSmux + BSdec + BSoh
BSoh = 528 bytes
BSmux = 0.004 seconds × Rmax × N
     Q.3.1 TBSn: same as other audio
     Q.3.2 BSmux: different from other audio
     Q.3.3 BSdec: different from other audio
     Q.3.4 BSoh: different from other audio
          Example: sampling frequency is 48 kHz
     Q.4 Conclusion
     R.1 Content access procedure for ISO/IEC 14496 program components within a program stream
Figure R.1 – Example of ISO/IEC 14496 content in a program stream
     R.2 Content access procedure for ISO/IEC 14496 program components within a transport stream
Figure R.2 – Example of ISO/IEC 14496 content in a transport stream
Figure R.3 – Usage of MPEG-4 in a transport stream with BIFS scene referring to native PES
     S.1 Introduction
     S.2 J2K video access unit, J2K video elementary stream, J2K video sequence and J2K still picture
     S.3 Optional J2K block mode for high resolution support
     S.4 Optional J2K stripe mode for Ultra-Low Latency
When this mode is enabled, each J2K access unit contains N Rec. ITU-T T.800 | ISO/IEC 15444-1 codestreams where:
N = (strp_max_idx + 1) for progressive content
N = 2*(strp_max_idx + 1) for interlaced content
     S.5 Elementary stream header (elsm) and mapping to PES packets
Table S.1 – J2K Access unit elementary stream header
Figure S.1 – Structure and order of JPEG 2000 access units
     S.6 J2K transport constraints
     S.7 Interpretation of flags in adaptation and PES headers for J2K video elementary streams
     S.8 T-STD extension for J2K video elementary streams
          S.8.1 General
Figure S.2 – T-STD model extensions for J2K Video
TBn and EBn buffer management
Removal of J2K access units from EBn
STD delay
Buffer management conditions
     S.8.2 J2K video elementary stream buffer size
     T.1 Introduction
     T.2 MIME type and subtype
Singer@apple.com
ISO/IEC JTC1/SC29 (MPEG)
     T.3 Security considerations
     T.4 Parameters
          T.4.1 The profiles parameter
          T.4.2 The codecs parameter
Examples:
ISO/IEC 13818-2 Main Profile
     U.1 Introduction
          U.1.1 General
          U.1.2 Notation
Table U.1 – Variable field length notation example
Table U.1bis – Table U.1 in equivalent full notation
     U.1.3 Annex references
     U.2 TEMI access unit and TEMI elementary stream
Table U.2 – TEMI access unit
     U.3 AF descriptors
          U.3.1 Introduction
Table U.3 – AF descriptor tags
     U.3.2 Location descriptor
Table U.4 – TEMI location descriptor
     U.3.3 Semantic definition of fields in location descriptor
Table U.5 – TEMI URL scheme types
Table U.6 – TEMI service types
     U.3.4 Base URL descriptor
Table U.7 – TEMI base URL descriptor
     U.3.5 Semantic definition of fields in Base URL descriptor
     U.3.6 Timeline descriptor
Table U.8 – TEMI timeline descriptor
     U.3.7 Semantic definition of fields in the timeline descriptor
MTi = timescale * (PTSi - PTS0) / 90000 + MT0
NTPi = (PTSi - PTS0) / 90000.0 + NTP0
PTPi = (PTSi - PTS0) / 90000.0 + PTP0
     U.3.8 MPEG-H 3dAudio extStreamID descriptor
Table U.9 – TEMI MPEG-H_3dAudio_extStreamID descriptor
     U.3.9 Semantic definition of fields in AF_MPEG-H_3dAudio extStreamID descriptor
     U.3.10 MPEG-H_3dAudio multi-stream and command descriptors
     U.3.11 Boundary descriptor
Figure U.1 – Stream partitioning into 2 and 5 second segments
     U.3.12 Semantic definition of fields in boundary descriptor
Table U.11 – sequence_number_length_code interpretation
     U.3.13 Labelling Descriptor
Table U.12 – Labelling Descriptor
     U.3.14 Semantic definition of fields in labelling descriptor
Table U.13 – label_length_code interpretation
Table U.14 – label_type values
     U.3.15 HEVC tile substream af_descriptor
Table U.15 – HEVC tile substream af_descriptor
     V.1 Introduction
Figure V.1 – Illustration of HEVC tiled encoding of panoramic content beyond UHD
     V.2 HEVC tile substream identification example
Figure V.2 – Example of HEVC tile substream identification
     V.3 Subregion layout example
Figure V.3 – Example of subregion layout for a 3 x 3 RoI