Rec. ITU-T H.222.0 (08/2018) - Information technology – Generic coding of moving pictures and associated audio information: Systems Summary History FOREWORD Table of Contents Introduction Figure Intro. 1 – Simplified overview of the scope of this Recommendation | International Standard Intro. 1 Transport stream Figure Intro. 2 – Prototypical transport demultiplexing and decoding example Figure Intro. 3 – Prototypical transport multiplexing example Figure Intro. 4 – Prototypical transport stream to program stream conversion Figure Intro. 5 – Prototypical decoder for program streams Intro. 3 Conversion between transport stream and program stream Intro. 4 Packetized elementary stream Intro. 5 Timing model Intro. 6 Conditional access Intro. 7 Multiplex-wide operations Intro. 8.2 Synchronization Intro. 8.3 Relation to compression layer Intro. 9 System reference decoder Intro. 10 Applications INTERNATIONAL STANDARD ITU-T RECOMMENDATION SECTION 1 – GENERAL 1.1 Scope 1.2 Normative references 1.2.1 Identical Recommendations | International Standards 1.2.2 Paired Recommendations | International Standards equivalent in technical content 1.2.3 Additional references SECTION 2 – TECHNICAL ELEMENTS 2.1 Definitions NOTE – Usage of J2K blocks requires J2K block mode (defined in 2.1.72 2.1.100 pack (system): A pack consists of a pack header followed by zero or more packets. It is a layer in the system coding syntax described in 2.5.3.3. 2.1.101 packet data (system): Contiguous bytes of data from an elementary stream present in a packet. 2.2 Symbols and abbreviations 2.2.1 Arithmetic operators 2.2.2 Assignment 2.2.3 Bitwise operators 2.2.4 Constants 2.2.5 Logical operators 2.2.6 Mnemonics 2.2.7 Range operator 2.2.8 Relational operators 2.3 Method of describing bit stream syntax 2.4 Transport stream bitstream requirements 2.4.1 Transport stream coding structure and parameters 2.4.2 Transport stream system target decoder 2.4.2.1 General Figure 2-1 – Transport stream system target decoder notation 2.4.2.2 System clock frequency 2.4.2.3 Input to the transport stream system target decoder Transport streams with multiple programs and variable rate 2.4.2.4 Buffering Buffer BSn Audio Systems Video Leak method Vbv_delay method Removal of access units System data Low delay Trick mode 2.4.2.5 Decoding 2.4.2.6 Presentation 2.4.2.7 Buffer management Definition of overflow and underflow 2.4.2.8 T-STD extensions for carriage of ISO/IEC 14496 data 2.4.2.9 T-STD extensions for carriage of Rec. ITU-T H.264 | ISO/IEC 14496-10 video 2.4.2.10 T-STD extensions for carriage of ISO/IEC 14496-17 text streams 2.4.2.11 T-STD extensions for carriage of J2K video elementary streams 2.4.2.12 T-STD extensions for carriage of HEVC 2.4.2.13 T-STD extensions for carriage of MVCD video sub-bitstream 2.4.2.14 T-STD extensions for carriage of MV HEVC and SHVC 2.4.3 Specification of the transport stream syntax and semantics 2.4.3.1 Transport stream Table 2-1 – Transport stream 2.4.3.2 Transport stream packet layer Table 2-2 – Transport packet of this Recommendation | International Standard 2.4.3.3 Semantic definition of fields in transport stream packet layer Table 2-3 – PID table Table 2-4 – Scrambling control values Table 2-5 – Adaptation field control values 2.4.3.4 Adaptation field 2.4.3.5 Semantic definition of fields in adaptation field 2.4.3.6 PES packet 2.4.3.7 Semantic definition of fields in PES packet Table 2-24 – Trick mode control values Table 2-25 – Field_id field control values Table 2-26 – Coefficient selection values 2.4.3.8 Carriage of program streams and ISO/IEC 11172-1 Systems streams in the transport stream 2.4.4 Program-specific information 2.4.4.1 General Table 2-28 – Program-specific information 2.4.4.2 Pointer Table 2-29 – Program-specific information pointer 2.4.4.3 Semantics definition of fields in pointer syntax 2.4.4.4 Program association table 2.4.4.5 Table_id assignments 2.4.4.6 Semantic definition of fields in program association section 2.4.4.7 Conditional access table Table 2-32 – Conditional access section 2.4.4.8 Semantic definition of fields in conditional access section 2.4.4.9 Program map table Table 2-33 – Transport stream program map section 2.4.4.10 Semantic definition of fields in transport stream program map section 2.4.4.11 Syntax of the private section Table 2-35 – Private section 2.4.4.12 Semantic definition of fields in private section 2.4.4.13 Syntax of the transport stream section Table 2-36 – The transport stream description table 2.4.4.14 Semantic definition of fields in the transport stream section 2.5 Program stream bitstream requirements 2.5.1 Program stream coding structure and parameters 2.5.2 Program stream system target decoder Figure 2-2 – Program stream system target decoder notation 2.5.2.1 System clock frequency 2.5.2.2 Input to the program stream system target decoder 2.5.2.3 Buffering 2.5.2.4 PES streams 2.5.2.5 Decoding and presentation 2.5.2.6 P-STD extensions for carriage of ISO/IEC 14496 data 2.5.2.7 P-STD extensions for carriage of Rec. ITU-T H.264 | ISO/IEC 14496-10 video 2.5.2.8 P-STD extensions for carriage of ISO/IEC 14496-17 text streams 2.5.3 Specification of the program stream syntax and semantics 2.5.3.1 Program stream Table 2-37 – Program stream 2.5.3.2 Semantic definition of fields in program stream 2.5.3.3 Pack layer of program stream Table 2-38 – Program stream pack 2.5.3.4 Semantic definition of fields in program stream pack 2.5.3.5 System header Table 2-40 – Program stream system header 2.5.3.6 Semantic definition of fields in system header 2.5.3.7 Packet layer of program stream 2.5.4 Program stream map 2.5.4.1 Syntax of program stream map Table 2-41 – Program stream map 2.5.4.2 Semantic definition of fields in program stream map 2.5.5 Program stream directory 2.5.5.1 Syntax of program stream directory packet Table 2-42 – Program stream directory packet 2.5.5.2 Semantic definition of fields in program stream directory Table 2-43 – Intra_coded indicator Table 2-44 – Coding_parameters indicator 2.6 Program and program element descriptors 2.6.1 Semantic definition of fields in program and program element descriptors 2.6.2 Video stream descriptor Table 2-46 – Video stream descriptor 2.6.3 Semantic definitions of fields in video stream descriptor Table 2-47 – Frame rate code 2.6.4 Audio stream descriptor 2.6.5 Semantic definition of fields in audio stream descriptor 2.6.6 Hierarchy descriptor 2.6.7 Semantic definition of fields in hierarchy descriptor 2.6.8 Registration descriptor Table 2-51 – Registration descriptor 2.6.9 Semantic definition of fields in registration descriptor 2.6.10 Data stream alignment descriptor Table 2-52 – Data stream alignment descriptor 2.6.11 Semantics of fields in data stream alignment descriptor Table 2-53 – Video stream alignment values Table 2-54 – AVC video stream alignment values Table 2-55 – HEVC video stream alignment values Table 2-56 – Audio stream alignment values 2.6.12 Target background grid descriptor Figure 2-3 – Target background grid descriptor display area 2.6.13 Semantics of fields in target background grid descriptor Table 2-57 – Target background grid descriptor 2.6.14 Video window descriptor Table 2-58 – Video window descriptor 2.6.15 Semantic definition of fields in video window descriptor 2.6.16 Conditional access descriptor Table 2-59 – Conditional access descriptor 2.6.17 Semantic definition of fields in conditional access descriptor 2.6.18 ISO 639 language descriptor Table 2-60 – ISO 639 language descriptor 2.6.19 Semantic definition of fields in ISO 639 language descriptor Table 2-61 – Audio type values 2.6.20 System clock descriptor Table 2-62 – System clock descriptor 2.6.21 Semantic definition of fields in system clock descriptor 2.6.22 Multiplex buffer utilization descriptor Table 2-63 – Multiplex buffer utilization descriptor 2.6.23 Semantic definition of fields in multiplex buffer utilization descriptor 2.6.24 Copyright descriptor Table 2-64 – Copyright descriptor 2.6.25 Semantic definition of fields in copyright descriptor 2.6.26 Maximum bitrate descriptor Table 2-65 – Maximum bitrate descriptor 2.6.27 Semantic definition of fields in maximum bitrate descriptor 2.6.28 Private data indicator descriptor Table 2-66 – Private data indicator descriptor 2.6.29 Semantic definition of fields in Private data indicator descriptor 2.6.30 Smoothing buffer descriptor Table 2-67 – Smoothing buffer descriptor 2.6.31 Semantic definition of fields in smoothing buffer descriptor 2.6.32 STD descriptor Table 2-68 – STD descriptor 2.6.33 Semantic definition of fields in STD descriptor 2.6.34 IBP descriptor Table 2-69 – IBP descriptor 2.6.35 Semantic definition of fields in IBP descriptor 2.6.36 MPEG-4 video descriptor Table 2-70 – MPEG-4 video descriptor 2.6.37 Semantic definition of fields in MPEG-4 video descriptor 2.6.38 MPEG-4 audio descriptor Table 2-71 – MPEG-4 audio descriptor 2.6.39 Semantic definition of fields in MPEG-4 audio descriptor 2.6.40 IOD descriptor Table 2-73 – IOD descriptor 2.6.41 Semantic definition of fields in IOD descriptor 2.6.42 SL descriptor Table 2-74 – SL descriptor 2.6.43 Semantic definition of fields in SL descriptor 2.6.44 FMC descriptor Table 2-75 – FMC descriptor 2.6.45 Semantic definition of fields in FMC descriptor 2.6.46 External_ES_ID descriptor Table 2-76 – External_ES_ID descriptor 2.6.47 Semantic definition of fields in External_ES_ID descriptor 2.6.48 Muxcode descriptor Table 2-77 – Muxcode descriptor 2.6.49 Semantic definition of fields in Muxcode descriptor 2.6.50 FmxBufferSize descriptor Table 2-78 – FmxBufferSize descriptor 2.6.51 Semantic definition of fields in FmxBufferSize descriptor 2.6.52 MultiplexBuffer descriptor Table 2-79 – MultiplexBuffer descriptor 2.6.53 Semantic definition of fields in MultiplexBuffer descriptor 2.6.54 FlexMuxTiming descriptor Table 2-80 – FlexMuxTiming descriptor 2.6.55 Semantic definition of fields in FlexMuxTiming descriptor 2.6.56 Content labelling descriptor Table 2-81 – Content labelling descriptor 2.6.57 Semantic definition of fields in content labelling descriptor Table 2-83 – Content_time_base_indicator values 2.6.58 Metadata pointer descriptor Table 2-84 – Metadata pointer descriptor 2.6.59 Semantic definition of fields in metadata pointer descriptor Table 2-85 – Metadata format values Table 2-86 – MPEG_carriage_flags 2.6.60 Metadata descriptor 2.6.61 Semantic definition of fields in metadata descriptor Table 2-88 – decoder_config_flags 2.6.62 Metadata STD descriptor Table 2-89 – Metadata STD descriptor 2.6.63 Semantic definition of fields in metadata STD descriptor 2.6.64 AVC video descriptor Table 2-90 – AVC video descriptor 2.6.65 Semantic definition of fields in AVC video descriptor 2.6.66 AVC timing and HRD descriptor Table 2-91 – AVC timing and HRD descriptor 2.6.67 Semantic definition of fields in AVC timing and HRD descriptor 2.6.68 MPEG-2 AAC audio descriptor Table 2-92 – MPEG-2 AAC_audio_descriptor 2.6.69 Semantic definition of fields in MPEG-2 AAC audio descriptor Table 2-93 – MPEG-2_AAC_additional_information field values 2.6.70 MPEG-4 text descriptor Table 2-94 – MPEG-4 text descriptor 2.6.71 Semantic definition of fields in MPEG-4 text descriptor 2.6.72 MPEG-4 audio extension descriptor Table 2-95 – MPEG-4 audio extension descriptor 2.6.73 Semantic definition of fields in MPEG-4 audio extension descriptor 2.6.74 Auxiliary video stream descriptor Table 2-96 – Auxiliary video stream descriptor 2.6.75 Semantic definition of fields in auxiliary video stream descriptor 2.6.76 SVC extension descriptor Table 2-97 – SVC extension descriptor 2.6.77 Semantic definition of fields in SVC extension descriptor 2.6.78 MVC extension descriptor Table 2-98 – MVC extension descriptor 2.6.79 Semantics of fields in MVC extension descriptor 2.6.80 J2K video descriptor 2.6.81 Semantics of fields in J2K video descriptor DEN_frame_rate – This field shall be coded the same as frat_denominator field specified in Table S.1 (see Annex S). NUM_frame_rate – This field shall be coded the same as frat_numerator field specified in Table S.1 (see Annex S). Table 2-100 – Example frame rates based on DEN_frame_rate and NUM_frame_rate values If for some reason the MaxCLL and/or MaxFALL values are unknown, the value 0x0000 shall be used. 2.6.82 MVC operation point descriptor Table 2-101 – MVC operation point descriptor 2.6.83 Semantic definition of fields in MVC operation point descriptor 2.6.84 MPEG2_stereoscopic_video_ format_ descriptor Table 2-102 – MPEG2_stereoscopic_video_format_descriptor syntax 2.6.85 Semantic definition of fields in the MPEG2_stereoscopic_video_format_descriptor 2.6.86 Stereoscopic_program_info_descriptor Table 2-103 – Stereoscopic_program_info_descriptor syntax 2.6.87 Semantic definition of fields in the stereoscopic_program_info_descriptor Table 2-104 – Stereoscopic_service_type values 2.6.88 Stereoscopic_video_info_descriptor Table 2-105 – Stereoscopic_video_info_descriptor syntax 2.6.89 Semantic definition of fields in the stereoscopic_video_info_descriptor Table 2-106 – Upsampling factor values 2.6.90 Extension descriptor 2.6.91 Semantic definition of fields in the extension descriptor HEVC_tile_substream_descriptor() – This structure is defined in 2.6.122 and 2.6.123. HEVC_subregion_descriptor() – This structure is defined in 2.6.125 and 2.6.126. Table 2-108 – Extension descriptor tag values 2.6.92 ODUpdate_descriptor 2.6.93 Transport_profile_descriptor Table 2-109 – Transport_profile_descriptor syntax 2.6.94 Semantic definition of fields in the Transport_profile_descriptor Table 2-110 – Transport_profile values 2.6.95 HEVC video descriptor Table 2-111 – HEVC video descriptor 2.6.96 Semantic definition of fields in HEVC video descriptor Table 2-112 – Semantics of HDR_WGC_idc 2.6.97 HEVC timing and HRD descriptor Table 2-113 – HEVC timing and HRD descriptor 2.6.98 Semantic definition of fields in HEVC timing and HRD descriptor 2.6.99 AF extensions descriptor Table 2-114 – Adaptation field extension descriptor 2.6.100 HEVC operation point descriptor Table 2-115 – HEVC operation point descriptor 2.6.101 Semantic definition of fields in HEVC operation point descriptor 2.6.102 HEVC hierarchy extension descriptor Table 2-116 – HEVC hierarchy extension descriptor 2.6.103 Semantic definition of fields in HEVC hierarchy extension descriptor Table 2-117 – Semantics of extension dimension bits 2.6.104 Green extension descriptor Table 2-118 – Green extension descriptor 2.6.105 Semantics for green extension descriptor 2.6.106 MPEG-H 3D audio descriptor Table 2-119 – MPEG-H 3D audio descriptor 2.6.107 Semantics for MPEG-H 3D audio descriptor 2.6.108 MPEG-H 3D audio config descriptor Table 2-120 – MPEG-H 3D audio config descriptor 2.6.109 Semantics for MPEG-H 3D audio config descriptor 2.6.110 MPEG-H 3D audio scene descriptor 2.6.111 Semantic definition of fields in MPEG-H 3D audio scene descriptor 2.6.112 MPEG-H 3D audio text label descriptor Table 2-122 – MPEG-H 3D audio text label descriptor 2.6.113 Semantic definition of fields in MPEG-H 3D audio text label descriptor 2.6.114 MPEG-H 3D audio multi-stream descriptor 2.6.115 Semantic definition of fields in MPEG-H 3D audio multi-stream descriptor 2.6.116 MPEG-H 3D audio DRC and Loudness descriptor 2.6.117 Semantic definition of fields in MPEG-H 3D audio DRC and Loudness descriptor 2.6.118 MPEG-H 3D audio command descriptor Table 2-125 – MPEG-H 3D audio command descriptor 2.6.119 Quality extension descriptor Table 2-126 – Quality extension descriptor 2.6.120 Virtual segmentation descriptor Table 2-127 – Virtual segmentation descriptor 2.6.121 Semantic definition of fields in virtual segmentation descriptor 2.6.122 HEVC tile substream descriptor Table 2-128 – HEVC tile substream descriptor 2.6.123 Semantic definition of the fields in the HEVC tile substream descriptor NOTE – If ReferenceFlag is set to '0', Flag[0] has the same semantics as PreambleFlag for the case that ReferenceFlag is set to '1'. 2.6.124 HEVC tile substream af_descriptor 2.6.125 HEVC subregion descriptor Table 2-129 — HEVC subregion descriptor 2.6.126 Semantic definition of the fields in the HEVC subregion descriptor SubstreamCountMinus1[i] – The value of this 8-bit field in the range of 0 to (TotalSubstreamIDs – 1) indicates PictureSizeHor[i] – This 16-bit field indicates the horizontal subregion dimension, measured in pixels. PictureSizeVert[i] – This 16-bit field indicates the vertical subregion dimension, measured in pixels. PatternCount[i] – This 7-bit field indicates the number of different subregion layouts. SubstreamOffset[k][j][i] – This array of 7 bit fields indicates the offset that has to be added 2.7 Restrictions on the multiplexed stream semantics 2.7.1 Frequency of coding the system clock reference 2.7.2 Frequency of coding the program clock reference 2.7.3 Frequency of coding the elementary stream clock reference 2.7.4 Frequency of presentation timestamp coding 2.7.5 Conditional coding of timestamps 2.7.6 Timing constraints for scalable coding 2.7.7 Frequency of coding P-STD_buffer_size in PES packet headers 2.7.8 Coding of system header in the program stream 2.7.9 Constrained system parameter program stream Packet rate Decoder buffer size BSn ≤ 8976 bytes 2.7.10 Transport stream Sample rate locking in transport streams 2.8 Compatibility with ISO/IEC 11172 2.9 Registration of copyright identifiers 2.9.1 General 2.9.2 Implementation of a Registration Authority (RA) 2.10 Registration of private data format 2.10.1 General 2.10.2 Implementation of a Registration Authority (RA) 2.11 Carriage of ISO/IEC 14496 data 2.11.1 Introduction 2.11.2 Carriage of individual ISO/IEC 14496-2 and 14496-3 Elementary Streams in PES packets 2.11.2.1 Introduction Table 2-130 – Carriage of individual ISO/IEC 14496 streams in Rec. ITU-T H.222.0 | ISO/IEC 13818-1 2.11.2.2 STD extensions for individual ISO/IEC 14496 elementary streams Figure 2-4 – T-STD model extensions for individual ISO/IEC 14496 elementary streams 2.11.3 Carriage of audiovisual ISO/IEC 14496-1 scenes and associated ISO/IEC 14496 streams 2.11.3.1 Introduction 2.11.3.2 Assignment of ES_ID values 2.11.3.3 Timing of ISO/IEC 14496 scenes and associated streams If X(t) = fstc(t)/fobject(t) 2.11.3.4 Delivery timing of SL-packetized streams 2.11.3.5 Delivery timing of FlexMux streams 2.11.3.6 Carriage of SL-packetized streams in PES packets 2.11.3.7 Carriage of FlexMux streams in PES packets 2.11.3.8 Carriage of SL packets and FlexMux packets in sections Table 2-131 – Section syntax for transport of ISO/IEC 14496 stream 2.11.3.9 T-STD extensions 2.11.3.9.1 T-STD Model for 14496 content Figure 2-5 – T-STD model for ISO/IEC 14496 content 2.11.3.9.2 Processing of FlexMux streams 2.11.3.9.3 Definition of FlexMux Buffer, FBnp 2.11.3.9.4 Processing of SL-packetized streams 2.11.3.9.5 Buffer management 2.11.3.10 Carriage within a transport stream 2.11.3.10.1 Overview 2.11.3.10.2 Initial Object Descriptor 2.11.3.11 P-STD Model for 14496 content Figure 2-6 – P-STD model for ISO/IEC 14496 Systems stream 2.11.3.11.1 Processing of FlexMux streams 2.11.3.11.2 Definition of FlexMux Buffer, FBnp 2.11.3.11.3 Processing of SL-packetized streams 2.11.3.11.4 Buffer management 2.11.3.12 Carriage within a program stream 2.11.3.12.1 Overview 2.11.3.12.2 Initial object descriptor 2.12 Carriage of metadata 2.12.1 Introduction 2.12.2 Metadata time-line model Figure 2-7 – Timing model for delivery of content and metadata Figure 2-8 – Delivery of metadata in PES packets 2.12.3 Options for transport of metadata 2.12.4 Use of PES packets to transport metadata 2.12.4.1 General 2.12.4.2 Metadata Access Unit Wrapper Table 2-133 – Metadata Access Unit Wrapper Table 2-135 – Cell fragment indication 2.12.5 Use of the DSM-CC synchronized download protocol to transport metadata 2.12.6 Use of metadata sections to transport metadata Table 2-136 – Section syntax for transport of metadata Table 2-137 – Section fragment indication 2.12.7 Use of the DSM-CC data carousel to transport metadata 2.12.8 Use of the DSM-CC object carousel to transport metadata 2.12.9 Metadata-related signalling 2.12.9.1 General 2.12.9.2 Signalling of metadata services and streams 2.12.9.3 Signalling of content for use by a metadata system 2.12.9.4 Association of metadata to content 2.12.9.5 Signalling decoder configuration data 2.12.9.6 Overview of metadata signalling Figure 2-9 – Metadata signalling and referencing 2.12.10 STD model for metadata Figure 2-10 – Metadata decoding in the STD 2.13 Carriage of ISO 15938 data 2.13.1 Introduction 2.13.2 ISO 15938 decoder configuration data 2.14 Carriage of Rec. ITU-T H.264 | ISO/IEC 14496-10 video 2.14.1 Introduction 2.14.2 Carriage in PES packets 2.14.3 STD extensions 2.14.3.1 T-STD extensions Figure 2-11 – T-STD model extensions for Rec. ITU-T H.264 | ISO/IEC 14496-10 video DPBn buffer management TBn, MBn and EBn buffer management Transfer between MBn and EBn STD delay Buffer management conditions 2.14.3.2 P-STD extensions Figure 2-12 – P-STD model extensions for Rec. ITU-T H.264 | ISO/IEC 14496-10 video DPBn buffer management Bn buffer management STD delay Buffer management conditions 2.14.3.3 View and dependency representation delimiter NAL unit Table 2-138 – View and dependency representation delimiter NAL unit 2.14.3.4 Semantics of view and dependency representation delimiter NAL unit 2.14.3.5 T-STD extensions for SVC Carriage in PES packets DPB buffer management TBn, MBn, EBn buffer management Rxn = bit_rate Rbxn = 1200 × MaxBR[level]n Access unit re-assembling and EB removal STD delay Buffer management conditions 2.14.3.6 P-STD extensions for SVC Carriage in PES packets DPB buffer management Bn buffer management Access unit re-assembling and B removal STD delay Buffer management conditions 2.14.3.7 T-STD extensions for MVC and MVCD Carriage in PES packets DPB buffer management TBn, MBn, EBn buffer management Rbxn = 1200 × MaxBR[level]n Access unit re-assembling and EB removal STD delay Buffer management conditions Carriage of MVCD sub-bitstream in PES packets 2.14.3.8 P-STD extensions for MVC Carriage in PES packets DPB buffer management Bn buffer management Access unit re-assembling and B removal STD delay Buffer management conditions 2.15 Carriage of ISO/IEC 14496-17 text streams 2.15.1 Introduction 2.15.2 Carriage in PES packets 2.15.3 STD extensions 2.15.3.1 T-STD extensions Figure 2-17 – T-STD model extensions for ISO/IEC 14496-17 text streams Buffer management 2.15.3.2 P-STD extensions 2.16 Carriage of auxiliary video streams 2.17 Carriage of HEVC 2.17.1 Constraints for the transport of HEVC Carriage in PES packets DPB buffer management 2.17.2 T-STD Extensions for single layer HEVC Figure 2-18 – T-STD model extensions for single layer HEVC TBn, MBn, EBn buffer management BSoh = (1/750) seconds ( max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s} BSmux = 0.004 seconds ( max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s} EBSn = cpb_size Rxn = bit_rate Rbxn = BrNalFactor × MaxBR[tier, level] STD delay Buffer management conditions 2.17.3 T-STD extensions for layered transport of HEVC temporal video subsets Figure 2-19 – T-STD model extensions for layered transport of HEVC temporal video subsets TBn,k, MBn,k, EBn buffer management MBSn,k = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size EBSn = cpb_size Rxn,k = bit_rate Rbxn,k = BrNalFactor × MaxBR[tier, level] STD delay Buffer management conditions 2.17.4 T-STD extensions for layered transport of HEVC sub-partitions with bitstream-partition-specific CPB operation Figure 2-20 – T-STD model extensions for bitstream-partition-specific CPB operation TBl,k, MBl,k, EBl buffer management MBSl,k = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size Aggregation of elementary streams Table 2-139 – Implied hierarchy_layer_index if no hierarchy descriptors are used Carriage in PES packets STD delay Buffer management conditions 2.17.5 T-STD Extensions for the carriage of HEVC tiles 2.17.5.1 Carriage of HEVC motion-constrained tile sets There are alternative options to carry the HEVC tile substreams: 2.17.5.2 Carriage of HEVC motion-constrained tile sets as separate elementary streams Figure 2-21 – T-STD model extensions for transport of HEVC tiles through individual ESs Process to access a subregion: The HEVC subregion descriptor and all HEVC tile substream descriptors shall be read. Next, it checks if the applicable HEVC tile substream descriptor provides a PatternReference value. With that PatternReference, it selects the applicable SubstreamOffset values: SubstreamOffset[k][PatternReference][i] with 0 < k < SubstreamCountMinus1[i] TBn, MBn, EBn buffer management The following additional notations are used to describe the T-STD extensions and are illustrated in Figure 2-21. MBSn = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size (measured in bytes) BSoh = (1/750) seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s} BSmux = 0.004 seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s} SBS = cpb_sizeH SBS = ∑k (SBSk) with n ≤ k ≤ n+m 2.17.5.3 Carriage of HEVC motion-constrained tile sets in a common ES using AF descriptors Figure 2-22 – T-STD model extensions for transport of HEVC tiles in a common ES using AF descriptors Next, it selects the SubstreamIDx that relates to the upper left HEVC tile substream of the region to be displayed. The receiver sets up an empty list of SubstreamIDs that belong to the subregion and adds Next, it checks if the applicable HEVC tile substream af_descriptor provides a PatternReference value. If there is no explicit PatternReference value, which means that either descriptor_length equals 1 or ReferenceFlag equals 0, the index j defaults t... With that PatternReference, it selects the applicable SubstreamOffset values: SubstreamOffset[k][PatternReference][i] with 0 < k < SubstreamCountMinus1[i] TB, MB, RB buffer management MBS = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size (measured in bytes) BSoh, packet overhead buffering, is defined as: BSoh = (1/750) seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s} BSmux = 0.004 seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s} RBS = cpb_sizeH 2.17.5.4 Carriage of HEVC motion-constrained tile sets in a common ES ignoring AF descriptors Figure 2-23 – T-STD model extension for transport of HEVC tiles in a common ES ignoring AF descriptors TB, MB, EB buffer management MBS = BSmux + BSoh + CpbNalFactor × MaxCPB[tier, level] – cpb_size (measured in bytes) BSoh = (1/750) seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s} BSmux = 0.004 seconds × max{ BrNalFactor × MaxBR[tier, level], 2 000 000 bit/s} EBS = cpb_size 2.18 Carriage of green access units 2.18.1 Carriage of green access units in MPEG-2 sections 2.18.2 Semantics of green access unit section 2.18.3 Green access unit Table 2-141 – Green access unit 2.18.4 Timing relationship between green access unit and media access unit 2.18.5 Buffer model for processing green access units Figure 2-24 – T-STD model extension for green access units 2.19 Carriage of ISO/IEC 23008-3 MPEG-H 3D audio data 2.19.1 Introduction 2.19.2 Carriage in PES packets 2.19.3 STD extensions for ISO/IEC 23008-3 elementary streams 2.19.4 STD extensions for multiple ISO/IEC 23008-3 elementary streams Figure 2-25 – Transport stream system target decoder for multiple audio elementary streams 2.19.5 MPEG-2 Transport stream random access constraints and signalling 2.20 Carriage of Quality Access Units in MPEG-2 sections 2.20.1 General description Table 2-142 – Quality Access Unit 2.20.2 Buffer model for processing Quality Access Units Figure 2-26 – Quality Access Unit decoder processing model 2.21 Carriage of sample variants 2.21.1 Introduction 2.21.2 Constraints Sample variants shall be carried in an MPEG-2 TS metadata stream that complies with the following constraints: 2.21.3 Sample Variants decoder configuration data 2.22 Carriage of Media Orchestration Access Units A.1 CRC decoder model Figure A.1 – 32-bit CRC decoder model B.1 Introduction B.1.1 Purpose B.1.2 Future applications Video on demand Interactive video services Video networks B.1.3 Benefits B.1.4 Basic functions B.1.4.1 Stream selection B.1.4.2 Retrieval B.1.4.3 Storage B.2 General elements B.2.1 Scope B.2.2 Overview of the DSM-CC application Figure B.1 – Configuration of DSM-CC application B.2.3 The transmission of DSM-CC commands and acknowledgements Figure B.2 – BSM-CC bitstream decoded as a stand-alone bitstream Figure B.3 – DSM-CC bitstream decoded as part of the system bitstream B.3 Technical elements B.3.1 Definitions B.3.2 Specification of DSM-CC syntax Table B.1 – DSM-CC syntax B.3.3 Semantics of fields in specification of DSM-CC syntax Table B.2 – Command_id assigned values B.3.4 Control layer Constraints on setting flags in DSM-CC control Table B.3 – DSM-CC control B.3.5 Semantics of fields in control layer Table B.4 – Select mode assigned values B.3.6 Acknowledgement layer Constraints on setting flags in DSM-CC control Table B.5 – DSM-CC Acknowledgement B.3.7 Semantics of fields in acknowledgement layer B.3.8 Time code Constraints on time code Table B.6 – Time code B.3.9 Semantics of fields in time code C.1 Explanation of program-specific information in transport streams C.2 Introduction C.3 Functional mechanism i) table_id ii) table_id_extension iii) section_number iv) version_number v) current_next_indicator C.4 The mapping of sections into transport stream packets C.5 Repetition rates and random access C.6 What is a program? C.7 Allocation of program_number i) multilingual transmissions into separate markets ii) Very large program definitions C.8 Usage of PSI in a typical system C.9 The relationships of PSI structures Figure C.1 – Program and network mapping relationships C.9.1 Program Association Table C.9.2 Program map table C.9.3 Conditional access table C.9.4 Network information table C.9.5 Private_section() C.9.6 Descriptors Table C.1 – Composite_descriptor C.10 Bandwidth utilization and signal acquisition time 6-MHz CATV channel OC-3 fibre channel (155 Mbit/s) C-band satellite transponder D.1 Introduction D.1.1 Timing model Figure D.1 – Constant delay model D.1.2 Audio and video presentation synchronization D.1.3 System time clock recovery in the decoder Figure D.2 – STC recovery using PLL D.1.4 SCR and PCR jitter Table D.1 – Re-multiplexing strategy D.1.5 Clock recovery in the presence of network jitter D.1.6 System clock used for chroma sub-carrier generation D.1.7 Component video and audio reconstruction D.1.8 Frame slipping D.1.9 Smoothing of network jitter E.1 General considerations E.2 Suggestion Table E.1 – PES packet header example F.1 Introduction F.1.1 Transport stream syntax Figure F.1 – Transport stream syntax diagram F.1.2 PES packet Figure F.2 – PES packet syntax diagram F.1.3 Program Association section Figure F.3 – Program association section diagram F.1.4 CA section Figure F.4 – Conditional access section diagram F.1.5 TS program map section Figure F.5 – TS program map section diagram F.1.6 Private section Figure F.6 – Private section diagram F.1.7 Program stream Figure F.7 – Program stream diagram F.1.8 Program stream map Figure F.8 – Program stream map diagram G.1 General information G.1.1 Sync byte emulation G.1.2 Skipped picture status and decoding process G.1.3 Selection of PID values G.1.4 PES start_code emulation H.1 Private data 1) Transport stream packet Table 2-2 2) Transport stream adaptation field Table 2-6 3) PES packet Table 2-21 4) Descriptors 5) Private Section I.1 Systems conformance and real-time interface J.1 Introduction J.2 Network compliance models Figure J.1 – Sending system streams over a jitter-inducing network J.3 Network specification for jitter smoothing Figure J.2 – Jitter-smoothing using network-layer timestamps Bdj = JRmax J.4 Example decoder implementations J.4.1 Network adapter followed by an MPEG-2 decoder J.4.2 Integrated decoder Figure J.3 – Integrated dejittering and MPEG-2 decoding K.1 Introduction K.2 The different types of splicing point K.2.1 Ordinary splicing points K.2.2 Seamless splicing points K.3 Decoder behaviour on splices K.3.1 On non-seamless splices K.3.2 On seamless splices K.3.3 Buffer overflow L.1 Procedure for the request of a Registered Identifier (RID) L.2 Responsibilities of the Registration Authority L.2.1 Contact information of the Registration Authority L.3 Responsibilities of parties requesting an RID L.4 Appeal procedure for denied applications M.1 Contact information of organization requesting a Registered Identifier (RID) M.2 Statement of an intention to apply the assigned RID M.3 Date of intended implementation of the RID M.4 Authorized representative M.5 For official use only of the Registration Authority Examples O.1 Procedure for the request of an RID O.2 Responsibilities of the Registration Authority O.3 Contact information for the Registration Authority O.4 Responsibilities of parties requesting an RID O.5 Appeal procedure for denied applications P.1 Contact information of organization requesting an RID P.2 Request for a specific RID P.3 Short description of RID that is in use and date system that was implemented P.4 Statement of an intention to apply the assigned RID P.5 Date of intended implementation of the RID P.6 Authorized representative P.7 For official use of the Registration Authority Q.1 Introduction Q.2 Leak rate from transport buffer Rxn = 1.2 × Rmax × N bits per second Q.3 Buffer size BSn = BSmux + BSdec + BSoh BSoh = 528 bytes BSmux = 0.004 seconds × Rmax × N Q.3.1 TBSn: same as other audio Q.3.2 BSmux: different from other audio Q.3.3 BSdec: different from other audio Q.3.4 BSoh: different from other audio Example: sampling frequency is 48 kHz Q.4 Conclusion R.1 Content access procedure for ISO/IEC 14496 program components within a program stream Figure R.1 – Example of ISO/IEC 14496 content in a program stream R.2 Content access procedure for ISO/IEC 14496 program components within a transport stream Figure R.2 – Example of ISO/IEC 14496 content in a transport stream Figure R.3 – Usage of MPEG-4 in a transport stream with BIFS scene referring to native PES S.1 Introduction S.2 J2K video access unit, J2K video elementary stream, J2K video sequence and J2K still picture S.3 Optional J2K block mode for high resolution support S.4 Optional J2K stripe mode for Ultra-Low Latency When this mode is enabled, each J2K access unit contains N Rec. ITU-T T.800 | ISO/IEC 15444-1 codestreams where: N = (strp_max_idx + 1) for progressive content N = 2*(strp_max_idx + 1) for interlaced content S.5 Elementary stream header (elsm) and mapping to PES packets Table S.1 – J2K Access unit elementary stream header Figure S.1 – Structure and order of JPEG 2000 access units S.6 J2K transport constraints S.7 Interpretation of flags in adaptation and PES headers for J2K video elementary streams S.8 T-STD extension for J2K video elementary streams S.8.1 General Figure S.2 – T-STD model extensions for J2K Video TBn and EBn buffer management Removal of J2K access units from EBn STD delay Buffer management conditions S.8.2 J2K video elementary stream buffer size T.1 Introduction T.2 MIME type and subtype Singer@apple.com ISO/IEC JTC1/SC29 (MPEG) T.3 Security considerations T.4 Parameters T.4.1 The profiles parameter T.4.2 The codecs parameter Examples: ISO/IEC 13818-2 Main Profile U.1 Introduction U.1.1 General U.1.2 Notation Table U.1 – Variable field length notation example Table U.1bis – Table U.1 in equivalent full notation U.1.3 Annex references U.2 TEMI access unit and TEMI elementary stream Table U.2 – TEMI access unit U.3 AF descriptors U.3.1 Introduction Table U.3 – AF descriptor tags U.3.2 Location descriptor Table U.4 – TEMI location descriptor U.3.3 Semantic definition of fields in location descriptor Table U.5 – TEMI URL scheme types Table U.6 – TEMI service types U.3.4 Base URL descriptor Table U.7 – TEMI base URL descriptor U.3.5 Semantic definition of fields in Base URL descriptor U.3.6 Timeline descriptor Table U.8 – TEMI timeline descriptor U.3.7 Semantic definition of fields in the timeline descriptor MTi = timescale * (PTSi - PTS0) / 90000 + MT0 NTPi = (PTSi - PTS0) / 90000.0 + NTP0 PTPi = (PTSi - PTS0) / 90000.0 + PTP0 U.3.8 MPEG-H 3dAudio extStreamID descriptor Table U.9 – TEMI MPEG-H_3dAudio_extStreamID descriptor U.3.9 Semantic definition of fields in AF_MPEG-H_3dAudio extStreamID descriptor U.3.10 MPEG-H_3dAudio multi-stream and command descriptors U.3.11 Boundary descriptor Figure U.1 – Stream partitioning into 2 and 5 second segments U.3.12 Semantic definition of fields in boundary descriptor Table U.11 – sequence_number_length_code interpretation U.3.13 Labelling Descriptor Table U.12 – Labelling Descriptor U.3.14 Semantic definition of fields in labelling descriptor Table U.13 – label_length_code interpretation Table U.14 – label_type values U.3.15 HEVC tile substream af_descriptor Table U.15 – HEVC tile substream af_descriptor V.1 Introduction Figure V.1 – Illustration of HEVC tiled encoding of panoramic content beyond UHD V.2 HEVC tile substream identification example Figure V.2 – Example of HEVC tile substream identification V.3 Subregion layout example Figure V.3 – Example of subregion layout for a 3 x 3 RoI