Page 123 - ITU Journal, ICT Discoveries, Volume 3, No. 1, June 2020 Special issue: The future of video and immersive media
P. 123

ITU Journal: ICT Discoveries, Vol. 3(1), June 2020











                  A NEW SET OF DIRECTIONAL WEIGHTS FOR ITU-R BS.1770 LOUDNESS
                                 MEASUREMENT OF MULTICHANNEL AUDIO

                                       1
                                                     2
                                                                            3
                                                                2
                            Leandro Pires , Maurílio Vieira , Hani Yehia , Tim Brookes , Russell Mason 3
                                 1
                                  Agência Nacional de Telecomunicações, Belo Horizonte, Brazil
                                  2
                                   Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
                                3
                                 Institute of Sound Recording, University of Surrey, Guildford, UK
          Abstract – The ITU-R BS.1770 multichannel loudness algorithm performs a sum of channel energies with weighting co-
          efficients based on azimuth and elevation angles of arrival of the audio signal. In its current version, these coefficients
          were estimated based on binaural summation gains and not on subjective directional loudness. Also, the algorithm lacks
                                                      ◦
          directional weights for wider elevation angles (|φ| ≥ 30 ). A listening test with broadband stimuli was conducted to collect
          subjective data on directional effects. The results were used to calculate a new set of directional weights. A modified ver-
          sion of the loudness algorithm with these estimated weights was tested against its benchmark using the collected data, and
          using program material rendered to reproduction systems with different loudspeaker configurations. The modified algorithm
          performed better than the benchmark, particularly with reproduction systems with more loudspeakers positioned out of the
          horizontal plane.
          Keywords – Broadcasting, loudness, signal processing, spatial audio, subjective test.





          1.  INTRODUCTION                                    gles, the Rapporteur Group decided to keep simple posi-
                                                              tional weights envisioning backward compatible extension
          In recent years, audio signal normalization based on loud-  of ITU-R BS.1770-3 and, consequently, avoiding imple-
          ness became a regular practice and the loudness algorithm  mentation issues.
          described in Recommendation ITU-R BS.1770 is now
          ubiquitous in broadcasting and content producing work-
          flows. With the advent of object and scene-based sound  Table 1 – Position-dependent channel weightings in ITU-R
          systems, it is reasonable to wonder if a channel-based  BS.1770-4
          loudness measurement, originally designed for stereo and                     Azimuth (θ)
                                                                Elevation (φ)
          5.1 content, would still suit the needs of program level    ◦    |θ| < 60 ◦  60 ≤ |θ| ≤ 120 ◦  120 < |θ| ≤ 180 ◦
                                                                                     ◦
                                                                                                  ◦
          management and control.  After a hiatus, Rapporteur    |φ| < 30  1.00 (± 0 dB)  1.41 (+ 1.50 dB)  1.00 (± 0 dB)
                                                                   else               1.00 (± 0 dB)
          Group on loudness measurement algorithm (RG-32) was
          re-established in March 2019 to resume studies on mea-
          surements for object and scene-based audio in light of the  At that time, RG-32 conducted experiments with two dif-
          new Recommendation ITU-R BS.2127, which specifies the  ferent datasets using 5.1 and 22.1 loudspeaker configura-
          reference renderer for these immersive audio formats [1].  tions. The obtained Pearson correlation coefficients, de-
          Whether the algorithm will evolve to a format-based loud-  scribing the linearity of the relationship between mea-
          ness computation or remain multichannel, and be used in  surements and subjective results, were r = [0.817,0.902],
          the output of the renderer, is still to be determined.  which were lower than the correlation obtained in the
                                                              original tests of ITU-R BS.1770 (r = 0.977) [2, p.17].
          An attempt to address this issue was made in the latest ver-  Also, authors in [3] tested this set of weights using 5.1
          sion of ITU-R loudness model (BS.1770-4), in which the  and 7.1 systems with and without screen loudspeakers.
          positional weighting coefficients were extended to an unre-  From a common database comprised of object and channel-
          stricted number of channels [2]. The weighting coefficient  based content, listening tests were made in three test
          G i for an i-th position is derived from Table 1, which is a  sites and the observed correlation coefficients were r =
          generalization of the original 5.1 weighing scheme for the  [0.962,0.820,0.898], being the first coefficient the only
          horizontal plane, and no weighting is applied in broader  one on par with the correlation from the original ITU-R
                               ◦
          elevation angles (|φ| ≥ 30 ). Although RG-32 considered  tests. Thus, the question on how to account for directional
          differences of directional loudness in wider elevation an-  weighting in the ITU-R loudness model is still valid.



                                             © International Telecommunication Union, 2020                   101
   118   119   120   121   122   123   124   125   126   127   128