Page 135 - ITU Journal, ICT Discoveries, Volume 3, No. 1, June 2020 Special issue: The future of video and immersive media
P. 135

ITU Journal: ICT Discoveries, Vol. 3(1), June 2020



          6.4  L-1 filter                                      7.1.2 Commercial implementation
          The L-1 filter, whose concept is similar to a simple   A second set of tests has been performed using a
          deblocking filter, can be applied on the L-1 residuals   commercial  LCEVC  implementation  provided  by
          if the transform with the larger kernel size (4x4) is   V-Nova,  which  uses  as  a  base  codec  x264  for
          used.  The  residuals  on the  outer  boundary  of  the   H.264/AVC and x265 for H.265/HEVC. This set of
          transform  block  are  multilplied  with  a  coefficient   tests has been performed using a very slow preset
          between 0 and 1. The value of these coefficients can   for  both  encoders.  The  same  settings  have  been
          be signalled independently for edges and corners.    used when encoding x264/x265 at full resolution as
                                                               an anchor and at quarter resolution as a base codec
          6.5  Temporal prediction
                                                               for LCEVC.  The default  constant rate factor (CRF)
          The temporal prediction uses a zero-motion vector    quality  setting  was  used  for  both  x264  and  x265,
          prediction  with  a  temporal  buffer  which  stores   both at full resolution and at a quarter resolution.
          residuals  from  the  previous  frame  only.  The    7.2  Objective and subjective metrics results
          decision, where to use temporal prediction, is done
          on a transform block basis. Additionally, an entire   The performance results are shown using the two
          tile of 32x32 residuals can be signalled to be used   objective  metrics  Peak  Signal-to-Noise  Ratio
          without  temporal  prediction,  reducing  the        (PSNR) and Video Multimethod Assessment Fusion
          signalling overhead for, e.g., a fast moving part of   (VMAF)  [8][9].  The  latter  is  operated  using  the
          the sequence.                                        default model (v0.6.1) for HD sequences and the 4K
                                                               model (4k_v0.6.1) for UHD sequences.
          6.6  Entropy encoding
                                                               All  the  results  compare  a  full  resolution  video
          The two coefficient layers and the temporal layer    encoded  using  the  anchor  codec  (H.264/AVC  or
          are processed independently by an entropy encoder    H.265/HEVC)  against  a  full  resolution  video
          before  the  encapuslation  into  the  bitstream.  The   encoded using LCEVC. When comparing against an
          entropy   coding   process   consists   of   two     H.264/AVC  anchor,  the  LCEVC  would  use
          components:  a  Run  Length  Encoder  (RLE)  and  a   H.264/AVC  as  the  base  codec.  When  comparing
          Prefix Coding encoder. Aditionally, it is possible to   against  an H.265/HEVC  anchor, the  LCEVC would
          only  use  the  RLE  for  the  entire  data  within  a   use H.265/HEVC as the base codec.
          coefficient group.
                                                               Two video data sets have been used, Set A for HD
          7.   PERFORMANCE RESULTS                             and  Set  B  for  UHD  resolutions.  The  sequences  of
                                                               each data set are listed in Table 1. Each sequence
          In  order  to  show  that  LCEVC  is  a  low-complexity   has been tested at four different operating points.
          scheme enabling performance improvements over
          a given base codec, several tests were run on LCEVC                 Table 1 – Test sequences
          using H.264/AVC and H.265/HEVC as video coding
          standards for the base codec.                          Seq.    Sequence name     Resolution   Frame
                                                                                                         rate
          7.1  Experimental set-up                               A1         Campfire       3840x2160      30

          The tests have been performed using two different      A2       ParkRunning3     3840x2160      50
          software implementations.                              A3       FoodMarket4      3840x2160      60
                                                                 A4      Fortnite (Part 1)   3840x2160    60
          7.1.1 Reference implementation                         B1          Cactus        1920x1080      50
          A  first  set  of  tests  has  been  performed  using  the   B2   BasketballDrive   1920x1080   50
          LCEVC  reference  implementation  in  its  current     B3        RitualDance     1920x1080      60
          version LTM 4.0 [5]. LTM 4.0 uses as a base codec      B4    EuroTruck Simulator   1920x1080    60
          either  reference  implementation  JM  19.0  for     Table 2 provides the average coding performances
          H.264/AVC [6] or HM 16.18 for H.265/HEVC [7].        of  LCEVC  using  the  Bjøntegaard  metric  (BD-rate)
                                                               [10].  The  encodes  are  performed  using  the
                                                               reference   implementation    as   described   in
                                                               Section 7.1.1.






                                                © International Telecommunication Union, 2020                113
   130   131   132   133   134   135   136   137   138   139   140