Page 46 - ITU Journal, ICT Discoveries, Volume 3, No. 1, June 2020 Special issue: The future of video and immersive media
P. 46

ITU Journal: ICT Discoveries, Vol. 3(1), June 2020



          [9]   W. Kim and C. Kim (2009). A New Approach for   [19]  H. Pang, C. Zhang, F. Wang J. Liu and L. Sun
               Overlay  Text  Detection  and  Extraction  From       (2019).  Towards  Low  Latency  Multi-
               Complex  Video  Scene,  IEEE  Transactions  on        viewpoint     360°    Interactive    Video:
               Image Processing, 18(2), 401-411.                     A Multimodal  Deep  Reinforcement  Learning
          [10]  S.  Lee  and  K.  Jo  (2017).  Automatic  Person     Approach,  IEEE  Conference  on  Computer
               Information Extraction Using Overlay Text in          Communications, Paris, France, 991-999.
               Television News Interview Videos, IEEE 15th     [20]   M. Salmimaa, J. Kimmel, T. Jokela, P. Eskolin,
               International  Conference  on  Industrial             T. Järvenpää,  P.  Piippo,  K.  Mu ̈ller  and
               Informatics, Emden, Germany, 583-588.                 J. Satopää   (2018).   Live   delivery   of
          [11]  T.  Nakatsuru,  Y.  Yokokohji,  D.  Eto  and         neurosurgical  operating  theater  experience
               T. Yoshikawa  (2003).  Image  Overlay  on             in  virtual  reality,  Journal  of  the  Society  for
               Optical  See-through  Displays  for  Vehicle          Information Display, 26(2), 98-104.
               Navigation,  IEEE/ACM  2nd  International       [21]  S.  Mate,  I.D.D.  Curcio,  A.  Eronen  and
               Symposium on Mixed and Augmented Reality,             A. Lehtiniemi  (2015).  Automatic  Multi-
               Tokyo, Japan.                                         Camera Remix from Single Video, ACM 30th
          [12]  Y. Yokokohji, Y. Sugawara and T. Yoshikawa           Symposium      on    Applied    Computing,
               (2000). Accurate Image Overlay on Video See-          Salamanca, Spain, 1270-1277.
               Through     HMDs     Using    Vision   and      [22]  I.D.D. Curcio, H. Toukomaa and D. Naik (2018).
               Accelerometers,    IEEE   Virtual   Reality           360-Degree  Video  Streaming  and  its
               Conference, New Brunswick, NJ, U.S.A.                 Subjective  Quality,  SMPTE  Motion  Imaging
                                                                     Journal, 127(7), 28-38.
          [13]  D. Madden, A. Scanlon, Y. Zhou, T.E. Choe and
               M. Smith (2014). Real Time Video Overlays,      [23]  ISO/IEC  23009-1,  Information  technology  –
               ACM SIGGRAPH, Vancouver, BC, Canada.                  Dynamic  adaptive  streaming  over  HTTP
                                                                     (DASH)  –  Part  1:  Media  presentation
          [14]  J.  Guo,  T.  Mei,  F.  Liu  and  X.-S.  Hua  (2009).   description and segment formats.
               AdOn:    An   Intelligent   Overlay   Video
               Advertising System, ACM 32nd International      [24]  ISO/IEC  23008-1,  Information  technology  –
               Conference on Research and Development of             High efficiency coding and media delivery in
               Information Retrieval, Boston, MA, U.S.A.             heterogeneous environments – Part 1: MPEG
                                                                     media transport (MMT).
          [15]  S. Vihavainen, S. Mate, L. Seppälä, F. Cricri’ and
               I.D.D. Curcio (2011). We Want More: Human-      [25]  ISO/IEC 14496-12, Information technology –
               Computer  Collaboration  in  Mobile  Social           Coding of audio-visual objects – Part 12: ISO
               Video  Remixing  of  Music  Concerts,  ACM            base media file format.
               Conference  on  Human  Factors  in  Computer    [26]  ISO/IEC  23008-3,  Information  technology  –
               Systems (CHI), Vancouver, Canada.                     High efficiency coding and media delivery in
          [16]  S. Vihavainen, S. Mate, L. Liikkanen and I.D.D.      heterogeneous  environments  –  Part  3:  3D
                                                                     audio.
               Curcio  (2012).  Video  as  Memorabilia:  User
               Needs  for  Collaborative  Automatic  Mobile    [27]  ISO/IEC  14496-3,  Information  technology  –
               Video Production, ACM Conference on Human             Coding  of  audio-visual  objects  –  Part  3:
               Factors  in  Computer  Systems  (CHI),  Austin,       Advanced audio coding.
               TX, U.S.A.                                      [28]  ISO/IEC  23008-2,  Information  technology  –
          [17]  S.  Mate  and  I.D.D.  Curcio  (2017).  Automatic    High efficiency coding and media delivery in
               Video      Remixing      Systems,     IEEE            heterogeneous  environments  –  Part  2:  High
               Communications Magazine, 55(1), 180-187.              efficiency video coding.
          [18]  X.  Corbillon,  F.  De  Simone,  G.  Simon  and   [29]  ISO/IEC 14496-10, Information technology –
               P. Frossard   (2018).   Dynamic   Adaptive            Coding  of  audio-visual  objects  –  Part  10:
               Streaming        for       Multi-Viewpoint            Advanced video coding.
               Omnidirectional  Video,  ACM  Multimedia        [30]  ISO/IEC     10918-1:1994,      Information
               Systems       Conference,      Amsterdam,             technology – Digital compression and coding
               The Netherlands, 237-249.                             of  continuous-tone  still  images  –  Part  1:
                                                                     Requirements and guidelines.






          24                                    © International Telecommunication Union, 2020
   41   42   43   44   45   46   47   48   49   50   51