Page 158 - Proceedings of the 2018 ITU Kaleidoscope
P. 158

2018 ITU Kaleidoscope Academic Conference




                                                              periodic  images  but  with  the  great  disadvantage  that  they
                                                              have  low  resolution,  which  makes  them  useless  for
                                                              identifying  patterns  in  small  buildings.  Companies  like
                                                              DigitalGlobe offer images of very good resolution, but at a
                                                              high  cost.  Due  to  the  nature  of  the  project,  the  volume
                                                              required is very high as well as the cost to obtain them. It
                                                              would  be  almost  impossible  to  face  both  situations
                                                              considering the academic nature of this project.

                                                              A  consortium  of  companies  including  DigitalGlobe,
                                                              CosmiQ  Works  and  NVIDIA  launched  SpaceNet  [10]  in
                                                              2016,  an  online  repository  of  satellite  images  and  co-
                                                              registered  map  data  to  train  algorithms.  It  is  a  corpus  of
               Figure 3 – Flow of input data until its final output   commercial  satellite  imagery  and  labeled  training  data  to
                                                              use  for  machine  learning  research.  The  corpus  of  images
           Finally,  the  wanted  output  is  achieved.  The  remaining   with  an  excellent  resolution  (they  are  satellite  images  of
           details have to do simply with non-functional requirements,   0.3m),  is  not  only  available  for  access,  but  it  also  has
           such as the way in which the output will be presented to the   geolocated labels to delimit  buildings, perfect  for training
           users  so  that  they  could  evaluate  it  and  give  it,  or  not,   the  algorithm.  What  is  even  better,  the  corpus  is  divided
           utility.  The  mockup  in  Figure  4  represents  the  proposed   into two volumes: the first, aimed to train algorithms -about
           graphic user interface. It shows the number of buildings and   40GB-, and the second, to perform validation tests  -about
           square  meters  detected  in  the  same  coordinates  but  at   20GB-.  Last  but  not  least,  there  are  different  image
           different times                                    alternatives, such as RGB images, panchromatic images or
                                                              8 multispectral bands. The package includes images of four
                                                              cities:  Las  Vegas,  Paris,  Shanghai  and  Khartoum,  which
                                                              allows us to test the algorithm in various urban structures.

                                                              All of the above represents an almost perfect combo for the
                                                              project  needs.  It  is  almost  perfect  because  although  it
                                                              allows training the algorithm and tests its accuracy it does
                                                              not  have  images  at  different  times  from  the  same
                                                              geographical  place  so  that,  in  the  final  output,  we  can
                                                              obtain the expected results. That is why, for the final testing
                                                              stage,  it  was  decided  to  generate  changes  in  the  satellite
                                                              images artificially to visualize in the final output the way in
                                                              which  the  changes  produced  are  detected.  This  decision,
                                                              although not ideal, allowed us to continue with the project
                                                              without  major  consequences  since,  although  the  scenario
                                                              we  were  generating  was  fictitious,  it  was  sufficiently
                                                              representative of reality to be useful.

              Figure 4 –Example of graphic user interface model.   4.  DEEP LEARNING BUILDING DETECTION
                                                                                    MODEL
                        3.  DATASET DETAIL
                                                              For  the  detection  building  model,  two  options  were
           To have a successful result, a big challenge was to find a   possible: to create a new model or reusing models already
           large  volume  of  input  data  with  a  high  resolution.  This   evaluated and with a useful function for the project scope.
           allows training the algorithm in such a way that a parameter   Both  options  have  advantages  and  disadvantages.  For
           configuration with an acceptable precision for the expected   example, lacking experience in these issues, the making of
           ranges could be obtained.                          a new model, implied to start down on the learning curve. It
                                                              is  necessary  to  check  continuously  if  the  architecture  is
           It  is  necessary  to  know  that  the  search  is  not  particularly   correct  and,  if  not,  proceeds  to  accordingly  correct  the
           simple,  considering  that  the  higher  resolution  images  are   errors.  On  the  other  hand,  when  reusing  models  with  a
           not available for free. Free access to satellite images only   checked  architecture  and  a  solution  within  limits  of
                                           1
           occurs  in  satellites  such  as  LandSat ,  which  generates   tolerable  errors  is  good,  but  implies  this  solution  to  be
                                                              adapted to the needs of the present project.

                                                              The  SpaceNet  algorithms  [11]  which  are  in  the  public
           1    The Landsat Program is a series of Earth-observing satellites   repository  and  with  a  free-use  license  were  used
             co-managed  by  USGS  &  NASA  and  offers  the  longest   considering that it was the most appropriate solution for this
             continuous space-based record of Earth's surface.



                                                          – 142 –
   153   154   155   156   157   158   159   160   161   162   163