Page 87 - ITU KALEIDOSCOPE, ATLANTA 2019
P. 87

TOWARDS INTERNATIONAL STANDARDS FOR THE EVALUATION OF
                                   ARTIFICIAL INTELLIGENCE FOR HEALTH




                                                           1
                                             Markus A. Wenzel , Thomas Wiegand 1,2
                                       1 Fraunhofer Heinrich Hertz Institute, Berlin, Germany
                                         2 Technische Universität Berlin, Berlin, Germany


                              ABSTRACT                        and journal branches to digital health with a focus on ML and
                                                              AI [cf. 5, 6, 7], and the English National Health Service has
           Healthcare  can  benefit  considerably  from  advanced   created several guiding documents on AI in healthcare [8, 9,
           information  processing  technologies,  in  particular  from   10].
           machine  learning  (ML)  and  artificial  intelligence  (AI).
           However,  the  health  domain  only  hesitantly  adopts  these   The capacity of ML and AI for applications in healthcare is
           powerful  but  complex  innovations  so  far,  because  any   founded  upon  the  increasing  availability  of  digital  health
           technical  fault  can  affect  people’s  health,  privacy,  and   data,  which  can  be  used  to  train  and  to  apply  advanced
           consequently their entire lives. In this paper, we substantiate   models, on increasingly powerful computing infrastructure,
           that  international  standards  are  required  for  thoroughly   potentially accessible from around the globe via the Internet.
           validating  AI  solutions  for  health,  by  benchmarking  their   Available data include, e.g. radiology or microscope images,
           performance. These standards might ultimately create well-  free  text,  sensor  time  series,  lab  measurements,  and  other
           founded  trust  in  those  AI  solutions  that  have  provided   information stored in electronic health records. These data
           conclusive evidence to be accurate, effective and reliable.   enable  ML  and  AI  algorithms  to  learn  to perform  a  wide
           We  give  reasons  that  standardized  benchmarking  of  AI   range  of  recognition,  early-detection,  classification,
           solutions for health is a necessary complement of established   prediction,  image-segmentation  and  image-reconstruction
           assessment procedures. In particular, we demonstrate that it   tasks that only humans had been able to perform previously
           is  beneficial  to  tackle  this  topic  on  a  global  scale  and   and that can now be automated very fast and at large scale
           summarize  the  achievements  of  the  first  year  of  the   with computers.
           ITU/WHO focus group on “AI for Health” that has tasked
           itself to work towards creating these evaluation standards.   Medical image analysis is a large field of application for ML
                                                              methods [11, 12, 13]. For instance, ML models can recognize
               Keywords – Artificial intelligence, benchmarking,   lung  nodules  from  radiology  images  [14],  detect  malaria
                 evidence, ITU-T, machine learning, medicine,    from microscope images of blood samples [15], or classify
               standards, validation, World Health Organization   skin cancer based on dermoscopy images [16]. The models
                                                              can process electronic health records in order to categorize
                            1.  HEALTH AI                     patients, make clinical decisions, predict patient trajectories
                                                              [17] or forecast the outcome [18], e.g. in the intensive care
           Advanced information technologies from machine learning   unit [19, 20]. From the electrocardiogram, algorithms can
           (ML) and  artificial  intelligence  (AI)  have  been  attracting  detect myocardial infarction and heart arrhythmia [21, 22].
           much interest in the health domain lately. The recent global  In addition, AI methods not rooted in ML can contribute to
           strategy on digital health of the World Health Organization  healthcare:  Knowledge-based  expert  systems  can  ask
           (WHO) mentions AI as technology that has “the potential to  systematic questions about symptoms and advise on how to
           enhance  health  significantly”  [1].  Regulatory  bodies,  in  proceed,  e.g.  recommend  visiting  a  hospital  immediately
           particular  the  U. S.  Food  and  Drug  Administration,  are  [23].  Access  via  the  Internet  is  a  considerable  driver  for
           beginning  to  permit  AI-based  medical  devices  [2,  3],  and  health AI technology because it makes novel types of health
           start  the  discussion  on  the  regulation  of  continuously  delivery possible, as well as innovative business models, in
           learning  ML/AI-based  medical  software  [4].  Renowned  particular via mobile applications that can give expert advice
           medical and scientific journals are dedicating special issues  to virtually everyone, everywhere, e.g. based on text input,
                                                              camera  or  even  cellphone-based,  deep-learning-enhanced
                                                              microscopy [24].











           978-92-61-28401-5/CFP1968P-ART @ ITU 2019       – 67 –                                    Kaleidoscope
   82   83   84   85   86   87   88   89   90   91   92