Page 49 - Crowdsourcing AI and Machine Learning solutions for SDGs - ITU AI/ML Challenges 2024 Report
P. 49

Crowdsourcing AI and Machine Learning solutions for SDGs



               •    Temporal: Data may not be sensitive now but may become sensitive in the future due to
                    changes in context, such as shifts in policies and/or safety of specific populations.
               •    Relational: One dataset on its own may not be sensitive, however, it could become
                    sensitive if analyzed in combination with other datasets.


               4       Classification

               Based on the risk assessment, classification must be carried out at the dataset level to identify
               which of the above data classification categories the data belongs to.

               5       Standards, metadata, and documentation

               For data sharing to be a success it is important that data are prepared in such a way that those
               using the dataset have a clear understanding of what the data mean so that they can be used
               appropriately. To enable this, data owners are encouraged to include with the dataset all the
               necessary information (metadata) describing the data and their format. This information should
               include such information as
               •    the methodology used to collect data
               •    definitions of variables
               •    units of measurement
               •    data format
               •    file type of the data
               •    any assumptions made


               6       Data Sharing Guidelines

               The figure below shows the steps to be considered when an entity (data owner) is planning to
               share data for the ITU AI/ML Challenge.








































                                                                                                     41
   44   45   46   47   48   49   50   51   52   53   54