Page 118 - Big data - Concept and application for telecommunications

                           Figure 7-11 – Configuration of logical components for big data provenance
            The logical components shown in Figure 7-11 are as follows:

            –       provenance model management. This logical component manages provenance information
                    compatibility among different BDSPs. It validates provenance information received
                    from outside against the big data provenance model (see clause 7.2). Valid
                    provenance information is then encoded in a common model and delivered to the
                    provenance lifecycle management component for storage;
            –       provenance lifecycle management. This logical component records and deletes
                    provenance information as data is stored, updated and deleted (see clauses 7.3.1 and 7.3.2).
                    This logical component also supports retrieving provenance information (see clause 7.3.3);
            –       analysis support. This logical component extracts workflows from provenance information and
                    stores them. From the stored workflows, this logical component retrieves candidate analysis
                    workflows based on information about the BDSP's data analysis functions and data. For a
                    request for provenance information or a workflow from a different system (e.g., an external
                    BDSP), this logical component may check the adaptability of the computational environment
                    and map to an equivalent function for that system. This logical component also supports
                    automating the data analysis process based on data updates, adding user annotations to
                    provenance information, and managing the relationship between the BDSP's functions and data;
                    NOTE – Based on the relationship between functions and data in provenance information, it is
                    possible to query the list of available data with functions, and the list of functions applicable to the
                    data.
            –       provenance sharing policy management. This logical component manages multiple sharing policies
                    on provenance information. When exporting provenance information, a BDSP checks the sharing
                    policy and may simplify the provenance information before sending it to another BDSP;
            –       personally identifiable information (PII) management. This logical component checks whether a
                    data instance contains PII when recording a provenance unit. This logical component also
                    requests that the BDSP apply a protection mechanism to provenance information that includes PII;
            –       monitoring. This logical component monitors changes in the values of the computational
                    environment and the responsible party recorded in provenance information. When changes are
                    detected, this logical component updates those values.
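
            The following Python sketch illustrates three of the behaviours described above: the PII check
            made when a provenance unit is recorded, the simplification of provenance information under a
            sharing policy before export, and the two-way query between functions and data mentioned in the
            NOTE. It is a minimal illustration under stated assumptions, not part of the specification. The
            names ProvenanceUnit, PII_KEYS, contains_pii, simplify_for_export and RelationIndex, and all
            field names, are hypothetical; the actual provenance model is the one defined in clause 7.2.

```python
from collections import defaultdict
from dataclasses import asdict, dataclass, field

# Assumption: PII is detected by key name only. A real BDSP would use
# its own detection and protection mechanism.
PII_KEYS = {"name", "email", "phone"}


@dataclass
class ProvenanceUnit:
    """Hypothetical, simplified provenance unit (not the clause 7.2 model)."""
    data_id: str
    function_id: str
    responsible_party: str
    environment: dict = field(default_factory=dict)
    data_instance: dict = field(default_factory=dict)


def contains_pii(unit):
    """PII management: report whether the data instance has a PII-like key."""
    return any(key in PII_KEYS for key in unit.data_instance)


def simplify_for_export(unit, withheld_fields):
    """Sharing policy management: drop the fields a policy withholds."""
    return {k: v for k, v in asdict(unit).items() if k not in withheld_fields}


class RelationIndex:
    """Analysis support: relationship between functions and data."""

    def __init__(self):
        self._data_by_function = defaultdict(set)
        self._functions_by_data = defaultdict(set)

    def record(self, unit):
        self._data_by_function[unit.function_id].add(unit.data_id)
        self._functions_by_data[unit.data_id].add(unit.function_id)

    def data_for(self, function_id):
        """List the data that a given function has been applied to."""
        return sorted(self._data_by_function.get(function_id, ()))

    def functions_for(self, data_id):
        """List the functions that have been applied to given data."""
        return sorted(self._functions_by_data.get(data_id, ()))


# Example usage with made-up identifiers.
u1 = ProvenanceUnit("sales.csv", "clean", "alice",
                    {"python": "3.11"}, {"email": "a@example.org"})
u2 = ProvenanceUnit("sales.csv", "aggregate", "bob")
u3 = ProvenanceUnit("logs.json", "clean", "bob")

index = RelationIndex()
for unit in (u1, u2, u3):
    index.record(unit)

print(index.functions_for("sales.csv"))
print(index.data_for("clean"))
print(contains_pii(u1), contains_pii(u2))
print(simplify_for_export(u1, {"responsible_party", "data_instance"}))
```

            Keeping two mappings, one keyed by function and one keyed by data, lets either query in the NOTE
            be answered with a single lookup; a relational or graph store would serve the same purpose.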


            8       Functional requirements of big data provenance

            8.1     Provenance lifecycle requirements

            Provenance lifecycle requirements include:
            –       (provenance model description) It is required that BDSP supports the model for big data
                    provenance information;




            110      Static data – Data provenance, data formats and trust