Page 83 - Big data - Concept and application for telecommunications



            8.1     Requirements for data registration and cataloguing

            Data registration and cataloguing requirements include:
            1)      it is required that DP:data broker provides a common data catalogue schema which covers various
                    types of data;
                    NOTE 1 – A common data catalogue includes data type, format, size, category, metadata information
                    and its URI, delivery mechanism, update frequency, electronic access methods such as API. This
                    catalogue may include data rights, license policy, price, quality of data, aggregation information,
                    pre-processing information, usage of data, related keywords, and sample data.
            2)      it is required that DP:data broker provides data catalogue registration mechanisms to DP:data
                    supplier;
                    NOTE 2 – Registration mechanisms are a user interface or an API for registration, offered in the
                    form of an open API.
            3)      it is recommended that DP:data supplier supports automatic extraction of the information from data
                    in order to provide the associated metadata to be included in data catalogues;
            4)      it is recommended that DP:data broker provides notifications of newly registered metadata and
                    manages subscriptions to such notifications;
                    NOTE 3 – To improve data distribution and utilization, DP:data broker notifies BDSP of newly
                    registered data as part of the task of publishing data.
            5)      it is required that DP:data broker supports data classification with commonly used data vocabulary
                    and taxonomy;

            6)      DP:data broker can optionally support multiple application domain specific vocabularies and
                    taxonomies for a single source of data;
            7)      it is recommended that DP:data broker supports multiple data classifications by its areas of use;

            8)      it is required that DP:data broker supports the publication of data specifications to BDSPs;
                    NOTE 4 – A data specification is used for manipulating data from storage or streaming data in an
                    on-demand manner (e.g., triggered when a user is requesting data). It includes information about
                    the source of data, the process of generating data, selling policies for data, etc.
            9)      it is recommended that DP:data broker performs verification of data before publishing these data.

                    NOTE 5 – The DP:data supplier registers metadata to the data catalogue of the DP:data broker to
                    make the data available for distribution.
                    NOTE 6 – For the verification phase, the DP:data broker may request additional information
                    about the data, as well as sample data, from the DP:data supplier.
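
            Requirements 1), 2) and 4) above can be illustrated with a minimal Python sketch. It is not part of
            the requirements and does not describe any particular implementation. The class names
            CatalogueEntry and DataBroker, the field names, and the example URN are hypothetical; the fields
            are a subset of the items listed in NOTE 1, and the in-memory dictionary only stands in for a real
            catalogue store.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class CatalogueEntry:
    """One record of a common data catalogue (hypothetical field names, after NOTE 1)."""
    uri: str
    data_type: str
    data_format: str
    size_bytes: int
    category: str
    delivery_mechanism: str
    update_frequency: str
    # Items that NOTE 1 says the catalogue "may include".
    license_policy: str = ""
    price: float = 0.0
    keywords: List[str] = field(default_factory=list)


class DataBroker:
    """Hypothetical in-memory stand-in for the DP:data broker."""

    def __init__(self) -> None:
        self.catalogue: Dict[str, CatalogueEntry] = {}
        self.subscribers: List[Callable[[CatalogueEntry], None]] = []

    def subscribe(self, callback: Callable[[CatalogueEntry], None]) -> None:
        # Requirement 4: manage subscriptions to registration notifications.
        self.subscribers.append(callback)

    def register(self, entry: CatalogueEntry) -> None:
        # Requirement 2: registration mechanism offered to the DP:data supplier.
        if not entry.uri:
            raise ValueError("a catalogue entry needs a URI")
        self.catalogue[entry.uri] = entry
        # Requirement 4: notify subscribers of the newly registered metadata.
        for notify in self.subscribers:
            notify(entry)


broker = DataBroker()
received: List[str] = []
broker.subscribe(lambda entry: received.append(entry.uri))

broker.register(
    CatalogueEntry(
        uri="urn:example:cell-traffic",  # made-up identifier
        data_type="time series",
        data_format="CSV",
        size_bytes=1_000_000,
        category="network",
        delivery_mechanism="API",
        update_frequency="hourly",
        keywords=["traffic", "mobile"],
    )
)
```

            In this sketch a BDSP would be represented by the callback passed to subscribe(); after the call to
            register(), the list received holds the URI of the new entry.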

            8.2     Requirements for data retrieval

            Data retrieval requirements include:
            1)      it is required that DP:data broker provides the BDSP with an interface to access data catalogues;
                    NOTE 1 – This interface may support a user-specific taxonomy (e.g., a Web Ontology Language
                    file) to extend a keyword search.
            2)      it is required that DP:data broker provides metadata searching functionalities to BDSP;
                    NOTE 2 – Examples of search methods are keyword search and directory search.
                    NOTE 3 – Search results can be listed by registered date, price, sales ranking, and data
                    supplier's credit according to the metadata schema.
            3)      it is recommended that DP:data broker recommends data to BDSP based on multiple criteria;
                    NOTE 4 – The criteria include price, discount rate, application category, etc.

            4)      it is recommended that DP:data broker supports best-match searching against a data request
                    from BDSP.
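
            The retrieval requirements above can be sketched as follows, purely as an illustration. The
            Metadata class, its fields, and the sample records are hypothetical. The text does not define how
            a best match is computed; the sketch assumes Jaccard similarity between the keywords of a request
            and the keywords of each catalogue entry, which is only one possible choice.

```python
from dataclasses import dataclass
from typing import List, Set


@dataclass
class Metadata:
    """Hypothetical searchable metadata record."""
    name: str
    keywords: Set[str]
    price: float
    registered: str  # ISO 8601 date, so string order equals date order


def keyword_search(catalogue: List[Metadata], keyword: str) -> List[Metadata]:
    """Return the entries whose keyword set contains the given keyword."""
    kw = keyword.lower()
    return [m for m in catalogue if kw in m.keywords]


def best_match(catalogue: List[Metadata], requested: Set[str]) -> List[Metadata]:
    """Rank entries by Jaccard similarity to the requested keywords (assumed metric)."""
    def score(m: Metadata) -> float:
        union = m.keywords | requested
        return len(m.keywords & requested) / len(union) if union else 0.0

    ranked = sorted(catalogue, key=score, reverse=True)
    # Drop entries that share no keyword with the request.
    return [m for m in ranked if score(m) > 0]


# Made-up sample records.
catalogue = [
    Metadata("cell-traffic", {"traffic", "mobile", "hourly"}, 120.0, "2023-04-01"),
    Metadata("roaming-stats", {"roaming", "mobile"}, 80.0, "2023-05-10"),
    Metadata("weather-feed", {"weather", "hourly"}, 15.0, "2023-03-20"),
]

# Keyword search, with results listed by price (one of the orderings in NOTE 3).
by_price = sorted(keyword_search(catalogue, "mobile"), key=lambda m: m.price)

# Best-match search for a request described by two keywords.
matches = best_match(catalogue, {"mobile", "traffic"})
```

            Listing by registered date would use key=lambda m: m.registered instead. Orderings such as sales
            ranking or supplier's credit would need additional fields in the metadata schema.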




                                                            Moving data – Data exchange and data flow      75