Page 128 - Trust in ICT 2017
P. 128

2                                                    Trust in ICT


            lifespan of an Internet page is less than one month. To preserve the accumulated information in a knowledge
            society, one of the possible solutions is to utilize electronic capturing devices and cloud computing storage
            on the web. To archive data in the cloud computing system, there is the problem of indexing files. By the
            uniform resource locator (URL) of the web, the successive version of the same documents should be lined up
            with its date of release. Digital storage is unlimited by time, geography, culture or format. It may be culture-
            specific but remains potentially accessible to every person in the world. The new storage technologies permit
            important advances regarding the accessibility and manageability of knowledge. The digital content itself has
            a subject to some degree of standardization without problems of incompatible formats.

            Data classification and filtering
            In the era of zeta-bytes, digital data would be well arranged, sorted, and prepared for searching, filtering,
            grouping and classification. Data classification is the process of organizing data into categories for effective
            and efficient use. A well-planned data classification makes essential data easy to find and retrieve. This can
            be of particular importance for access and search. The relevant procedures for data classification should
            define what categories and criteria people will use to classify data. If a data-classification scheme has been
            created,  the  appropriate  handling  procedures  for  each  category  should  be  addressed  with  data's  life
            cyle requirements.  It  is  essential  that  data  classification is  closely  linked  with  data  categories.  Data
            classification  is  clustering  the  data  sets  by  an  iterative  process  of  data  category.  New  data  sets  can  be
            categorized by new classification rules of knowledge and intelligence. The effectiveness of data classification
            is measured by predictive accuracy, speed of sorting and clustering, scalability on large amounts of data, and
            robustness of data quality.
            In scientific and engineering fields, data classification raises the issues of identifying new observations from
            the existing categories of knowledge. It is considered as a kind of researching, analysing, and learning. It
            involves  grouping  data  into  categories  based  on  the  measure  of  inherent  similarity.  Data  clustering  for
            pattern recognition from a large amount of statistic data of images and speeches is used to identify a member
            of possible classes with the highest probability. Probabilistic algorithm with statistical inference is to find a
            best instance. In experimental and statistical analysis, data classification is done with logistic regression or a
            similar  procedure.  New  observations  on  experimental  results  are  referred  to  create  new  categories  of
            possible values or outcomes.

            Meaning of hyperlink, linked data, and linked open data
            The outstanding difference of the web page compared with other plain documents is the hyperlink, which
            points to a specific web page or to a specific element within a document [19]. The hyperlink is used to link
            information to any other information over the Internet. It is integral to the creation of the World Wide Web.
            Web pages are written in the hypertext markup language (HTML). Hypertext is the text with hyperlinks.
            The hyperlink is  a reference to data that  the  reader  can  directly  follow  by  clicking.  Users
            navigate or browse the web page following the hyperlinks. On the web page, most hyperlinks cause the
            target document to replace the document being displayed. The effect of the hyperlink may vary with the
            hypertext system. A link from one domain to another for a common destination anchor is a uniform resource
            locator (URL) used in the World Wide Web. It is achieved by means of an HTML element with a "name" or
            "id" attribute at the HTML document. A web browser usually displays a hyperlink in some distinguishing way,
            e.g. in a different colour, font or style. The behaviour and style of links can be specified using the cascading
            style sheets (CSS) language.
            In a graphical user interface of web browsers, the hyperlinks are displayed in underlined blue texts when
            they have not been visited, but are displayed in underlined purple texts when they have been visited. When
            the user activates the hyperlink (e.g. by clicking on it with the mouse), the browser will display the target of
            the link. If the target is not an HTML file, depending on the file type and on the browser and its plug-ins,
            another  program  may  be  activated  to  open  the  file.  The  document  containing  a  hyperlink  is  known  as
            its source code document. For example, in an online reference work such as Wikipedia, many words and
            terms in the text are hyperlinked to definitions of those terms. Hyperlinks are often used to implement
            reference mechanisms, such as tables of contents, footnotes, bibliographies, indexes, letters, and glossaries.





            120
   123   124   125   126   127   128   129   130   131   132   133