Page 128 - Trust in ICT 2017
lifespan of an Internet page is less than one month. To preserve the accumulated information in a knowledge
society, one of the possible solutions is to utilize electronic capturing devices and cloud computing storage
on the web. Archiving data in a cloud computing system raises the problem of indexing files. Using the
uniform resource locator (URL) of a web document as the key, successive versions of the same document should be
ordered by date of release. Digital storage is not limited by time, geography, culture or format. Its content may be
culture-specific but remains potentially accessible to every person in the world. The new storage technologies permit
important advances in the accessibility and manageability of knowledge. The digital content itself is
subject to some degree of standardization, which helps to avoid problems of incompatible formats.
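The indexing idea described above, keying archived copies by URL and ordering them by release date, can be illustrated with a minimal sketch. The class name `VersionIndex`, the storage keys such as `blob-001`, and the example URL are hypothetical and are not taken from the source; a real archive would also need persistent storage.

```python
from bisect import insort
from collections import defaultdict
from datetime import date


class VersionIndex:
    """Maps each URL to its archived versions, ordered by release date."""

    def __init__(self):
        # url -> sorted list of (release_date, storage_key) pairs
        self._versions = defaultdict(list)

    def add(self, url, release_date, storage_key):
        # insort keeps the list ordered even if versions arrive out of order
        insort(self._versions[url], (release_date, storage_key))

    def history(self, url):
        """Return all known versions of a URL, oldest first."""
        return list(self._versions.get(url, []))

    def latest_as_of(self, url, day):
        """Return the storage key of the newest version released on or before `day`."""
        keys = [key for released, key in self._versions.get(url, []) if released <= day]
        return keys[-1] if keys else None


index = VersionIndex()
index.add("http://example.org/report", date(2016, 5, 1), "blob-002")
index.add("http://example.org/report", date(2015, 1, 10), "blob-001")
```

Calling `index.latest_as_of("http://example.org/report", date(2015, 12, 31))` would then look up the copy that was current at the end of 2015.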
Data classification and filtering
In the era of zettabytes, digital data should be well arranged, sorted, and prepared for searching, filtering,
grouping and classification. Data classification is the process of organizing data into categories for effective
and efficient use. A well-planned data classification makes essential data easy to find and retrieve. This can
be of particular importance for access and search. The relevant procedures for data classification should
define the categories and criteria that people will use to classify data. Once a data-classification scheme has been
created, appropriate handling procedures for each category should be defined in line with the data's life
cycle requirements. It is essential that data classification is closely linked with data categories. Data
classification clusters data sets through an iterative process of categorization. New data sets can be
categorized by new classification rules derived from knowledge and intelligence. The effectiveness of data classification
is measured by predictive accuracy, speed of sorting and clustering, scalability on large amounts of data, and
robustness of data quality.
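As an illustration only, a classification scheme of this kind can be sketched as a list of categories with criteria, a handling procedure per category, and a predictive-accuracy measure. All category names, criteria, record fields and handling procedures below are hypothetical assumptions, not part of the source.

```python
# Hypothetical categories and criteria, checked in order; the first match wins.
RULES = [
    ("confidential", lambda rec: rec.get("contains_personal_data", False)),
    ("internal", lambda rec: rec.get("owner") == "staff"),
]
DEFAULT_CATEGORY = "public"

# Hypothetical handling procedure for each category over the data life cycle.
HANDLING = {
    "confidential": "encrypt at rest, restrict access, delete after retention period",
    "internal": "restrict to organization, archive after use",
    "public": "publish, keep indefinitely",
}


def classify(record):
    """Assign a record to the first category whose criterion it satisfies."""
    for category, criterion in RULES:
        if criterion(record):
            return category
    return DEFAULT_CATEGORY


def accuracy(records, expected):
    """Fraction of records whose predicted category matches the expected label."""
    hits = sum(classify(rec) == label for rec, label in zip(records, expected))
    return hits / len(expected)


records = [
    {"contains_personal_data": True, "owner": "staff"},
    {"owner": "staff"},
    {"owner": "visitor"},
    {"owner": "staff"},
]
# Manually assigned labels; the last one deliberately disagrees with the rules.
labels = ["confidential", "internal", "public", "public"]
```

Comparing `accuracy(records, labels)` before and after a rule change is one simple way to judge whether a revised scheme classifies better.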
In scientific and engineering fields, data classification raises the issue of assigning new observations to
existing categories of knowledge. It can be regarded as a form of research, analysis, and learning. It
involves grouping data into categories based on a measure of inherent similarity. Data clustering for
pattern recognition over large amounts of statistical data from images and speech is used to identify the member
of the possible classes with the highest probability. A probabilistic algorithm uses statistical inference to find the
best instance. In experimental and statistical analysis, data classification is done with logistic regression or a
similar procedure. New observations from experimental results are used to create new categories of
possible values or outcomes.
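To make the idea of choosing the class with the highest probability concrete, the following is a minimal sketch of two-class logistic regression fitted by plain gradient descent. The one-dimensional toy data, the learning rate and the number of epochs are assumptions chosen for illustration; a practical analysis would normally rely on an established statistics library.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.1, epochs=3000):
    """Fit weight w and bias b of a one-feature logistic model by gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Prediction errors for the current parameters
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def predict(w, b, x):
    """Return the most probable class for x and the probability of each class."""
    p1 = sigmoid(w * x + b)
    probs = {0: 1.0 - p1, 1: p1}
    return max(probs, key=probs.get), probs


# Toy observations (assumed): small values belong to class 0, large values to class 1.
xs = [0.5, 1.0, 1.5, 3.5, 4.0, 4.5]
ys = [0, 0, 0, 1, 1, 1]
w, b = fit_logistic(xs, ys)
label_low, _ = predict(w, b, 0.8)
label_high, _ = predict(w, b, 4.2)
```

Here a new observation is assigned to whichever class the fitted model considers more probable, which is the decision rule the paragraph above describes.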
Meaning of hyperlink, linked data, and linked open data
The outstanding difference of the web page compared with other plain documents is the hyperlink, which
points to a specific web page or to a specific element within a document [19]. The hyperlink is used to link
information to any other information over the Internet. It is integral to the creation of the World Wide Web.
Web pages are written in the hypertext markup language (HTML). Hypertext is text with hyperlinks.
A hyperlink is a reference to data that the reader can directly follow by clicking. Users
navigate or browse web pages by following hyperlinks. On a web page, most hyperlinks cause the
target document to replace the document being displayed. The effect of a hyperlink may vary with the
hypertext system. The most common destination anchor of a link is a uniform resource
locator (URL) used in the World Wide Web. A URL can refer to a document or to a position within a
document; the latter is achieved by means of an HTML element with a "name" or
"id" attribute at that position in the HTML document. A web browser usually displays a hyperlink in some distinguishing way,
e.g. in a different colour, font or style. The behaviour and style of links can be specified using the cascading
style sheets (CSS) language.
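The relationship between hyperlinks, URLs and "id" or "name" anchors can be sketched with Python's standard `html.parser` and `urllib.parse` modules. The class name `LinkCollector`, the sample page and the base URL are hypothetical and serve only as an illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkCollector(HTMLParser):
    """Collects hyperlink targets and the anchors a page defines."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []       # (absolute URL without fragment, fragment) pairs
        self.anchors = set()  # values of "id" attributes and <a name="..."> attributes

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            # Resolve relative references against the page's own URL,
            # then separate the document part from the in-page position.
            absolute = urljoin(self.base_url, attrs["href"])
            url, fragment = urldefrag(absolute)
            self.links.append((url, fragment))
        if attrs.get("id"):
            self.anchors.add(attrs["id"])
        if tag == "a" and attrs.get("name"):
            self.anchors.add(attrs["name"])


PAGE = """
<h1 id="top">Report</h1>
<p>See <a href="#summary">the summary</a> or
<a href="/archive/2016.html#intro">the 2016 archive</a>.</p>
<h2 id="summary">Summary</h2>
"""

collector = LinkCollector("http://example.org/docs/report.html")
collector.feed(PAGE)
```

In this sketch the first link points to a position inside the same document (the element with id "summary"), while the second points to a position in a different document on the same host.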
In the graphical user interface of a web browser, hyperlinks are typically displayed in underlined blue text when
they have not been visited, and in underlined purple text when they have been visited. When
the user activates the hyperlink (e.g. by clicking on it with the mouse), the browser will display the target of
the link. If the target is not an HTML file, depending on the file type and on the browser and its plug-ins,
another program may be activated to open the file. The document containing a hyperlink is known as
its source document. For example, in an online reference work such as Wikipedia, many words and
terms in the text are hyperlinked to definitions of those terms. Hyperlinks are often used to implement
reference mechanisms, such as tables of contents, footnotes, bibliographies, indexes, letters, and glossaries.