Back to Search Start Over

Using Shape to Index and Query Web Document Contents

Authors :
FERRI, FERNANDO
GRIFONI, PATRIZIA
PADULA, MARCO
Source :
Journal of Visual Languages & Computing. Aug2002, Vol. 13 Issue 4, p355. 19p.
Publication Year :
2002

Abstract

The mass of information now available on web sites has greatly increased the web''s popularity and, consequently, the demand for its convenient access and use. The InternetIntranet phenomenon provides some 50 million people with access to multiple sources of information. However, the current techniques for retrieving and navigating do not make it easy for the user to satisfy his/her needs. This paper aims to show how user communities can be enabled to unlock the information stored in web documents. We start from the observation that authors shape their documents to clarify the intended meaning, and readers in turn exploit document shape to synthetically grasp this meaning. Thus, the web document can be seen as a structure composed of different types of information units (such as images, tables, movies, videos, sounds, titles and paragraphs). These units are shaped, represented and organized in the document so that it transmits its message according to the cultural formation of its author. From our investigation of collections of web documents we have derived some heuristics to use in shaping the document in order to emphasize its content. These heuristics can be exploited to manage and retrieve semantic information on the web.Since human computer interaction with the web is preponderantly visual, we propose a visual approach in customizing web documents, and in indexing and querying the Web through the browser. This approach is based on a method of annotating HTML documents so that their shape and contents can be reorganized to satisfy the requirements of different readers. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
1045926X
Volume :
13
Issue :
4
Database :
Academic Search Index
Journal :
Journal of Visual Languages & Computing
Publication Type :
Academic Journal
Accession number :
8518023
Full Text :
https://doi.org/10.1006/jvlc.2002.0221