1. Content Collection for the Labelling of Health-Related Web Content.
- Author
-
Carbonell, Jaime G., Siekmann, Jörg, Bellazzi, Riccardo, Abu-Hanna, Ameen, Hunter, Jim, Stamatakis, K., Metsis, V., Karkaletsis, V., Ruzicka, M., Svátek, V., Amigó, E., Pöllä, M., and Spyropoulos, C.
- Abstract
As the number of health-related web sites in various languages increases, so does the need for control mechanisms that give the users adequate guarantee on whether the web resources they are visiting meet a minimum level of quality standards. Based upon state-of-the-art technology in the areas of semantic web, content analysis and quality labelling, the MedIEQ project, integrates existing technologies and tests them in a novel application: the automation of the labelling process in health-related web content. MedIEQ provides tools that crawl the web to locate unlabelled health web resources, to label them according to pre-defined labelling criteria, as well as to monitor them. This paper focuses on content collection and discusses our experiments in the English language. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF