Back to Search Start Over

The influence of pre-processing on the estimation of readability of web documents

Authors :
Bailey, J
Moffat, A
Palotti, Joao
Zuccon, Guido
Hanbury, Allan
Bailey, J
Moffat, A
Palotti, Joao
Zuccon, Guido
Hanbury, Allan
Source :
Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
Publication Year :
2015

Abstract

This paper investigates the effect that text pre-processing approaches have on the estimation of the readability of web pages. Readability has been highlighted as an important aspect of web search result personalisation in previous work. The most widely used text readability measures rely on surface level characteristics of text, such as the length of words and sentences. We demonstrate that different tools for extracting text from web pages lead to very different estimations of readability. This has an important implication for search engines because search result personalisation strategies that consider users reading ability may fail if incorrect text readability estimations are computed.

Details

Database :
OAIster
Journal :
Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
Notes :
application/pdf
Publication Type :
Electronic Resource
Accession number :
edsoai.on1146606407
Document Type :
Electronic Resource