Back to Search
Start Over
The influence of pre-processing on the estimation of readability of web documents
- Source :
- Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
- Publication Year :
- 2015
-
Abstract
- This paper investigates the effect that text pre-processing approaches have on the estimation of the readability of web pages. Readability has been highlighted as an important aspect of web search result personalisation in previous work. The most widely used text readability measures rely on surface level characteristics of text, such as the length of words and sentences. We demonstrate that different tools for extracting text from web pages lead to very different estimations of readability. This has an important implication for search engines because search result personalisation strategies that consider users reading ability may fail if incorrect text readability estimations are computed.
Details
- Database :
- OAIster
- Journal :
- Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
- Notes :
- application/pdf
- Publication Type :
- Electronic Resource
- Accession number :
- edsoai.on1146606407
- Document Type :
- Electronic Resource