Back to Search Start Over

Generating descriptive multi-document summaries of geo-located entities using entity type models

Authors :
Ahmet Aker
Robert Gaizauskas
Source :
Journal of the Association for Information Science and Technology. 66:721-738
Publication Year :
2014
Publisher :
Wiley, 2014.

Abstract

In this article, we investigate the application of entity type models in extractive multi-document summarization using automatic caption generation for images of geo-located entities e.g., Westminsteri¾?Abbey as an application scenario. Entity type models contain sets of patterns aiming to capture the ways geo-located entities are described in natural language. They are automatically derived from texts about geo-located entities of the same type e.g., churches, lakes. We integrate entity type models into a multi-document summarizer and use them to address the 2 major tasks in extractive multi-document summarization: sentence scoring and summary composition. We experiment with 3 different representation methods for entity type models: signature words, n-gram language models, and dependency patterns. We evaluate the summarizer with integrated entity type models relative to a a summarizer using standard text-related features commonly used in text summarization and b the Wikipedia location descriptions. Our results show that entity type models significantly improve the quality of output summaries over that of summaries generated using standard summarization features and Wikipedia summaries. The representation of entity type models using dependency patterns is superior to the representations using signature words and n-gram language models.

Details

ISSN :
23301635
Volume :
66
Database :
OpenAIRE
Journal :
Journal of the Association for Information Science and Technology
Accession number :
edsair.doi...........5ba9995d0fd8eca84ea383bc60a51546