Back to Search Start Over

Text segmentation with topic modeling and entity coherence

Authors :
Adebayo Kolawole John
Guido Boella
Luigi Di Caro
Abraham, A
Haqiq, A
Alimi, AM
Mezzour, G
Rokbani, N
Muda, AK
Adebayo Kolawole, John
Di Caro, Luigi
Boella, Guido
Source :
Advances in Intelligent Systems and Computing ISBN: 9783319529400, HIS, Proceedings of the 16th International Conference on Hybrid Intelligent Systems (HIS 2016), Advances in Intelligent Systems and Computing, Advances in Intelligent Systems and Computing-Proceedings of the 16th International Conference on Hybrid Intelligent Systems (HIS 2016)
Publication Year :
2016
Publisher :
Springer Verlag, 2016.

Abstract

This paper describes a system which uses entity and topic coherence for improved Text Segmentation (TS) accuracy. First, the Linear Dirichlet Allocation (LDA) algorithm was used to obtain topics for sentences in the document. We then performed entity mapping across a window in order to discover the transition of entities within sentences. We used the information obtained to support our LDA-based boundary detection for proper boundary adjustment. We report the significance of the entity coherence approach as well as the superiority of our algorithm over existing work.

Details

Language :
English
ISBN :
978-3-319-52940-0
978-3-319-52941-7
ISSN :
21945357 and 21945365
ISBNs :
9783319529400 and 9783319529417
Database :
OpenAIRE
Journal :
Advances in Intelligent Systems and Computing ISBN: 9783319529400, HIS, Proceedings of the 16th International Conference on Hybrid Intelligent Systems (HIS 2016), Advances in Intelligent Systems and Computing, Advances in Intelligent Systems and Computing-Proceedings of the 16th International Conference on Hybrid Intelligent Systems (HIS 2016)
Accession number :
edsair.doi.dedup.....daf0e7b1aae4594bc559ac617ae7ac38