1. MultiWiBi: The multilingual Wikipedia bitaxonomy project
- Author
-
Tiziano Flati, Tommaso Pasini, Daniele Vannella, and Roberto Navigli
- Subjects
Linguistics and Language ,taxonomy extraction ,Exploit ,Process (engineering) ,Computer science ,media_common.quotation_subject ,taxonomy induction ,machine learning ,natural language processing ,collaborative resources ,Wikipedia ,02 engineering and technology ,Language and Linguistics ,World Wide Web ,Artificial Intelligence ,020204 information systems ,Taxonomy (general) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Quality (business) ,media_common - Abstract
We present MultiWiBi, an approach to the automatic creation of two integrated taxonomies for Wikipedia pages and categories written in different languages. In order to create both taxonomies in an arbitrary language, we first build them in English and then project the two taxonomies to other languages automatically, without the help of language-specific resources or tools. The process crucially leverages a novel algorithm which exploits the information available in either one of the taxonomies to reinforce the creation of the other taxonomy. Our experiments show that the taxonomical information in MultiWiBi is characterized by a higher quality and coverage than state-of-the-art resources like DBpedia, YAGO, MENTA, WikiNet, LHD and WikiTaxonomy, also across languages. MultiWiBi is available online at http://wibitaxonomy.org/multiwibi .
- Published
- 2016