1,950 results on '"entity linking"'
Search Results
2. XLORE 3: A Large-Scale Multilingual Knowledge Graph from Heterogeneous Wiki Knowledge Resources.
- Author
-
Zeng, Kaisheng, Jin, Hailong, Lv, Xin, Zhu, Fangwei, Hou, Lei, Zhang, Yi, Pang, Fan, Qi, Yu, Liu, Dingxiao, Li, Juanzi, and Feng, Ling
- Abstract
The article introduces focuses on advancements in XLORE 3, a multilingual knowledge graph built to integrate and manage knowledge from diverse resources. Topics include the creation of an extensive XLORE ontology for managing entities across languages, the merging of equivalent entities for enhanced knowledge sharing, and a multi-strategy framework for knowledge completion using pre-trained language models.
- Published
- 2024
- Full Text
- View/download PDF
3. Entity Linking Model Based on Cascading Attention and Dynamic Graph.
- Author
-
Li, Hongchan, Li, Chunlei, Sun, Zhongchuan, and Zhu, Haodong
- Subjects
RANDOM walks ,DEEP learning ,KNOWLEDGE base ,ENTROPY - Abstract
The purpose of entity linking is to connect entity mentions in text to real entities in the knowledge base. Existing methods focus on using the text topic, entity type, linking order, and association between entities to obtain the target entities. Although these methods have achieved good results, they ignore the exploration of candidate entities, leading to insufficient semantic information among entities. In addition, the implicit relationship and discrimination within the candidate entities also affect the accuracy of entity linking. To address these problems, we introduce information about candidate entities from Wikipedia and construct a graph model to capture implicit dependencies between different entity decisions. Specifically, we propose a cascade attention mechanism and develop a novel local entity linkage model termed CAM-LEL. This model leverages the interaction between entity mentions and candidate entities to enhance the semantic representation of entities. Furthermore, a global entity linkage model termed DG-GEL based on a dynamic graph is established to construct an entity association graph, and a random walking algorithm and entity entropy are used to extract the implicit relationships within entities to increase the differentiation between entities. Experimental results and in-depth analyses of multiple datasets show that our model outperforms other state-of-the-art models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Evolution of topics and trends in emerging research fields: multiple analyses with entity linking, Mann–Kendall test and burst methods in cloud computing.
- Author
-
Coccia, Mario and Roshani, Saeed
- Abstract
The principal goal of this study is to analyze the evolution of topics and trends in emerging research fields by a combination of entity linking, Mann–Kendall test, and burst detection techniques. Multiple methods are applied here in the emerging field of cloud computing by focusing on the frequency of critical topics from 2004 to 2021. Statistical analysis reveals that the Internet of Things exhibits a significant scientific growth compared to other subject areas within the research field of cloud computing. Other emerging topics with rapid growth are computer networks, encryption, big data, distributed computing, and interaction of cloud computing with virtual machine research. The combination of different techniques can better show the complex dynamics and complementary aspects of scientific topics and trends underlying evolutionary pathways in emerging fields, such as the science and technology advances of architecture, hardware, and software components in the field of cloud computing. In scientometrics, the analysis with multiple techniques provides comprehensive scientific and technological information driving new directions in the evolution of research fields to guide R&D investments towards growing topics and technologies having the potential of supporting fruitful scientific and technological change. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. MBJELEL: An End-to-End Knowledge Graph Entity Linking Method Applied to Civil Aviation Emergencies
- Author
-
Jiayi Qu, Jintao Wang, Zuyi Zhao, and Xingguo Chen
- Subjects
Civil aviation emergencies ,Entity linking ,End-to-end federated coding ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Abstract Aviation emergency management is playing a more and more important role in the aviation field. How to make effective use of massive heterogeneous and multi-source aviation accident knowledge has become a great challenge for aviation emergency management. Aiming at the problems such as too long physical length, mixed and composite entities, similar character of domain entity names, information difference between entities, separation of codes between entities, coding errors during transmission, etc., the construction method of knowledge map of civil aviation emergencies is studied. In previous research methods, entity link is always divided into two parts, that is, first detection and then disambiguation, which makes the mentioned entity and the candidate entity are encoded separately, and there is error transmission between the two parts, modules cannot communicate with each other, and the close association between entities cannot be well learned. In this paper, we proposed an end-to-end entity linking method based on two-layer BiLSTM model joint coding vectorize each word of civil aviation text information, and then concatenate feature vectors into two-layer BiLSTM model to obtain high-level context representation. Because the joint encoding of boundary information can reduce the error transmission, information is exchanged between candidate entities during the initial encoding to enhance the closeness between candidate entities and candidate entities. The experimental results show that compared with other sota models, the F1 value of the proposed model reaches 88.97%.
- Published
- 2024
- Full Text
- View/download PDF
6. Leveraging Medical Knowledge Graphs and Large Language Models for Enhanced Mental Disorder Information Extraction.
- Author
-
Park, Chaelim, Lee, Hayoung, and Jeong, Ok-ran
- Subjects
LANGUAGE models ,MENTAL health services ,KNOWLEDGE graphs ,MENTAL illness ,DATA mining ,DATA extraction - Abstract
The accurate diagnosis and effective treatment of mental health disorders such as depression remain challenging owing to the complex underlying causes and varied symptomatology. Traditional information extraction methods struggle to adapt to evolving diagnostic criteria such as the Diagnostic and Statistical Manual of Mental Disorders fifth edition (DSM-5) and to contextualize rich patient data effectively. This study proposes a novel approach for enhancing information extraction from mental health data by integrating medical knowledge graphs and large language models (LLMs). Our method leverages the structured organization of knowledge graphs specifically designed for the rich domain of mental health, combined with the powerful predictive capabilities and zero-shot learning abilities of LLMs. This research enhances the quality of knowledge graphs through entity linking and demonstrates superiority over traditional information extraction techniques, making a significant contribution to the field of mental health. It enables a more fine-grained analysis of the data and the development of new applications. Our approach redefines the manner in which mental health data are extracted and utilized. By integrating these insights with existing healthcare applications, the groundwork is laid for the development of real-time patient monitoring systems. The performance evaluation of this knowledge graph highlights its effectiveness and reliability, indicating significant advancements in automating medical data processing and depression management. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Entity linking for English and other languages: a survey.
- Author
-
Guellil, Imane, Garcia-Dominguez, Antonio, Lewis, Peter R., Hussain, Shakeel, and Smith, Geoffrey
- Subjects
ENGLISH language ,MACHINE translating ,DOMINANT language ,SENTIMENT analysis ,DATA mining - Abstract
Extracting named entities text forms the basis for many crucial tasks such as information retrieval and extraction, machine translation, opinion mining, sentiment analysis and question answering. This paper presents a survey of the research literature on named entity linking, including named entity recognition and disambiguation. We present 200 works by focusing on 43 papers (5 surveys and 38 research works). We also describe and classify 56 resources, including 25 tools and 31 corpora. We focus on the most recent papers, where more than 95% of the described research works are after 2015. To show the efficiency of our construction methodology and the importance of this state of the art, we compare it to other surveys presented in the research literature, which were based on different criteria (such as the domain, novelty and presented models and resources). We also present a set of open issues (including the dominance of the English language in the proposed studies and the frequent use of NER rather than the end-to-end systems proposing NED and EL) related to entity linking based on the research questions that this survey aims to answer. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. NEREL: a Russian information extraction dataset with rich annotation for nested entities, relations, and wikidata entity links.
- Author
-
Loukachevitch, Natalia, Artemova, Ekaterina, Batura, Tatiana, Braslavski, Pavel, Ivanov, Vladimir, Manandhar, Suresh, Pugachev, Alexander, Rozhkov, Igor, Shelmanov, Artem, Tutubalina, Elena, and Yandutov, Alexey
- Subjects
- *
DATA mining , *ANNOTATIONS , *NEWS websites - Abstract
This paper describes NEREL—a Russian news dataset suited for three tasks: nested named entity recognition, relation extraction, and entity linking. Compared to flat entities, nested named entities provide a richer and more complete annotation while also increasing the coverage of relations annotation and entity linking. Relations between nested named entities may cross entity boundaries to connect to shorter entities nested within longer ones, which makes it harder to detect such relations. NEREL is currently the largest Russian dataset annotated with entities and relations: it comprises 29 named entity types and 49 relation types. At the time of writing, the dataset contains 56 K named entities and 39 K relations annotated in 933 person-oriented news articles. NEREL is annotated with relations at three levels: (1) within nested named entities, (2) within sentences, and (3) with relations crossing sentence boundaries. We provide benchmark evaluation of current state-of-the-art methods in all three tasks. The dataset is freely available at https://github.com/nerel-ds/NEREL. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. MESS: Coarse-Grained Modular Two-Way Dialogue Entity Linking Framework
- Author
-
Qi, Pengnian, Zha, Zhiyuan, Qin, Biao, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Bifet, Albert, editor, Davis, Jesse, editor, Krilavičius, Tomas, editor, Kull, Meelis, editor, Ntoutsi, Eirini, editor, and Žliobaitė, Indrė, editor
- Published
- 2024
- Full Text
- View/download PDF
10. Entity Annotation with Wikipedia Using Neural Networks
- Author
-
Szymański, Julian, Olewniczak, Szymon, Piotrowski, Mateusz, Pont, Maria Teresa Signes, Mora, Higinio, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Saeed, Khalid, editor, and Dvorský, Jiří, editor
- Published
- 2024
- Full Text
- View/download PDF
11. Matching Tabular Data to Knowledge Graph Based on Multi-level Scoring Filters for Table Entity Disambiguation
- Author
-
Li, Xinhe, Jiang, Chenghuan, Wang, Peng, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Zhang, Wenjie, editor, Tung, Anthony, editor, Zheng, Zhonglong, editor, Yang, Zhengyi, editor, Wang, Xiaoyang, editor, and Guo, Hongjie, editor
- Published
- 2024
- Full Text
- View/download PDF
12. Medical Concept Normalization
- Author
-
Xu, Hua, Demner Fushman, Dina, Hong, Na, Raja, Kalpana, Patel, Vimla L., Series Editor, Xu, Hua, editor, and Demner Fushman, Dina, editor
- Published
- 2024
- Full Text
- View/download PDF
13. Knowledge-augmented Methods for Natural Language Understanding
- Author
-
Jiang, Meng, Lin, Bill Yuchen, Wang, Shuohang, Xu, Yichong, Yu, Wenhao, Zhu, Chenguang, Jiang, Meng, Lin, Bill Yuchen, Wang, Shuohang, Xu, Yichong, Yu, Wenhao, and Zhu, Chenguang
- Published
- 2024
- Full Text
- View/download PDF
14. Multi-source Autoregressive Entity Linking Based on Generative Method
- Author
-
Yang, Dongju, Lan, Weishui, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Sun, Yuqing, editor, Lu, Tun, editor, Wang, Tong, editor, Fan, Hongfei, editor, Liu, Dongning, editor, and Du, Bowen, editor
- Published
- 2024
- Full Text
- View/download PDF
15. -MG: End-to-End Expert Linking via Multi-Granularity Representation Learning
- Author
-
Zha, Zhiyuan, Qi, Pengnian, Bao, Xigang, Qin, Biao, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Luo, Biao, editor, Cheng, Long, editor, Wu, Zheng-Guang, editor, Li, Hongyi, editor, and Li, Chaojie, editor
- Published
- 2024
- Full Text
- View/download PDF
16. Entity and Relation Linking for Knowledge Graph Question Answering Using Gradual Searching
- Author
-
Adila Alfa Krisnadhi, Mohammad Yani, and Indra Budi
- Subjects
entity linking ,relation linking ,kgqa system ,knowledge graph ,Engineering (General). Civil engineering (General) ,TA1-2040 - Abstract
Knowledge graph question answering (KGQA) systems have an important role in retrieving data from a knowledge graph (KG). With the system, regular users can access data from a KG without the need to construct a formal SPARQL query. KGQA systems receive a natural language question (NLQ) and translate it into a SPARQL query through three main tasks, namely, entity and relation detection, entity and relation linking, and query construction. However, the translation is not trivial due to lexical gaps and entity ambiguity that may occur during entity or relation linking. This research proposed an approach based on multiclass classification of NLQ whose entity occurrences are detected into categories based on KG relations to address the lexical gap challenge. Next, to solve the entity ambiguity challenge, this research proposed a three-stage searching procedure to determine appropriate KG entities associated with the NLQ entities, given the correspondence between the NLQ and a particular KG relation. This three-stage searching consisted of text-based searching, vector-based searching, and entity and relation pairing. The proposed approach was evaluated on the SimpleQuestions and LC-QuAD 2.0 datasets. The experiments demonstrated that the proposed approach outperformed the state-of-the-art baseline. For the relation linking task, the proposed approach reached 89.87% and 74.83% recall for the SimpleQuestions and LC-QuAD 2.0, respectively. This approach also achieved 91.74% and 61.96% recall on the entity linking tasks for the SimpleQuestions and LC-QuAD 2.0, respectively.
- Published
- 2024
- Full Text
- View/download PDF
17. 医学领域知识融合研究进展.
- Author
-
彭琳, 宋珺, 熊玲珠, 杜建强, 叶青, and 刘安栋
- Abstract
Copyright of Journal of Computer Engineering & Applications is the property of Beijing Journal of Computer Engineering & Applications Journal Co Ltd. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2024
- Full Text
- View/download PDF
18. A Survey of Knowledge Graph Construction Using Machine Learning.
- Author
-
Zhigang Zhao, Xiong Luo, Maojian Chen, and Ling Ma
- Abstract
Knowledge graph (KG) serves as a specialized semantic network that encapsulates intricate relationships among real-world entities within a structured framework. This framework facilitates a transformation in information retrieval, transitioning it from mere string matching to far more sophisticated entity matching. In this transformative process, the advancement of artificial intelligence and intelligent information services is invigorated. Meanwhile, the role of machine learningmethod in the construction of KG is important, and these techniques have already achieved initial success. This article embarks on a comprehensive journey through the last strides in the field of KG via machine learning. With a profound amalgamation of cutting-edge research in machine learning, this article undertakes a systematical exploration of KG construction methods in three distinct phases: entity learning, ontology learning, and knowledge reasoning. Especially, a meticulous dissection of machine learningdriven algorithms is conducted, spotlighting their contributions to critical facets such as entity extraction, relation extraction, entity linking, and link prediction. Moreover, this article also provides an analysis of the unresolved challenges and emerging trajectories that beckon within the expansive application of machine learning-fueled, large-scale KG construction. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Managing Personal Identifiable Information in Data Lakes
- Author
-
Drazen Orescanin, Tomislav Hlupic, and Boris Vrdoljak
- Subjects
Data lake ,personal identifiable information metadata ,personal data ,data discovery ,entity linking ,data removal ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Privacy is a fundamental human right according to the Universal Declaration of Human Rights of the United Nations. Adoption of the General Data Protection Regulation (GDPR) in European Union in 2018 was turning point in management of personal data, specifically personal identifiable information (PII). Although there were many previous privacy laws in existence before, GDPR has brought privacy topic in the regulatory spotlight. Two most important novelties are seven basic principles related to processing of personal data and huge fines defined for violation of the regulation. Many other countries have followed the EU with the adoption of similar legislation. Personal data management processes in companies, especially in analytical systems and Data Lakes, must comply with the regulatory requirements. In Data Lakes, there are no standard architectures or solutions for the need to discover personal identifiable information, match data about the same person from different sources, or remove expired personal data. It is necessary to upgrade the existing Data Lake architectures and metadata models to support these functionalities. The goal is to study the current Data Lake architecture and metadata models and to propose enhancements to improve the collection, discovery, storage, processing, and removal of personal identifiable information. In this paper, a new metadata model that supports the handling of personal identifiable information in a Data Lake is proposed.
- Published
- 2024
- Full Text
- View/download PDF
20. Entity normalization in a Spanish medical corpus using a UMLS-based lexicon: findings and limitations
- Author
-
Báez, Pablo, Campillos-Llanos, Leonardo, Núñez, Fredy, and Dunstan, Jocelyn
- Published
- 2024
- Full Text
- View/download PDF
21. Path-Breaking Directions in Quantum Computing Technology: A Patent Analysis with Multiple Techniques
- Author
-
Coccia, Mario and Roshani, Saeed
- Published
- 2024
- Full Text
- View/download PDF
22. Generalization performance optimization of KBQA system for Chinese open domain.
- Author
-
Chen, Yang, Wan, Weibing, Zhao, Yuming, and Huang, Bo
- Abstract
Knowledge-Based Question Answering (KBQA) is a technique that utilizes the rich semantic information present in knowledge bases to comprehensively understand questions and obtain answers. The mainstream approaches consist of two methods: Semantic Parsing-Based (SP-based) and Information Retrieval-Based (IR-based). The former converts the question into a logical form that can be understood and executed by machines through semantic analysis, and then queries the knowledge base for answers. The latter first identifies the topic entity in the question and retrieves candidate answers, and then extracts features from both the question and candidate answers. Finally, a ranking model is used to model and predict the question and candidate answers. Compared to the impressive results achieved by English KBQA systems, Chinese KBQA systems face challenges due to the sparse semantic expression and limited features of the Chinese knowledge base, as well as the large number of similar entities that are difficult to differentiate. This makes it difficult for general models to properly understand the text's characteristics, resulting in a challenge to improve the accuracy of Entity Linking and to maximize the performance of the KBQA system. To address this, this paper proposes two steps to improve Entity Linking in the KBQA system: Candidate Generation (CG) and Entity Disambiguation (ED), with a focus on realizing Entity Disambiguation. In this paper, Entity Disambiguation is treated as a classification task, and a Dual-Channel Network Model based on Bi-LSTM and CNN is constructed. By combining different featuresextracted from Bi-LSTM and CNN, this paper also introduces an attention mechanism to fully explore the weak semantic relationship between the question answering system and candidate entity, effectively reducing the reliance of the question answering system on additional feature rules. Experimental results show that the Entity Linking model proposed in this paper can effectively improve the performance of the question and answer system, has strong generalization, weakens dependence on additional information, and ensures the quality of Q &A while reducing manual intervention. Our method has achieved the current best average F1 value in the Chinese open domain datasets NLPCC-2016KBQA and CCKS2019KBQA. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Multi-perspective thought navigation for source-free entity linking.
- Author
-
Peng, Bohua, He, Wei, Chen, Bin, Villavicencio, Aline, and Wu, Chengfu
- Subjects
- *
LANGUAGE models , *NAVIGATION , *DATA mining - Abstract
Neural entity-linking models excel at bridging the lexical gap of multiple facets of facts, such as entity-related claims or evidence documents. Despite advancements in self-supervised learning and pretrained language models, challenges persist in entity linking, particularly in interpretability and transferability. Moreover, these models need many aligned documents to adapt to emerging entities, which may not be available due to data scarcity. In this work, we propose a novel Demonstrative Self-TrAining fRamework (D-STAR) that leverages multi-perspective thought navigation. D-STAR iteratively optimizes a question generator and an entity retriever by navigating thoughts on a dynamic graph reasoning across multiple perspectives for question generation. The generated question–answer pairs, along with hard negatives shared in the graph, enable adaptation with minimal computational overhead. Additionally, we introduce a new task, source-free entity linking, focusing on unsupervised transfer learning without direct access to original domain data. To demonstrate the feasibility of this task, we provide a generated question–answering dataset, FandomWiki , for novel entities. Our experiments show that D-STAR significantly improves baselines on SciFact, Zeshel, and FandomWiki. • Adapting a pretrained entity linking model for entity queries of emerging domains. • Dynamically prompting a large language model with multiperspective thought navigation algorithm. • Harnessing topological data to mine hard negatives in contrastive learning. • Conceptualizing source-free entity linking. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Improving entity linking by combining semantic entity embeddings and cross-attention encoder.
- Author
-
Li, Shi and Zhang, Yongkang
- Subjects
- *
KNOWLEDGE graphs , *CONTEXTUAL learning , *INFORMATION retrieval - Abstract
Entity linking is an important task for information retrieval and knowledge graph construction. Most existing methods use a bi-encoder structure to encode mentions and entities in the same space, and learn contextual features for entity linking. However, this type of system still faces some problems: (1) the entity embedding part of the model only learns from the local context of the target entity, which is too unique for entity linking model to learn the context commonality of information; (2) the entity disambiguation part only uses similarity calculation once to determine the target entity, resulting in insufficient interaction between the mentions and candidate entities, and ineffective recall of real entities. We propose a new entity linking model based on graph neural network. Different from other bi-encoder retrieval systems, this paper introduces a fine-grained semantic enhancement information into the entity embedding part of the bi-encoder to reduce the specificity of the model. Then, the cross-attention encoder is used to re-rank the target mention and each candidate entity after the entity retrieval model. Experimental results show that although the model is not optimal in inference speed, it outperforms all baseline methods on the AIDA-CoNLL dataset, and has good generalization effects on four datasets in different fields such as MSNBC and ACE2004. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. Joint linking of entity and relation for question answering over knowledge graph.
- Author
-
Li, Huiying, Yu, Wenqi, and Dai, Xinbang
- Abstract
Entity linking and relation linking are two crucial components in many question answering systems over knowledge graphs, which aim to identify the relevant entity or relation mentions in a question and link them to the target entity or relation in the knowledge graph. Previous studies mostly solve these two tasks independently or as sequential tasks, which usually leads to poor performance since the short texts in most questions lack the context information needed for disambiguation. In this paper, we propose an approach to jointly perform entity linking and relation linking. The idea is to exploit both the independent and joint features of the candidates for disambiguation, which captures different characteristics when the knowledge graph information and the semantics of the question are both considered. We evaluated our approach on the QALD-7 and LC-QuAD datasets and the experimental results shows that our approach significantly outperforms the existing entity and relation linking approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
26. Extraction of data events from the computational biology literature
- Author
-
Albahlal, Manal, Nenadic, Goran, and Stevens, Robert
- Subjects
microarray analysis literature ,Machine learning ,text mining ,natural language processing ,entity linking ,normalisation ,named entity recognition ,attention-based BiLSTM ,relation extraction ,database ,BioBERT ,Methods section ,workflow ,tools ,software ,operation ,process ,data ,data events ,methods - Abstract
With the current rate of research activities, it is widely accepted that scientists face a challenge of keeping up-to-date with new findings, even within a sub-field of a discipline. This difficulty extends to methods that have been used in the research. Understanding reported methods gives us confidence that the findings have resulted from an appropriate, rigorous and sound scientific process. However, the modern dynamic of science is also characterised with ever-changing methods, so scientists need to be able to learn about new ones and identify the common or most appropriate methods to use in a given situation. One of the best sources of information about methods is the scientific literature. In this thesis, we developed a computational model to automatically represent the text that describes reported methods as an abstract method workflow. We focus on computational sciences, which centre on data processing. Specifically, we consider data events as a representation of processes and changes that happen to data. A data event contains the main components of each step in computational experiments, such as input/output data, processes and operations on data, databases where the data is stored and software and tools that are used in these processes. An abstract method workflow then models relationships between data events, ordering them in a way that represents the methodology as reported in the literature. This thesis introduces ODNoRFlow, a text mining method that extracts and represents an abstract method workflow from a Methods section of a publication. It relies on a hybrid text mining approach (ODNoR) that combines machine learning and a rule-based method to recognise data event components, normalise them to existing ontologies and identify the links and relations between them. Specifically, we fine-tuned a pre-trained transformer model (BioBERT) to extract mentions of data and operations, and used an existing named entity recognition system (bioNerDS) to extract software and database mentions. Mentions were normalised to the EDAM ontology. We used a combination of syntactic rules and a pre-trained attention-based BiLSTM model to identify relations and links between components, and considered whether an automated discourse analysis tool can be used to improve the outcomes. We used the microarray analysis literature as a case study to demonstrate the feasibility of the proposed approaches. At the data event level, the approach achieved F-scores for the identification and normalisation of components between 78% (for data) and 92% (for operations), whereas the relationship extraction F-scores were between 62% and 92.5%. At the workflow level, we manually analysed automatically reconstructed workflows from 25 papers, with the F-score between 61% and 93.5%. We also applied ODNoRFlow to a large corpus of the microarray analysis literature to identify and analyse the distribution of data events components, the differences in their usage and the associations between them. Overall, the thesis provides a new computational framework that contributes to the automated extraction, representation and analysis of methods used in the computational biology literature.
- Published
- 2022
27. Integration of Knowledge Bases and External Information Sources via Magic Properties and Query-Driven Entity Linking
- Author
-
Ohmori, Yuuki, Kitagawa, Hiroyuki, Amagasa, Toshiyuki, Matono, Akiyoshi, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Delir Haghighi, Pari, editor, Pardede, Eric, editor, Dobbie, Gillian, editor, Yogarajan, Vithya, editor, ER, Ngurah Agus Sanjaya, editor, Kotsis, Gabriele, editor, and Khalil, Ismail, editor
- Published
- 2023
- Full Text
- View/download PDF
28. Overview of NLPCC 2023 Shared Task 6: Chinese Few-Shot and Zero-Shot Entity Linking
- Author
-
Xu, Zhenran, Shan, Zifei, Hu, Baotian, Zhang, Min, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Liu, Fei, editor, Duan, Nan, editor, Xu, Qingting, editor, and Hong, Yu, editor
- Published
- 2023
- Full Text
- View/download PDF
29. Improving Few-Shot and Zero-Shot Entity Linking with Coarse-to-Fine Lexicon-Based Retriever
- Author
-
Huang, Shijue, Wang, Bingbing, Qin, Libo, Zhao, Qin, Xu, Ruifeng, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Liu, Fei, editor, Duan, Nan, editor, Xu, Qingting, editor, and Hong, Yu, editor
- Published
- 2023
- Full Text
- View/download PDF
30. Semantic Candidate Retrieval for Few-Shot Entity Linking
- Author
-
Chen, Jianyong, Liu, Jiangming, Wang, Jin, Zhang, Xuejie, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Liu, Fei, editor, Duan, Nan, editor, Xu, Qingting, editor, and Hong, Yu, editor
- Published
- 2023
- Full Text
- View/download PDF
31. ERNIE-AT-CEL: A Chinese Few-Shot Emerging Entity Linking Model Based on ERNIE and Adversarial Training
- Author
-
Zhou, Hongyu, Sun, Chengjie, Lin, Lei, Shan, Lili, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Liu, Fei, editor, Duan, Nan, editor, Xu, Qingting, editor, and Hong, Yu, editor
- Published
- 2023
- Full Text
- View/download PDF
32. Collective Entity Linking with Joint Subgraphs
- Author
-
Wang, Kedong, Xia, Yu, Kong, Fang, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Liu, Fei, editor, Duan, Nan, editor, Xu, Qingting, editor, and Hong, Yu, editor
- Published
- 2023
- Full Text
- View/download PDF
33. Cross-Lingual Candidate Retrieval and Re-ranking for Biomedical Entity Linking
- Author
-
Borchert, Florian, Llorca, Ignacio, Schapranow, Matthieu-P., Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Arampatzis, Avi, editor, Kanoulas, Evangelos, editor, Tsikrika, Theodora, editor, Vrochidis, Stefanos, editor, Giachanou, Anastasia, editor, Li, Dan, editor, Aliannejadi, Mohammad, editor, Vlachos, Michalis, editor, Faggioli, Guglielmo, editor, and Ferro, Nicola, editor
- Published
- 2023
- Full Text
- View/download PDF
34. Graph-Enriched Biomedical Entity Representation Transformer
- Author
-
Sakhovskiy, Andrey, Semenova, Natalia, Kadurin, Artur, Tutubalina, Elena, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Arampatzis, Avi, editor, Kanoulas, Evangelos, editor, Tsikrika, Theodora, editor, Vrochidis, Stefanos, editor, Giachanou, Anastasia, editor, Li, Dan, editor, Aliannejadi, Mohammad, editor, Vlachos, Michalis, editor, Faggioli, Guglielmo, editor, and Ferro, Nicola, editor
- Published
- 2023
- Full Text
- View/download PDF
35. DAMGCN: Entity Linking in Visually Rich Documents with Dependency-Aware Multimodal Graph Convolutional Network
- Author
-
Chen, Yi-Ming, Hou, Xiang-Ting, Lou, Dong-Fang, Liao, Zhi-Lin, Liu, Cheng-Lin, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Fink, Gernot A., editor, Jain, Rajiv, editor, Kise, Koichi, editor, and Zanibbi, Richard, editor
- Published
- 2023
- Full Text
- View/download PDF
36. Linking Scholarly Datasets—The EOSC Perspective
- Author
-
Wolski, Marcin, Klorek, Antoni, Mazurek, Cezary, Kobusińska, Anna, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Mikyška, Jiří, editor, de Mulatier, Clélia, editor, Paszynski, Maciej, editor, Krzhizhanovskaya, Valeria V., editor, Dongarra, Jack J., editor, and Sloot, Peter M.A., editor
- Published
- 2023
- Full Text
- View/download PDF
37. Entity Linking for KGQA Using AMR Graphs
- Author
-
Steinmetz, Nadine, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Pesquita, Catia, editor, Jimenez-Ruiz, Ernesto, editor, McCusker, Jamie, editor, Faria, Daniel, editor, Dragoni, Mauro, editor, Dimou, Anastasia, editor, Troncy, Raphael, editor, and Hertling, Sven, editor
- Published
- 2023
- Full Text
- View/download PDF
38. Class-Dynamic and Hierarchy-Constrained Network for Entity Linking
- Author
-
Wang, Kehang, Liu, Qi, Zhang, Kai, Liu, Ye, Tao, Hanqing, Huang, Zhenya, Chen, Enhong, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Wang, Xin, editor, Sapino, Maria Luisa, editor, Han, Wook-Shin, editor, El Abbadi, Amr, editor, Dobbie, Gill, editor, Feng, Zhiyong, editor, Shao, Yingxiao, editor, and Yin, Hongzhi, editor
- Published
- 2023
- Full Text
- View/download PDF
39. Connecting the Content of Books to the Web and the Real World
- Author
-
Cristea, Dan, Pistol, Ionuț, Gîfu, Daniela, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, and Gelbukh, Alexander, editor
- Published
- 2023
- Full Text
- View/download PDF
40. : Experience Enhanced Entity Linking for Question Answering Over Knowledge Graphs
- Author
-
Hou, Zhirong, Wang, Meiling, Li, Min, Li, Ying, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Li, Bohan, editor, Yue, Lin, editor, Tao, Chuanqi, editor, Han, Xuming, editor, Calvanese, Diego, editor, and Amagasa, Toshiyuki, editor
- Published
- 2023
- Full Text
- View/download PDF
41. Text Classification Models for Form Entity Linking
- Author
-
Villota, María, Domínguez, César, Heras, Jónathan, Mata, Eloy, Pascual, Vico, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Omatu, Sigeru, editor, Mehmood, Rashid, editor, Sitek, Pawel, editor, Cicerone, Serafino, editor, and Rodríguez, Sara, editor
- Published
- 2023
- Full Text
- View/download PDF
42. MBJELEL: An End-to-End Knowledge Graph Entity Linking Method Applied to Civil Aviation Emergencies
- Author
-
Qu, Jiayi, Wang, Jintao, Zhao, Zuyi, and Chen, Xingguo
- Published
- 2024
- Full Text
- View/download PDF
43. Multilingual bi‐encoder models for biomedical entity linking.
- Author
-
Guven, Zekeriya Anil and Lamurias, Andre
- Subjects
- *
NATURAL language processing , *LANGUAGE models , *SENTIMENT analysis , *SPAM email , *TASK analysis , *DATA analysis - Abstract
Natural language processing (NLP) is a field of study that focuses on data analysis on texts with certain methods. NLP includes tasks such as sentiment analysis, spam detection, entity linking, and question answering, to name a few. Entity linking is an NLP task that is used to map mentions specified in the text to the entities of a Knowledge Base. In this study, we analysed the efficacy of bi‐encoder entity linking models for multilingual biomedical texts. Using surface‐based, approximate nearest neighbour search and embedding approaches during the candidate generation phase, accuracy, and recall values were measured on language representation models such as BERT, SapBERT, BioBERT, and RoBERTa according to language and domain. The proposed entity linking framework was analysed on the BC5CDR and Cantemist datasets for English and Spanish, respectively. The framework achieved 76.75% accuracy for the BC5CDR and 60.19% for the Cantemist. In addition, the proposed framework was compared with previous studies. The results highlight the challenges that come with domain‐specific multilingual datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
44. Multi-task entity linking with supervision from a taxonomy.
- Author
-
Wang, Xuwu, Chen, Lihan, Zhu, Wei, Ni, Yuan, Xie, Guotong, Yang, Deqing, and Xiao, Yanghua
- Subjects
KNOWLEDGE graphs ,NONLINEAR programming ,INTEGER programming ,TAXONOMY ,SUPERVISION - Abstract
Entity linking is the task of resolving ambiguous mentions in documents to their referent entities in a knowledge graph (KG). Existing solutions mainly rely on three kinds of information: local contextual similarity, global coherence, and prior probability. But the information of the mentions' types is rarely utilized, which is helpful for precise entity linking. That is to say, if the type information of a mention is obtained from a mention classifier, we can exclude candidate entities with different types. However, the key challenge of realizing it lies in obtaining the type labels with appropriate granularity and performing entity linking with the error propagated from the mention classifier. To solve the challenges, we propose a model named type-oriented multi-task entity linking (TMTEL). First, we select types with appropriate granularity from the taxonomy of a KG, which is modeled as a nonlinear integer programming problem. Second, we use a multi-task learning framework to incorporate the selected types into entity linking. The type information is used to enhance the representation of the mentions' context, which is more robust to the errors of the mention classifier. Experimental results show that our model outperforms multiple existing solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
45. A Purely Entity-Based Semantic Search Approach for Document Retrieval.
- Author
-
Sidi, Mohamed Lemine and Gunal, Serkan
- Subjects
INFORMATION retrieval ,KNOWLEDGE base ,KNOWLEDGE graphs - Abstract
Over the past decade, knowledge bases (KB) have been increasingly utilized to complete and enrich the representation of queries and documents in order to improve the document retrieval task. Although many approaches have used KB for such purposes, the problem of how to effectively leverage entity-based representation still needs to be resolved. This paper proposes a Purely Entity-based Semantic Search Approach for Information Retrieval (PESS4IR) as a novel solution. The approach includes (i) its own entity linking method and (ii) an inverted indexing method, and for document retrieval and ranking, (iii) an appropriate ranking method is designed to take advantage of all the strengths of the approach. We report the findings on the performance of our approach, which is tested by queries annotated by two known entity linking tools, REL and DBpedia-Spotlight. The experiments are performed on the standard TREC 2004 Robust and MSMARCO collections. By using the REL method on the Robust collection, for the queries whose terms are all annotated and whose average annotation scores are greater than or equal to 0.75, our approach achieves the maximum nDCG@5 score (1.00). Also, it is shown that using PESS4IR alongside another document retrieval method would improve performance, unless that method alone achieves the maximum nDCG@5 score for those highly annotated queries. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
46. Entity graphs for exploring online discourse.
- Author
-
Botzer, Nicholas and Weninger, Tim
- Subjects
NATURAL language processing ,VIRTUAL communities ,DISCOURSE ,COGNITIVE psychology ,SOCIAL networks - Abstract
A vast amount of human communication occurs online. These digital traces of natural human communication along with recent advances in natural language processing technology provide for computational analysis of these discussions. In the study of social networks, the typical perspective is to view users as nodes and concepts as flowing through and among the user nodes within the social network. In the present work, we take the opposite perspective: we extract and organize massive amounts of group discussion into a concept space we call an entity graph where concepts and entities are static and human communicators move about the concept space via their conversations. Framed by this perspective, we performed several experiments and comparative analysis on large volumes of online discourse from Reddit. In quantitative experiments, we found that discourse was difficult to predict, especially as the conversation carried on. We also developed an interactive tool to visually inspect conversation trails over the entity graph; although they were difficult to predict, we found that conversations, in general, tended to diverge to a vast swath of topics initially, but then tended to converge to simple and popular concepts as the conversation progressed. An application of the spreading activation function from the field of cognitive psychology also provided compelling visual narratives from the data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
47. MEDDOPLACE Shared Task overview: recognition, normalization and classification of locations and patient movement in clinical texts.
- Author
-
Lima-López, Salvador, Farré-Maduell, Eulàlia, Briva--Iglesias, Vicent, Gasco-Sanchez, Luis, and Krallinger, Martin
- Subjects
SPANISH language ,LOCATION analysis ,ITALIAN language ,ENGLISH language ,PORTUGUESE language - Abstract
Copyright of Procesamiento del Lenguaje Natural is the property of Sociedad Espanola para el Procesamiento del Lenguaje Natural and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2023
- Full Text
- View/download PDF
48. Knowledge Graph Engineering Based on Semantic Annotation of Tables.
- Author
-
Dorodnykh, Nikita and Yurin, Aleksandr
- Subjects
KNOWLEDGE graphs ,INSPECTION & review ,INDUSTRIAL safety ,SEMANTICS ,LABOR market ,ENGINEERING ,MACHINE learning - Abstract
A table is a convenient way to store, structure, and present data. Tables are an attractive knowledge source in various applications, including knowledge graph engineering. However, a lack of understanding of the semantic structure and meaning of their content may reduce the effectiveness of this process. Hence, the restoration of tabular semantics and the development of knowledge graphs based on semantically annotated tabular data are highly relevant tasks that have attracted a lot of attention in recent years. We propose a hybrid approach using heuristics and machine learning methods for the semantic annotation of relational tabular data and knowledge graph populations with specific entities extracted from the annotated tables. This paper discusses the main stages of the approach, its implementation, and performance testing. We also consider three case studies for the development of domain-specific knowledge graphs in the fields of industrial safety inspection, labor market analysis, and university activities. The evaluation results revealed that the application of our approach can be considered the initial stage for the rapid filling of domain-specific knowledge graphs based on tabular data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
49. Modular Bibliographical Profiling of Historic Book Reviews
- Author
-
Matthew J. Lavin
- Subjects
book reviews ,cultural analytics ,information extraction ,entity linking ,natural language processing ,named entity recognition ,History of scholarship and learning. The humanities ,AZ20-999 ,Language and Literature - Abstract
This paper examines different methods of predicting bibliographical details (e.g. author, title, and publisher) of books under review in a corpus of approximately 1,100 historical book reviews. The dataset is comprised of book reviews from ProQuest’s American Periodicals Series (APS). This kind of bibliographical profiling is often characterized as a Natural Language Processing (NLP) or Named Entity Recognition (NER) task, but it can be more specifically described as a two-part Named Entity Linking (NEL) task, beginning with a feature extraction stage followed by one of several available matching or classification methods. An attempt has been made to formalize constraints for modular bibliographical profiling (MBP) and shed light on some important choices that are often glossed over or obscured by digital humanities practitioners. Applying these constraints, the paper evaluates combinations of feature selection (naive bag-of-words [BOW], rule-based feature extraction, and NER using a pre-trained model) with a standardized similarity-based matching strategy (cosine similarity). All tasks are performed on derived text data (term frequency tables), so that data can be shared and all methods can be used on materials available only in non-consumptive formats. These comparisons suggest that naive BOW can perform quite robustly, and that using even a basic pre-trained NER model in conjunction with a BOW approach may reduce false positives.
- Published
- 2024
- Full Text
- View/download PDF
50. Entity Linking Method for Chinese Short Texts with Multiple Embedded Representations.
- Author
-
Shi, Yongqi, Yang, Ruopeng, Yin, Changsheng, Lu, Yiwei, Yang, Yuantao, and Tao, Yu
- Subjects
KNOWLEDGE graphs ,FEATURE extraction ,NATURAL language processing ,VIRTUAL networks - Abstract
Entity linking, a crucial task in the realm of natural language processing, aims to link entity mentions in a text to their corresponding entities in the knowledge base. While long documents provide abundant contextual information, facilitating feature extraction for entity identification and disambiguation, entity linking in Chinese short texts presents significant challenges. This study introduces an innovative approach to entity linking within Chinese short texts, combining multiple embedding representations. It integrates embedding representations from both entities and relations in the knowledge graph triples, as well as embedding representations from the descriptive text of entities and relations, to enhance the performance of entity linking. The method also incorporates external semantic supplements to strengthen the model's feature learning capabilities. The Multi-Embedding Representation–Bidirectional Encoder Representation from Transformers–Bidirectional Gated Recurrent Unit (MER-BERT-BiGRU) neural network model is employed for embedding learning. The precision, recall, and F1 scores reached 89.73%, 92.18%, and 90.94% respectively, demonstrating the effectiveness of our approach. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.