Descriptor: "WSD" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"WSD"' showing total 224 results

Start Over Descriptor "WSD"

224 results on '"WSD"'

1. Comparative Analysis of Decision Tree and k-NN to Solve WSD Problem in Kashmiri

Author: Mir, Tawseef Ahmad, Lawaye, Aadil Ahmad, Rana, Parveen, Ahmed, Ghayas, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Hassanien, Aboul Ella, editor, Castillo, Oscar, editor, Anand, Sameer, editor, and Jaiswal, Ajay, editor
Published: 2024
Full Text: View/download PDF

2. Homograph recognition algorithm based on Euclidean metric

Author: Elisa S. Izrailova, Arslanbek V. Astemirov, Ayshat S. Badaeva, Zelimhan A. Sultanov, Salaudin M. Umarkhadzhiev, Mokhmad-Salekh L. Khekhaev, and Madina L. Yasaeva
Subjects: graphic homonymy, homographs, wsd, speech synthesis, chechen language, low resource languages, text corpus, Optics. Light, QC350-467, Electronic computers. Computer science, QA75.5-76.95
Abstract: The problem of resolving the uncertainties associated with homonymy for the Chechen language has become especially relevant after the creation of speech synthesis systems. The main disadvantage of speech synthesizers in the Chechen language are errors in reading homograph words that differ in the length / brevity of vowels — the longitude of such sounds is not displayed in any way when writing. The reproduction of diphthongs, which are indicated on the letter in the same way as monophthongs close to them in sound, causes problems. To improve the quality of synthesized speech in the Chechen language, an automatic homograph recognition program is needed. To solve this problem, the article considers the task of eliminating the ambiguity of the meaning of the words WSD (Word Sense Disambiguation). Algorithmic (supervised) methods based on a pre-marked database have been selected for the Chechen language. These methods are the most common solutions for eliminating the ambiguity of the meaning of words. The implementation of such methods is possible in the presence of large marked-up corpora that are inaccessible to most languages of the world including Chechen. The Chechen language belongs to low-resource languages for which the optimal approach from the point of view of saving labor and time resources is a semi-controlled hybrid method of homograph recognition based on the use of algorithmic and statistical methods. The algorithm created by the authors for recognizing homographs by six adjacent words in a sentence is presented. The method is implemented as a program. Preliminary preparation of the initial data for the operation of the algorithm includes marking of proposals by the values of homographs performed “manually”. The results of the program were evaluated using generally recognized accuracy metrics and amounted to F1 — 39 %, Accuracy — 45 %. A comparative analysis of the data obtained with the results of other methods and models showed that the accuracy of the algorithm presented in this article is closest to the results of the accuracy of algorithms based on the Lesk method. Using Lesk method for English, the results of F1 accuracy were obtained — 41.1 % (simple Lesk) and 51.1 % (extended Lesk). Methods using neural network algorithms provide higher WSD accuracy rates for most languages; however, their implementation requires large data bodies, which is not always available for low-resource languages, including Chechen.
Published: 2024
Full Text: View/download PDF

3. Contextual word disambiguates of Ge'ez language with homophonic using machine learning

Author: Mequanent Degu Belete, Ayodeji Olalekan Salau, Girma Kassa Alitasb, and Tigist Bezabh
Subjects: Ge'ez language, WSD, Text vectorization, Machine learning, Philology. Linguistics, P1-1091
Abstract: According to natural language processing experts, there are numerous ambiguous words in languages. Without automated word meaning disambiguation for any language, the development of natural language processing technologies such as information extraction, information retrieval, machine translation, and others are still challenging task. Therfore, this paper presents the development of a word sense disambiguation model for duplicate alphabet words for the Ge'ez language using corpus-based methods. Because there is no wordNet or public dataset for the Ge'ez language, 1010 samples of ambiguous words were gathered. Afterwards, the words were preprocessed and the text was vectorized using bag of words, Term Frequency-Inverse Document Frequency, and word embeddings such as word2vec and fastText. The vectorized texts are then analysed using the supervised machine learning algorithms such Naive Bayes, decision trees, random forests, K-nearest neighbor, linear support vector machine, and logistic regression. Bag of words paired with random forests outperformed all other combinations, with an accuracy of 99.52%. However, when Deep learning algorithms such as Deep neural network and Long Short-Term memory were used for the same dataset, a 100% accuracy was achieved.
Published: 2024
Full Text: View/download PDF

4. Chinese Financial Comments Sentiment Detection Based on the Bert-TCN Model Based on HowNet Disambiguation

Author: Xia, Linhan, Li, Kan, Editor-in-Chief, Li, Qingyong, Associate Editor, Fournier-Viger, Philippe, Series Editor, Hong, Wei-Chiang, Series Editor, Liang, Xun, Series Editor, Wang, Long, Series Editor, Xu, Xuesong, Series Editor, Chen, Charles, editor, Singh, Satya Narayan, editor, Saxena, Sandeep, editor, and Wheeb, Ali Hussein, editor
Published: 2023
Full Text: View/download PDF

5. A Hybrid Approach for Sentiment Analysis Using Game Theory in Word Sense Disambiguation

Author: Singhania, Aryan, Gupta, Harsh, Jain, Minni, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Swaroop, Abhishek, editor, Polkowski, Zdzislaw, editor, Correia, Sérgio Duarte, editor, and Virdee, Bal, editor
Published: 2023
Full Text: View/download PDF

6. An Analysis of Word Sense Disambiguation (WSD)

Author: Nanjundan, Preethi, Mathews, Eappen Zachariah, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Jain, Sarika, editor, Groppe, Sven, editor, and Mihindukulasooriya, Nandana, editor
Published: 2023
Full Text: View/download PDF

7. DETERMINATION OF LRFD ENVIRONMENTAL LOAD FACTORS OF OFFSHORE PLATFORM IN THE NORTH OF JAVA SEA AND MAKASSAR STRAIT.

Author: Paramashanti, Rildova, Hermanto, Mochammad Fathurridho, and Nandalianadhira, Nafisa
Subjects: OFFSHORE structures, STRAITS, WAVE analysis, BP Deepwater Horizon Explosion & Oil Spill, 2010
Abstract: The environmental load factor in commonly used offshore platforms design code, API RP-2A (American Petroleum Institute - Recommended Practice 2A), is developed based on the environmental conditions of American waters, especially the Gulf of Mexico, which have relatively extreme environmental conditions when compared to Indonesian waters. Case studies were conducted to determine environmental load factors in Indonesian waters, particularly the North Java Sea and Makassar Strait, categorized as shallow seas. This analysis was carried out on the performance criteria of pushover failure. In this study, the base shear was analyzed to describe the strength of the structure in the form of a collapse base shear (CBS) and the load in the form of a wave base shear (WBS). CBS was obtained through pushover analysis with yield strength randomness. WBS was obtained through in-place analysis with wave height randomness. This concept was applied to the structure of the Monopod and Braced Monopod types of offshore platforms located in the North of Java Sea and Makassar Strait waters which had been optimized for the WSD and LRFD design methods. The reliability of the structure was analyzed based on the CBS and WBS values using the First Order Reliability Method (FORM) II. The reliability analysis results were in the form of a reliability index (ß). North Java Sea gives a reliability index in the range from 3.58 to 4.38 for every design criteria. While Makassar Strait gives a reliability index in the range from 3.17 to 3.54 for every design criteria. With a high target safety level for the North Java Sea location, a 1.10 environmental load factor is recommended for further offshore structure design. But, for the Makassar Strait location, more studies need to be done to get better environmental load factor recommendations. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

8. A Wide Network Scanning for Discovery of UDP-Based Reflectors in the Nordic Countries

Author: Bjerre, Alexander, Westh, Andreas Philip, Villefrance, Emil, Haque, A S M Farhan Al, Andersen, Jonas Bukrinski, Helgogaard, Lucas K., Anagnostopoulos, Marios, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Reiser, Hans P., editor, and Kyas, Marcel, editor
Published: 2022
Full Text: View/download PDF

9. The Role of Artificial Neural Network in Word Sense Disambiguation (WSD)—A Survey

Author: Roopa, H. R., Panneer Arockiaraj, S., Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Rathore, Vijay Singh, editor, Sharma, Subhash Chander, editor, Tavares, Joao Manuel R.S., editor, Moreira, Catarina, editor, and Surendiran, B., editor
Published: 2022
Full Text: View/download PDF

10. A Unified Sense Inventory for Word Sense Disambiguation in Polish

Author: Janz, Arkadiusz, Dziob, Agnieszka, Oleksy, Marcin, Baran, Joanna, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Groen, Derek, editor, de Mulatier, Clélia, editor, Paszynski, Maciej, editor, Krzhizhanovskaya, Valeria V., editor, Dongarra, Jack J., editor, and Sloot, Peter M. A., editor
Published: 2022
Full Text: View/download PDF

11. Word Sense Disambiguation System for Information Retrieval in Telugu Language

Author: Koppula, Neeraja, Pradeep Kumar, J., Srinivas Rao, Koppula, Kiran Kumar, G., Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Mandal, Jyotsna Kumar, editor, and De, Debashis, editor
Published: 2022
Full Text: View/download PDF

12. A new rainfall prediction model based on ICEEMDAN-WSD-BiLSTM and ESN.

Author: Zhang, Xianqi, Chen, Haiyang, Wen, Yihao, Shi, Jinwen, and Xiao, Yimeng
Subjects: PREDICTION models, HILBERT-Huang transform, SIGNAL denoising, REGIONAL development, FLOOD control, FLOODS, DROUGHTS
Abstract: Precipitation, as an important indicator describing the evolution of the regional climate system, plays an important role in understanding the spatial and temporal distribution characteristics of regional precipitation. Scientific and accurate prediction of regional precipitation is helpful to provide theoretical basis for relevant departments to guide flood and drought control. To address the uncertainty and nonlinear characteristics of precipitation series, this paper uses the established improved complete ensemble empirical mode decomposition with adaptive noise (ICEEMDAN)-wavelet signal denoising (WSD)-bi-directional long short-term memory (BiLSTM), and echo state network (ESN) models to predict precipitation of four cities in southern Anhui Province. The BiLSTM is used to predict the high-frequency components and the ESN to predict the low-frequency components, thus avoiding the influence between the two neural network predictions. The results show that the ICEEMDAN-WSD-BiLSTM and ESN models are more accurate. The average relative error reached 2.64% and the NSE (Nash–Sutcliffe efficiency coefficient) was 0.91, which was significantly better than the other four models. The model reveals the temporal change pattern and evolution characteristics of future precipitation, guides flood prevention and mitigation, and has certain theoretical significance and application value for promoting regional sustainable development. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

13. Word Sense Disambiguation for Marathi Language Using WordNet and the Lesk Approach

Author: Kharate, Namrata G., Patil, Varsha H., Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Patil, Varsha H., editor, Dey, Nilanjan, editor, N. Mahalle, Parikshit, editor, Shafi Pathan, Mohd, editor, and Kimbahune, Vinod. V., editor
Published: 2021
Full Text: View/download PDF

14. Multimodal Web Content Mining to Filter Non-learning Sites Using NLP

Author: Modi, Sangita S., Jagtap, Sudhir B., Xhafa, Fatos, Series Editor, Pandian, A. Pasumpon, editor, Senjyu, Tomonobu, editor, Islam, Syed Mohammed Shamsul, editor, and Wang, Haoxiang, editor
Published: 2020
Full Text: View/download PDF

15. Assessment of Information Extraction Techniques, Models and Systems.

Author: Rahman, Atta-ur, Musleh, Dhiaa, Nabil, Majed, Alubaidan, Haya, Gollapalli, Mohammed, Krishnasamy, Gomathi, Almoqbil, Dakheel, Khan, Mohammad Aftab Alam, Farooqui, Mehwash, Ahmed, Mohammed Imran Basheer, Ahmed, Mohammed Salih, and Mahmud, Maqsood
Subjects: DATA mining, EXTRACTION techniques, SEARCH engines, ABSTRACTING & indexing services, DIGITAL libraries
Abstract: The present article aims to review and evaluate the practiced and classical techniques, tools, models, and systems concerning automatic information extraction (IE) from published scientific documents like research articles, patents, theses, technical reports, and case studies etc. IE is performed for various reasons such as better indexing, archiving, searching, and retrieving. That is mainly used by the search engines and the indexing services as well the digital libraries and semantic web. In this regard, several studies have been conducted targeting various nature of documents. The study pays special consideration to the successful IE models, algorithms and approaches applied to structural IE from published documents. To grasp this, the paper is classified into several segments and each segment covers a significant aspect of IE. Furthermore, to validate their benefits and drawbacks, a comparative study of all the approaches have been conducted in terms of various performance factors like precision, accuracy, recall and F-score. Potential areas of improvement have been emphasized as research gap for the scholars in the closely related areas. Ultimately, a comprehensive summary of the evaluation is presented in tabular form and review is concluded. It was observed that the hybrid methods outperform the other methods due to their versatile nature to address various document formats. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

16. A Lemmatizer Tool for Assamese Language

Author: Roy, Arindam, Sarkar, Sunita, Borkakoty, Hsubhas, Barbosa, Simone Diniz Junqueira, Editorial Board Member, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Kotenko, Igor, Editorial Board Member, Yuan, Junsong, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Mandal, Jyotsna Kumar, editor, Mukhopadhyay, Somnath, editor, Dutta, Paramartha, editor, and Dasgupta, Kousik, editor
Published: 2019
Full Text: View/download PDF

17. Verifying Usefulness of Algorithms for WordNet Based Similarity Sense Disambiguation

Author: Kukla, Elżbieta, Siemiński, Andrzej, Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, Choroś, Kazimierz, editor, Kopel, Marek, editor, Kukla, Elżbieta, editor, and Siemiński, Andrzej, editor
Published: 2019
Full Text: View/download PDF

18. Word Sense Disambiguation with Massive Contextual Texts

Author: Liu, Ya-fei, Wei, Jinmao, Hutchison, David, Editorial Board Member, Kanade, Takeo, Editorial Board Member, Kittler, Josef, Editorial Board Member, Kleinberg, Jon M., Editorial Board Member, Mattern, Friedemann, Editorial Board Member, Mitchell, John C., Editorial Board Member, Naor, Moni, Editorial Board Member, Pandu Rangan, C., Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Terzopoulos, Demetri, Editorial Board Member, Tygar, Doug, Editorial Board Member, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Li, Guoliang, editor, Yang, Jun, editor, Gama, Joao, editor, Natwichai, Juggapong, editor, and Tong, Yongxin, editor
Published: 2019
Full Text: View/download PDF

19. Word Sense Induction with Attentive Context Clustering

Author: Moshe Stekel, Amos Azaria, and Shai Gordin
Subjects: clustering, nlp, wsi, wsd, [info.info-ai]computer science [cs]/artificial intelligence [cs.ai], History of scholarship and learning. The humanities, AZ20-999, Bibliography. Library science. Information resources
Abstract: This paper presents ACCWSI (Attentive Context Clustering WSI), a method for Word Sense Induction, suitable for languages with limited resources. Pretrained on a small corpus and given an ambiguous word (a query word) and a set of excerpts that contain it, ACCWSI uses an attention mechanism for generating context-aware embeddings, distinguishing between the different senses assigned to the query word. These embeddings are then clustered to provide groups of main common uses of the query word. We show that ACCWSI performs well on the SemEval-2 2010 WSI task. ACCWSI also demonstrates practical applicability for shedding light on the meanings of ambiguous words in ancient languages, such as Classical Hebrew and Akkadian. In the near future, we intend to turn ACCWSI into a practical tool for linguists and historians.
Published: 2022
Full Text: View/download PDF

20. Practice of Word Sense Disambiguation

Author: Siemiński, Andrzej, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Nguyen, Ngoc Thanh, editor, Hoang, Duong Hung, editor, Hong, Tzung-Pei, editor, Pham, Hoang, editor, and Trawiński, Bogdan, editor
Published: 2018
Full Text: View/download PDF

21. Word Sense Disambiguation Using IndoWordNet

Author: Bhingardive, Sudha, Bhattacharyya, Pushpak, Dash, Niladri Sekhar, editor, Bhattacharyya, Pushpak, editor, and Pawar, Jyoti D., editor
Published: 2017
Full Text: View/download PDF

22. Assamese Word Sense Disambiguation using Cuckoo Search Algorithm.

Author: Gogoi, Arjun, Baruah, Nomi, and Nath, Lakhya Jyoti
Subjects: SEARCH algorithms, MACHINE translating, ALGORITHMS, NATURAL languages, TABU search algorithm, PROBLEM solving, NATURAL language processing
Abstract: Natural language processing is associated with human-computer interaction, where several challenges require natural language understanding. The Word sense disambiguation problem comprises the computational assignment of meaning to a word according to a specific context in which it occurs. There are numerous natural language processing applications, such as machine translation, information retrieval, and information extraction, which require this task which takes place at the semantic level. To solve this problem unsupervised computation proposals can be effective since they have been successfully used for many real-world optimization problems. In this paper, we propose to solve the word sense disambiguation problem using the cuckoo search algorithm in the Assamese language. We illustrate the performance of our algorithm by carrying out experiments on an Assamese corpus. And comparing them against an unsupervised genetic algorithm that is implemented in the Assamese language. Results of the experiment show that the cuckoo algorithm can achieve more precision, recall and F-measure, attaining 87.5, 84, and 85.71 percentages respectively. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

23. Arabic Gloss WSD Using BERT.

Author: El-Razzaz, Mohammed, Fakhr, Mohamed Waleed, Maghraby, Fahima A., and Prati, Andrea
Subjects: VOCABULARY, CORPORA
Abstract: Word Sense Disambiguation (WSD) aims to predict the correct sense of a word given its context. This problem is of extreme importance in Arabic, as written words can be highly ambiguous; 43% of diacritized words have multiple interpretations and the percentage increases to 72% for non-diacritized words. Nevertheless, most Arabic written text does not have diacritical marks. Gloss-based WSD methods measure the semantic similarity or the overlap between the context of a target word that needs to be disambiguated and the dictionary definition of that word (gloss of the word). Arabic gloss WSD suffers from a lack of context-gloss datasets. In this paper, we present an Arabic gloss-based WSD technique. We utilize the celebrated Bidirectional Encoder Representation from Transformers (BERT) to build two models that can efficiently perform Arabic WSD. These models can be trained with few training samples since they utilize BERT models that were pretrained on a large Arabic corpus. Our experimental results show that our models outperform two of the most recent gloss-based WSDs when we test them against the same test data used to evaluate our model. Additionally, our model achieves an F1-score of 89% compared to the best-reported F1-score of 85% for knowledge-based Arabic WSD. Another contribution of this paper is introducing a context-gloss benchmark that may help to overcome the lack of a standardized benchmark for Arabic gloss-based WSD. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

24. Word sense disambiguation based on stretchable matching of the semantic template.

Author: Wang, Wei, Huang, Degen, and Yu, Haitao
Subjects: *NATURAL language processing, *VARIATION in language, *NATURAL languages, *VOCABULARY
Abstract: It is evident that the traditional hard matching of a fixed-length template cannot satisfy the nearly indefinite variations in natural language. This issue mainly results from three major problems of the traditional matching mode: 1) in matching with a short template, the context of natural language cannot be effectively captured; 2) in matching with a long template, serious data sparsity will lead to a low success rate of template matching (i.e., low recall); and 3) due to a lack of flexible matching ability, traditional hard matching is more prone to failure. Therefore, this paper proposed a novel method of stretchable matching of the semantic template (SMOST) to deal with the above problems. We have applied this method to word sense disambiguation in the natural language processing field. In the same case of using only the SemCor corpus, the result of our system is very close to the best result of existing systems, which shows the effectiveness of new proposed method. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

25. Dataset of white spot disease affected shrimp farmers disaggregated by the variables of farm site, environment, disease history, operational practices, and saline zones

Author: Neaz A. Hasan and Mohammad Mahfujul Haque
Subjects: Disaggregated data, Shrimp farming, Risk factors, WSD, Bangladesh, Computer applications to medicine. Medical informatics, R858-859.7, Science (General), Q1-390
Abstract: The article presents the summary of a dataset related to the risks factors of white spot disease (WSD) of farmed shrimp (Penaeus monodon) in Khulna, Bagerhat and Satkhira districts of Bangladesh. This dataset was developed following two consecutive steps. In the first step, participatory rural appraisal tools were applied to get the conceptual framework for data collection regarding lists of farmers and the variables of the risk factors of WSD. In the second step, sampling of farmers, google featured questionnaire development, and mobile phone-assisted survey were carried out. The total surveyed farms were 233 consisting of 21 and 212 semi-intensive and extensive farms, respectively. The data were collected in the form of continuous, nominal and binary variables disaggregated by saline zones. The dataset contains some basic socio-economic data of shrimp farmers, farm characteristics, environmental attributes and disease history of shrimp farms. The dataset also has GPS coordinates of all the surveyed farms individually which are very useful for spatial analysis. In total, the dataset in MS Excel has 46 variables and attached as the supplementary material with this article.
Published: 2020
Full Text: View/download PDF

26. Evaluation of Disambiguation Strategies on Biomedical Text Categorization

Author: Rais, Mohammed, Lachkar, Abdelmonaime, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Ortuño, Francisco, editor, and Rojas, Ignacio, editor
Published: 2016
Full Text: View/download PDF

27. Multilingual Projections

Author: Bhattacharyya, Pushpak, Ide, Nancy, Series editor, Gala, Núria, editor, Rapp, Reinhard, editor, and Bel-Enguix, Gemma, editor
Published: 2015
Full Text: View/download PDF

28. DURHAM : a word sense disambiguation system

Author: Hawkins, Paul Martin
Subjects: 005, SEMCOR, WSD
Abstract: Ever since the 1950's when Machine Translation first began to be developed, word sense disambiguation (WSD) has been considered a problem to developers. In more recent times, all NLP tasks which are sensitive to lexical semantics potentially benefit from WSD although to what extent is largely unknown. The thesis presents a novel approach to the task of WSD on a large scale. In particular a novel knowledge source is presented named contextual information. This knowledge source adopts a sub-symbolic training mechanism to learn information from the context of a sentence which is able to aid disambiguation. The system also takes advantage of frequency information and these two knowledge sources are combined. The system is trained and tested on SEMCOR. A novel disambiguation algorithm is also developed. The algorithm must tackle the problem of a large possible number of sense combinations in a sentence. The algorithm presented aims to make an appropriate choice between accuracy and efficiency. This is performed by directing the search at a word level. The performance achieved on SEMCOR is reported and an analysis of the various components of the system is performed. The results achieved on this test data are pleasing, but are difficult to compare with most of the other work carried out in the field. For this reason the system took part in the SENSEVAL evaluation which provided an excellent opportunity to extensively compare WSD systems. SENSEVAL is a small scale WSD evaluation using the HECTOR lexicon. Despite this, few adaptations to the system were required. The performance of the system on the SENSEVAL task are reported and have also been presented in [Hawkins, 2000].
Published: 1999

29. Improving stemming for Assamese information retrieval

Author: Gogoi, Arjun, Baruah, Nomi, Sarma, Sikhar Kr., and Phukan, Rakhee D.
Published: 2021
Full Text: View/download PDF

30. Disambiguation of Biomedical Acronyms Based on a Bidirectional Recurrent Neural Network of Character-level Features.

Author: Ren Kai, Li Na, Xiong Wei, and Wang Shi-Wen
Subjects: *RECURRENT neural networks, *ACRONYMS, *MODEL railroads
Abstract: Polysemic acronyms are very common in the field of biomedicine. These acronyms have different senses in different contexts. The ambiguity of acronyms may cause significant negative impact on the understanding of the full text by machine learning. To address the disambiguation of acronyms in the biomedical domain, most associated studies are based on methods using word-level contextual features. These methods require abundant relevant external resources for model training, and the accuracy of their disambiguation of acronyms may decrease greatly upon the lack of external resources. In this study, disambiguation of biomedical acronyms was investigated on the basis of the character-level feature model to realize the disambiguation of biomedical acronyms with largely limited external corpora. First, sentences containing ambiguous acronyms were extracted through retrieval and the feature vector of the context were initialized by using the character-level features. Second, these initial vectors were input into the bidirectional long shortterm memory neutral network model for training. Finally, the disambiguation of acronyms was realized by the outputs of the neutral network model through the Softmax classification approach. The results of acronym disambiguation based on character-level feature model were also compared with those based on word-level feature models. Results demonstrate that the average accuracy of the character-level feature neutral network algorithm reaches 85.82% on the dataset of 106 common biomedical acronyms. Thus, the character-level feature neutral network algorithm is superior to the traditional methods, which use a large number of external resources. This study confirms that the disambiguation method based on character-level features is applicable to the disambiguation of biomedical acronyms under limited relevant data. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

31. Arabic Gloss WSD Using BERT

Author: Mohammed El-Razzaz, Mohamed Waleed Fakhr, and Fahima A. Maghraby
Subjects: WSD, BERT, Arabic, context gloss, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: Word Sense Disambiguation (WSD) aims to predict the correct sense of a word given its context. This problem is of extreme importance in Arabic, as written words can be highly ambiguous; 43% of diacritized words have multiple interpretations and the percentage increases to 72% for non-diacritized words. Nevertheless, most Arabic written text does not have diacritical marks. Gloss-based WSD methods measure the semantic similarity or the overlap between the context of a target word that needs to be disambiguated and the dictionary definition of that word (gloss of the word). Arabic gloss WSD suffers from a lack of context-gloss datasets. In this paper, we present an Arabic gloss-based WSD technique. We utilize the celebrated Bidirectional Encoder Representation from Transformers (BERT) to build two models that can efficiently perform Arabic WSD. These models can be trained with few training samples since they utilize BERT models that were pretrained on a large Arabic corpus. Our experimental results show that our models outperform two of the most recent gloss-based WSDs when we test them against the same test data used to evaluate our model. Additionally, our model achieves an F1-score of 89% compared to the best-reported F1-score of 85% for knowledge-based Arabic WSD. Another contribution of this paper is introducing a context-gloss benchmark that may help to overcome the lack of a standardized benchmark for Arabic gloss-based WSD.
Published: 2021
Full Text: View/download PDF

32. A Linguistic Approach to Opinion Mining

Author: Tuveri, Franco, Angioni, Manuela, Lai, Cristian, editor, Semeraro, Giovanni, editor, and Vargiu, Eloisa, editor
Published: 2013
Full Text: View/download PDF

33. Semantic Rule Filtering for Web-Scale Relation Extraction

Author: Moro, Andrea, Li, Hong, Krause, Sebastian, Xu, Feiyu, Navigli, Roberto, Uszkoreit, Hans, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Alani, Harith, editor, Kagal, Lalana, editor, Fokoue, Achille, editor, Groth, Paul, editor, Biemann, Chris, editor, Parreira, Josiane Xavier, editor, Aroyo, Lora, editor, Noy, Natasha, editor, Welty, Chris, editor, and Janowicz, Krzysztof, editor
Published: 2013
Full Text: View/download PDF

34. WSD algorithm based on a new method of vector-word contexts proximity calculation via epsilon-filtration

Author: Andrew Krizhanovsky, Alexander Kirillov, and Natalia Krizhanovskaya
Subjects: synonym, synset, corpus linguistics, word2vec, wikisource, wsd, rusvectores, wiktionary, Science
Abstract: The problem of word sense disambiguation (WSD) is considered in the article. Set of synonyms (synsets) and sentences with these synonyms are taken. It is necessary to automatically select the meaning of the word in the sentence. 1285 sentences were tagged by experts, namely, one of the dictionary meanings was selected by experts for target words. To solve the WSD problem, an algorithm based on a new method of vector-word contexts proximity calculation is proposed. A preliminary epsilon-filtering of words is performed, both in the sentence and in the set of synonyms, in order to achieve higher accuracy. An extensive program of experiments was carried out. Four algorithms are implemented, including the new algorithm. Experiments have shown that in some cases the new algorithm produces better results. The developed software and the tagged corpus have an open license and are available online. Wiktionary and Wikisource are used. A brief description of this work can be viewed as slides (https://goo.gl/9ak6Gt). A video lecture in Russian about this research is available online (https://youtu.be/-DLmRkepf58).
Published: 2018
Full Text: View/download PDF

35. Annotating Words Using WordNet Semantic Glosses

Author: Szymański, Julian, Duch, Włodzisław, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Huang, Tingwen, editor, Zeng, Zhigang, editor, Li, Chuandong, editor, and Leung, Chi Sing, editor
Published: 2012
Full Text: View/download PDF

36. A Cognitive Approach to Word Sense Disambiguation

Author: Dutta, Sudakshina, Basu, Anupam, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, and Gelbukh, Alexander, editor
Published: 2012
Full Text: View/download PDF

37. An Automatic Service Classification Approach

Author: Zhao, Haiyan, Chen, Qingkui, Kacprzyk, Janusz, editor, Wang, Yinglin, editor, and Li, Tianrui, editor
Published: 2011
Full Text: View/download PDF

38. Automatic Word Sense Disambiguation and Construction Identification Based on Corpus Multilevel Annotation

Author: Lyashevskaya, Olga, Mitrofanova, Olga, Grachkova, Maria, Romanov, Sergey, Shimorina, Anastasia, Shurygina, Alexandra, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, Habernal, Ivan, editor, and Matoušek, Václav, editor
Published: 2011
Full Text: View/download PDF

39. Evaluating Tagsets for Sanskrit

Author: Gopal, Madhav, Mishra, Diwakar, Singh, Devi Priyanka, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, and Jha, Girish Nath, editor
Published: 2010
Full Text: View/download PDF

40. Selecting an Appropriate Web-Scale Discovery Service: A Study of the Big 4's.

Author: Kumar, Vinit
Subjects: *FEDERATED searching, *MARKETPLACES, *USER interfaces, *ALGORITHMS
Abstract: Seeing the dynamic user demands the libraries are changing their approach to the user services. The web-scale discovery (WSD) service is the latest attempt in this direction. There are several players in the marketplace providing solutions for WSD with products having basic features and subtle features as well. As more players are entering the marketplace, it becomes challenging to select appropriate WSD system. It is also paramount for the library managers to be aware of the myriad range of features and underlying technology of WSD. This understanding will be for making informed purchase decisions. This paper attempts to explain in detail the components of a typical WSD system. Further, the paper evaluates the features of the Big 4's in WSD. The paper concludes by discussing some of the parameters to consider while evaluating the WSD system. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

41. Statistical Word Sense Disambiguation in Contexts for Russian Nouns Denoting Physical Objects

Author: Mitrofanova, Olga, Lashevskaya, Olga, Panicheva, Polina, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Sojka, Petr, editor, Horák, Aleš, editor, Kopeček, Ivan, editor, and Pala, Karel, editor
Published: 2008
Full Text: View/download PDF

42. Micro and nanometric wear evaluation of metal discs used on determination of biodiesel fuel lubricity

Author: Aline Cristina Mendes de Farias, João Telésforo Nóbrega de Medeiros, and Salete Martins Alves
Subjects: AFM, WSD, roughness, biodiesel, lubricity, HFRR, Materials of engineering and construction. Mechanics of materials, TA401-492
Abstract: The contact of diesel fuel with engine subsystems demands a good wear resistance. Lubricity is an important feature for integrity of injection system and the sulphur composites are primarily responsible for lubrication of the injector nozzle. Biodiesel is responsible for partially restoring the lubricity of diesel fuel that presents low levels of sulphur composites and, furthermore, it causes less pollution than diesel fuel. The lubricity is measured through the wear scar diameter following the ASTM D 975 standards. However, the friction and wear with light loads of micro/nanocomponents are highly dependent on surface interactions that can be evaluated by microscopy techniques. This study aimed to measure and to analyze the biodiesel lubricity and their blends (B5, B20) with diesel by observing the wear scars of discs using the scanning electronic microscopy (SEM), atomic force microscopy (AFM) and micro roughness techniques. The fuels performance was evaluated using HFRR tribometer. The tests conditions were based on standard ADTM D-6079-04. The coefficient of friction was measure during the test. After the test, the worn ball and disc were analyzed by SEM, AFM and profilometer. The results showed that the addition of biodiesel in diesel improve the tribological performance of fuel. Also, the just WSD value is not sufficient to evaluate the lubrication ability of a fuel. Analysis of the worn disc surfaces proved to be compatible with WSD number and also more sensitive to these kinds of fuels, showing mainly the form and intensity of the wear.
Published: 2014

43. PHORA: A NLP System for Spanish

Author: Palomar, Manuel, Saiz-Noeda, Maximiliano, Muñoz, Rafael, Suárez, Armando, Martínez-Barco, Patricio, Montoyo, Andrés, and Gelbukh, Alexander, editor
Published: 2001
Full Text: View/download PDF

44. MODELING SEMANTIC DISTANCE IN THE PATTERN DICTIONARY OF ENGLISH VERBS.

Author: CINKOVÁ, SILVIE and HLÁVKA, ZDENĚK
Subjects: *SEMANTICS, *VERBS, *LAMMA language, *KWIC (Indexing system), *ENCYCLOPEDIAS & dictionaries
Abstract: We explore human judgments on how well individual patterns of 29 target verbs from the Pattern Dictionary of English Verbs describe their random KWICs. We focus on cases where more than one pattern is judged as highly appropriate for a given KWIC and seek to estimate the effect of event participants (arguments) being denotatively similar in two patterns, considering all pair combinations in a given lemma. We compare this effect to the effect of several contextual features of the KWICs, the effect of paired PDEV implicatures implying each other, and the effect of belonging to a given lemma. We show that the lemma effect is still stronger than any feature going across lemmas we have examined so far, so that each verb appears to be a little universe in its own right. [ABSTRACT FROM AUTHOR]
Published: 2017
Full Text: View/download PDF

45. Monitoring of low voltage grids with the use of SAIDI indexes.

Author: ŁUKASIK, Zbigniew, KOZYRA, Jacek, and KUŚMIŃSKA-FIJAŁKOWSKA, Aldona
Subjects: LOW voltage systems, ELECTRIC power distribution grids, ELECTRIC utility costs, ELECTRIC power production, ELECTRIC power consumption
Abstract: Copyright of Przegląd Elektrotechniczny is the property of Przeglad Elektrotechniczny and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2017
Full Text: View/download PDF

46. Generating the missing links for semantic relations within Wiktionary.

Author: Bawakid, Abdullah
Subjects: *SEMANTICS, *COMPARATIVE linguistics
Abstract: In many cases, a single presentation of a term may carry multiple meanings. Wiktionary provides a way for viewing the meanings of the different terms it stores in the form of senses. It also provides semantic relations. However, Wiktionary, in its current form, contains semantic relations linking Wiktionary entries at the term level. Links for semantic relations connecting entries at the word sense level do not currently exist in Wiktionary. In this paper, we propose a novel method for generating a new type of links for semantic relations within Wiktionary. This is effectively applied to aligning the source words senses for semantic relations in Wiktionary with their corresponding target word senses. We use surface-level features that rely only on the structure and content of Wiktionary for completing this task without the aid of any external lexical or knowledge bases. We present the details of the method and how it was implemented. Additionally, we describe the evaluations that we performed and illustrate the competitive results we obtained, especially when compared to other systems. Our findings indicate that our system outperforms the baselines and performs similar to state-of-art systems without requiring access to external online resources or training data to run. [ABSTRACT FROM AUTHOR]
Published: 2017

47. Word Sense Disambiguation using Aggregated Similarity based on WordNet Graph Representation

Author: Mădălina ZURINI
Subjects: WSD, Similarity Measure, WordNet, Ontology, Synset, Computer engineering. Computer hardware, TK7885-7895, Bibliography. Library science. Information resources
Abstract: The term of word sense disambiguation, WSD, is introduced in the context of text document processing. A knowledge based approach is conducted using WordNet lexical ontology, describing its structure and components used for the process of identification of context related senses of each polysemy words. The principal distance measures using the graph associated to WordNet are presented, analyzing their advantages and disadvantages. A general model for aggregation of distances and probabilities is proposed and implemented in an application in order to detect the context senses of each word. For the non-existing words from WordNet, a similarity measure is used based on probabilities of co-occurrences. The module of WSD is proposed for integration in the step of processing documents such as supervised and unsupervised classification in order to maximize the correctness of the classification. Future work is related to the implementation of different domain oriented ontologies.
Published: 2013
Full Text: View/download PDF

48. СИНДРОМ БЕЛОГО ПЯТНА У КРЕВЕТОК (WSD). ВЛИЯНИЕ ТЕМПЕРАТУРЫ ВОДЫ НА АКТИВНОСТЬ ВИРУСА. ИСПЫТАНИЯ ПО РАЗРАБОТКЕ ВАКЦИНЫ

Subjects: white spot disease, viral diseases of crustaceans, Whispovirus, WSD, shrimp, синдром белого пятна, вирусные болезни ракообразных, креветка
Abstract: В статье рассматривается синдром белого пятна креветок WSD, вызываемый двуцепочечным ДНК-вирусом семейства Nimaviridae рода Whispovirus, WSSV. Этот вид вируса вызывает повсеместные вспышки данного заболевания, наносящего огромный экономический ущерб хозяйствам по разведению креветок. Попадая в организм, вирус ассоциируется в тканях экто– и мезодермального происхождения, провоцируя возникновение покраснений под хитиновым покровом тела, а затем образование белых пятен по самому покрову. В случае возникновения эпидемии смертность ракообразных составляет 100%. Представляется необходимым изучить возможные способы лечения и профилактики распространения данного вируса для обеспечения полного функционирования креветочных ферм и снижения экономических потерь для производителя., The article studies the white spot disease in shrimp WSD, caused by the double-stranded DNA virus of the Nimaviridae family of the Whispovirus genus, WSSV. This type of virus causes widespread outbreaks of the disease, leading to enormous economic damage to shrimp farming. Once in the body, the virus manifests in the tissues of ectodermal and mesodermic origin, causing redness under the chitin cover of the integument, and white spots on the cover itself. In the event of an epidemic, the mortality of crustaceans would be 100%. It deems to be necessary to study treatment and prevention of the virus spread in order to ensure the full functioning of shrimp farms and to reduce the economic losses to the farmer., Международный научно-исследовательский журнал, Выпуск 6 (120) 2022, Pages 40-45
Published: 2022
Full Text: View/download PDF

49. Word Sense Disambiguation Based on Large Scale Polish CLARIN Heterogeneous Lexical Resources

Author: Paweł Kędzia, Maciej Piasecki, and Marlena Orlińska
Subjects: word sense disambiguation, WSD, page rank, plWordNet, graphs, lexical resources, Computational linguistics. Natural language processing, P98-98.5, Semantics, P325-325.5, Lexicography, P327-327.5
Abstract: Word Sense Disambiguation Based on Large Scale Polish CLARIN Heterogeneous Lexical Resources Lexical resources can be applied in many different Natural Language Engineering tasks, but the most fundamental task is the recognition of word senses used in text contexts. The problem is difficult, not yet fully solved and different lexical resources provided varied support for it. Polish CLARIN lexical semantic resources are based on the plWordNet — a very large wordnet for Polish — as a central structure which is a basis for linking together several resources of different types. In this paper, several Word Sense Disambiguation (henceforth WSD) methods developed for Polish that utilise plWordNet are discussed. Textual sense descriptions in the traditional lexicon can be compared with text contexts using Lesk’s algorithm in order to find best matching senses. In the case of a wordnet, lexico-semantic relations provide the main description of word senses. Thus, first, we adapted and applied to Polish a WSD method based on the Page Rank. According to it, text words are mapped on their senses in the plWordNet graph and Page Rank algorithm is run to find senses with the highest scores. The method presents results lower but comparable to those reported for English. The error analysis showed that the main problems are: fine grained sense distinctions in plWordNet and limited number of connections between words of different parts of speech. In the second approach plWordNet expanded with the mapping onto the SUMO ontology concepts was used. Two scenarios for WSD were investigated: two step disambiguation and disambiguation based on combined networks of plWordNet and SUMO. In the former scenario, words are first assigned SUMO concepts and next plWordNet senses are disambiguated. In latter, plWordNet and SUMO are combined in one large network used next for the disambiguation of senses. The additional knowledge sources used in WSD improved the performance. The obtained results and potential further lines of developments were discussed.
Published: 2015
Full Text: View/download PDF

50. An Integration Model of Semantic Annotation Based on Synergetic Neural Network.

Author: Huang, Zhehuang and Chen, Yidong
Subjects: SEMANTIC computing, ANNOTATIONS, SYNERGETICS, NEURAL circuitry, NATURAL language processing
Abstract: Correct and automatical semantic analysis has always been one of major goals in natural language understanding. However, due to the difficulties in deep semantic analysis, at present, the mainstream studies of semantic analysis are focused on semantic role labeling (SRL) and word sense disambiguation (WSD). Nowadays, these two issues are mostly considered as separate tasks. However, this approach ignores possible dependencies between them. In order to address the issue, an integrative semantic analysis model based on synergetic neural network (SNN) is proposed in this paper, which can easily express useful logic constraints between SRL and WSD. The semantic analysis process can be viewed as the competition process of semantic order parameters. The strongest order parameter will win by competition and desired semantic patterns will be recognized. There are three main innovations in this paper. First, an integrative semantic analysis model is proposed that jointly models word sense disambiguationand semantic role labeling. Second, integrative order parameter is reconstructed to reflect the relation among semantic patterns. Finally, integrative network parameters and integrative evolution equation are reconstructed, which can reflect the relationship of guiding and driving each other between word sense and semantic roles. The experiment results on OntoNotes 2.0 corpus shows the integrative method in this paper has a higher performance for semantic role labeling and word sense disambiguation, and provides a good practicability and a promising future for other natural language processing tasks. [ABSTRACT FROM AUTHOR]
Published: 2016
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

224 results on '"WSD"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources