224 results on '"WSD"'
Search Results
2. Homograph recognition algorithm based on Euclidean metric
- Author
-
Elisa S. Izrailova, Arslanbek V. Astemirov, Ayshat S. Badaeva, Zelimhan A. Sultanov, Salaudin M. Umarkhadzhiev, Mokhmad-Salekh L. Khekhaev, and Madina L. Yasaeva
- Subjects
graphic homonymy ,homographs ,wsd ,speech synthesis ,chechen language ,low resource languages ,text corpus ,Optics. Light ,QC350-467 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
The problem of resolving the uncertainties associated with homonymy for the Chechen language has become especially relevant after the creation of speech synthesis systems. The main disadvantage of speech synthesizers in the Chechen language are errors in reading homograph words that differ in the length / brevity of vowels — the longitude of such sounds is not displayed in any way when writing. The reproduction of diphthongs, which are indicated on the letter in the same way as monophthongs close to them in sound, causes problems. To improve the quality of synthesized speech in the Chechen language, an automatic homograph recognition program is needed. To solve this problem, the article considers the task of eliminating the ambiguity of the meaning of the words WSD (Word Sense Disambiguation). Algorithmic (supervised) methods based on a pre-marked database have been selected for the Chechen language. These methods are the most common solutions for eliminating the ambiguity of the meaning of words. The implementation of such methods is possible in the presence of large marked-up corpora that are inaccessible to most languages of the world including Chechen. The Chechen language belongs to low-resource languages for which the optimal approach from the point of view of saving labor and time resources is a semi-controlled hybrid method of homograph recognition based on the use of algorithmic and statistical methods. The algorithm created by the authors for recognizing homographs by six adjacent words in a sentence is presented. The method is implemented as a program. Preliminary preparation of the initial data for the operation of the algorithm includes marking of proposals by the values of homographs performed “manually”. The results of the program were evaluated using generally recognized accuracy metrics and amounted to F1 — 39 %, Accuracy — 45 %. A comparative analysis of the data obtained with the results of other methods and models showed that the accuracy of the algorithm presented in this article is closest to the results of the accuracy of algorithms based on the Lesk method. Using Lesk method for English, the results of F1 accuracy were obtained — 41.1 % (simple Lesk) and 51.1 % (extended Lesk). Methods using neural network algorithms provide higher WSD accuracy rates for most languages; however, their implementation requires large data bodies, which is not always available for low-resource languages, including Chechen.
- Published
- 2024
- Full Text
- View/download PDF
3. Contextual word disambiguates of Ge'ez language with homophonic using machine learning
- Author
-
Mequanent Degu Belete, Ayodeji Olalekan Salau, Girma Kassa Alitasb, and Tigist Bezabh
- Subjects
Ge'ez language ,WSD ,Text vectorization ,Machine learning ,Philology. Linguistics ,P1-1091 - Abstract
According to natural language processing experts, there are numerous ambiguous words in languages. Without automated word meaning disambiguation for any language, the development of natural language processing technologies such as information extraction, information retrieval, machine translation, and others are still challenging task. Therfore, this paper presents the development of a word sense disambiguation model for duplicate alphabet words for the Ge'ez language using corpus-based methods. Because there is no wordNet or public dataset for the Ge'ez language, 1010 samples of ambiguous words were gathered. Afterwards, the words were preprocessed and the text was vectorized using bag of words, Term Frequency-Inverse Document Frequency, and word embeddings such as word2vec and fastText. The vectorized texts are then analysed using the supervised machine learning algorithms such Naive Bayes, decision trees, random forests, K-nearest neighbor, linear support vector machine, and logistic regression. Bag of words paired with random forests outperformed all other combinations, with an accuracy of 99.52%. However, when Deep learning algorithms such as Deep neural network and Long Short-Term memory were used for the same dataset, a 100% accuracy was achieved.
- Published
- 2024
- Full Text
- View/download PDF
4. Chinese Financial Comments Sentiment Detection Based on the Bert-TCN Model Based on HowNet Disambiguation
- Author
-
Xia, Linhan, Li, Kan, Editor-in-Chief, Li, Qingyong, Associate Editor, Fournier-Viger, Philippe, Series Editor, Hong, Wei-Chiang, Series Editor, Liang, Xun, Series Editor, Wang, Long, Series Editor, Xu, Xuesong, Series Editor, Chen, Charles, editor, Singh, Satya Narayan, editor, Saxena, Sandeep, editor, and Wheeb, Ali Hussein, editor
- Published
- 2023
- Full Text
- View/download PDF
5. A Hybrid Approach for Sentiment Analysis Using Game Theory in Word Sense Disambiguation
- Author
-
Singhania, Aryan, Gupta, Harsh, Jain, Minni, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Swaroop, Abhishek, editor, Polkowski, Zdzislaw, editor, Correia, Sérgio Duarte, editor, and Virdee, Bal, editor
- Published
- 2023
- Full Text
- View/download PDF
6. An Analysis of Word Sense Disambiguation (WSD)
- Author
-
Nanjundan, Preethi, Mathews, Eappen Zachariah, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Jain, Sarika, editor, Groppe, Sven, editor, and Mihindukulasooriya, Nandana, editor
- Published
- 2023
- Full Text
- View/download PDF
7. DETERMINATION OF LRFD ENVIRONMENTAL LOAD FACTORS OF OFFSHORE PLATFORM IN THE NORTH OF JAVA SEA AND MAKASSAR STRAIT.
- Author
-
Paramashanti, Rildova, Hermanto, Mochammad Fathurridho, and Nandalianadhira, Nafisa
- Subjects
OFFSHORE structures ,STRAITS ,WAVE analysis ,BP Deepwater Horizon Explosion & Oil Spill, 2010 - Abstract
The environmental load factor in commonly used offshore platforms design code, API RP-2A (American Petroleum Institute - Recommended Practice 2A), is developed based on the environmental conditions of American waters, especially the Gulf of Mexico, which have relatively extreme environmental conditions when compared to Indonesian waters. Case studies were conducted to determine environmental load factors in Indonesian waters, particularly the North Java Sea and Makassar Strait, categorized as shallow seas. This analysis was carried out on the performance criteria of pushover failure. In this study, the base shear was analyzed to describe the strength of the structure in the form of a collapse base shear (CBS) and the load in the form of a wave base shear (WBS). CBS was obtained through pushover analysis with yield strength randomness. WBS was obtained through in-place analysis with wave height randomness. This concept was applied to the structure of the Monopod and Braced Monopod types of offshore platforms located in the North of Java Sea and Makassar Strait waters which had been optimized for the WSD and LRFD design methods. The reliability of the structure was analyzed based on the CBS and WBS values using the First Order Reliability Method (FORM) II. The reliability analysis results were in the form of a reliability index (ß). North Java Sea gives a reliability index in the range from 3.58 to 4.38 for every design criteria. While Makassar Strait gives a reliability index in the range from 3.17 to 3.54 for every design criteria. With a high target safety level for the North Java Sea location, a 1.10 environmental load factor is recommended for further offshore structure design. But, for the Makassar Strait location, more studies need to be done to get better environmental load factor recommendations. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
8. A Wide Network Scanning for Discovery of UDP-Based Reflectors in the Nordic Countries
- Author
-
Bjerre, Alexander, Westh, Andreas Philip, Villefrance, Emil, Haque, A S M Farhan Al, Andersen, Jonas Bukrinski, Helgogaard, Lucas K., Anagnostopoulos, Marios, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Reiser, Hans P., editor, and Kyas, Marcel, editor
- Published
- 2022
- Full Text
- View/download PDF
9. The Role of Artificial Neural Network in Word Sense Disambiguation (WSD)—A Survey
- Author
-
Roopa, H. R., Panneer Arockiaraj, S., Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Rathore, Vijay Singh, editor, Sharma, Subhash Chander, editor, Tavares, Joao Manuel R.S., editor, Moreira, Catarina, editor, and Surendiran, B., editor
- Published
- 2022
- Full Text
- View/download PDF
10. A Unified Sense Inventory for Word Sense Disambiguation in Polish
- Author
-
Janz, Arkadiusz, Dziob, Agnieszka, Oleksy, Marcin, Baran, Joanna, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Groen, Derek, editor, de Mulatier, Clélia, editor, Paszynski, Maciej, editor, Krzhizhanovskaya, Valeria V., editor, Dongarra, Jack J., editor, and Sloot, Peter M. A., editor
- Published
- 2022
- Full Text
- View/download PDF
11. Word Sense Disambiguation System for Information Retrieval in Telugu Language
- Author
-
Koppula, Neeraja, Pradeep Kumar, J., Srinivas Rao, Koppula, Kiran Kumar, G., Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Mandal, Jyotsna Kumar, editor, and De, Debashis, editor
- Published
- 2022
- Full Text
- View/download PDF
12. A new rainfall prediction model based on ICEEMDAN-WSD-BiLSTM and ESN.
- Author
-
Zhang, Xianqi, Chen, Haiyang, Wen, Yihao, Shi, Jinwen, and Xiao, Yimeng
- Subjects
PREDICTION models ,HILBERT-Huang transform ,SIGNAL denoising ,REGIONAL development ,FLOOD control ,FLOODS ,DROUGHTS - Abstract
Precipitation, as an important indicator describing the evolution of the regional climate system, plays an important role in understanding the spatial and temporal distribution characteristics of regional precipitation. Scientific and accurate prediction of regional precipitation is helpful to provide theoretical basis for relevant departments to guide flood and drought control. To address the uncertainty and nonlinear characteristics of precipitation series, this paper uses the established improved complete ensemble empirical mode decomposition with adaptive noise (ICEEMDAN)-wavelet signal denoising (WSD)-bi-directional long short-term memory (BiLSTM), and echo state network (ESN) models to predict precipitation of four cities in southern Anhui Province. The BiLSTM is used to predict the high-frequency components and the ESN to predict the low-frequency components, thus avoiding the influence between the two neural network predictions. The results show that the ICEEMDAN-WSD-BiLSTM and ESN models are more accurate. The average relative error reached 2.64% and the NSE (Nash–Sutcliffe efficiency coefficient) was 0.91, which was significantly better than the other four models. The model reveals the temporal change pattern and evolution characteristics of future precipitation, guides flood prevention and mitigation, and has certain theoretical significance and application value for promoting regional sustainable development. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
13. Word Sense Disambiguation for Marathi Language Using WordNet and the Lesk Approach
- Author
-
Kharate, Namrata G., Patil, Varsha H., Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Patil, Varsha H., editor, Dey, Nilanjan, editor, N. Mahalle, Parikshit, editor, Shafi Pathan, Mohd, editor, and Kimbahune, Vinod. V., editor
- Published
- 2021
- Full Text
- View/download PDF
14. Multimodal Web Content Mining to Filter Non-learning Sites Using NLP
- Author
-
Modi, Sangita S., Jagtap, Sudhir B., Xhafa, Fatos, Series Editor, Pandian, A. Pasumpon, editor, Senjyu, Tomonobu, editor, Islam, Syed Mohammed Shamsul, editor, and Wang, Haoxiang, editor
- Published
- 2020
- Full Text
- View/download PDF
15. Assessment of Information Extraction Techniques, Models and Systems.
- Author
-
Rahman, Atta-ur, Musleh, Dhiaa, Nabil, Majed, Alubaidan, Haya, Gollapalli, Mohammed, Krishnasamy, Gomathi, Almoqbil, Dakheel, Khan, Mohammad Aftab Alam, Farooqui, Mehwash, Ahmed, Mohammed Imran Basheer, Ahmed, Mohammed Salih, and Mahmud, Maqsood
- Subjects
DATA mining ,EXTRACTION techniques ,SEARCH engines ,ABSTRACTING & indexing services ,DIGITAL libraries - Abstract
The present article aims to review and evaluate the practiced and classical techniques, tools, models, and systems concerning automatic information extraction (IE) from published scientific documents like research articles, patents, theses, technical reports, and case studies etc. IE is performed for various reasons such as better indexing, archiving, searching, and retrieving. That is mainly used by the search engines and the indexing services as well the digital libraries and semantic web. In this regard, several studies have been conducted targeting various nature of documents. The study pays special consideration to the successful IE models, algorithms and approaches applied to structural IE from published documents. To grasp this, the paper is classified into several segments and each segment covers a significant aspect of IE. Furthermore, to validate their benefits and drawbacks, a comparative study of all the approaches have been conducted in terms of various performance factors like precision, accuracy, recall and F-score. Potential areas of improvement have been emphasized as research gap for the scholars in the closely related areas. Ultimately, a comprehensive summary of the evaluation is presented in tabular form and review is concluded. It was observed that the hybrid methods outperform the other methods due to their versatile nature to address various document formats. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
16. A Lemmatizer Tool for Assamese Language
- Author
-
Roy, Arindam, Sarkar, Sunita, Borkakoty, Hsubhas, Barbosa, Simone Diniz Junqueira, Editorial Board Member, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Kotenko, Igor, Editorial Board Member, Yuan, Junsong, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Mandal, Jyotsna Kumar, editor, Mukhopadhyay, Somnath, editor, Dutta, Paramartha, editor, and Dasgupta, Kousik, editor
- Published
- 2019
- Full Text
- View/download PDF
17. Verifying Usefulness of Algorithms for WordNet Based Similarity Sense Disambiguation
- Author
-
Kukla, Elżbieta, Siemiński, Andrzej, Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, Choroś, Kazimierz, editor, Kopel, Marek, editor, Kukla, Elżbieta, editor, and Siemiński, Andrzej, editor
- Published
- 2019
- Full Text
- View/download PDF
18. Word Sense Disambiguation with Massive Contextual Texts
- Author
-
Liu, Ya-fei, Wei, Jinmao, Hutchison, David, Editorial Board Member, Kanade, Takeo, Editorial Board Member, Kittler, Josef, Editorial Board Member, Kleinberg, Jon M., Editorial Board Member, Mattern, Friedemann, Editorial Board Member, Mitchell, John C., Editorial Board Member, Naor, Moni, Editorial Board Member, Pandu Rangan, C., Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Terzopoulos, Demetri, Editorial Board Member, Tygar, Doug, Editorial Board Member, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Li, Guoliang, editor, Yang, Jun, editor, Gama, Joao, editor, Natwichai, Juggapong, editor, and Tong, Yongxin, editor
- Published
- 2019
- Full Text
- View/download PDF
19. Word Sense Induction with Attentive Context Clustering
- Author
-
Moshe Stekel, Amos Azaria, and Shai Gordin
- Subjects
clustering ,nlp ,wsi ,wsd ,[info.info-ai]computer science [cs]/artificial intelligence [cs.ai] ,History of scholarship and learning. The humanities ,AZ20-999 ,Bibliography. Library science. Information resources - Abstract
This paper presents ACCWSI (Attentive Context Clustering WSI), a method for Word Sense Induction, suitable for languages with limited resources. Pretrained on a small corpus and given an ambiguous word (a query word) and a set of excerpts that contain it, ACCWSI uses an attention mechanism for generating context-aware embeddings, distinguishing between the different senses assigned to the query word. These embeddings are then clustered to provide groups of main common uses of the query word. We show that ACCWSI performs well on the SemEval-2 2010 WSI task. ACCWSI also demonstrates practical applicability for shedding light on the meanings of ambiguous words in ancient languages, such as Classical Hebrew and Akkadian. In the near future, we intend to turn ACCWSI into a practical tool for linguists and historians.
- Published
- 2022
- Full Text
- View/download PDF
20. Practice of Word Sense Disambiguation
- Author
-
Siemiński, Andrzej, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Nguyen, Ngoc Thanh, editor, Hoang, Duong Hung, editor, Hong, Tzung-Pei, editor, Pham, Hoang, editor, and Trawiński, Bogdan, editor
- Published
- 2018
- Full Text
- View/download PDF
21. Word Sense Disambiguation Using IndoWordNet
- Author
-
Bhingardive, Sudha, Bhattacharyya, Pushpak, Dash, Niladri Sekhar, editor, Bhattacharyya, Pushpak, editor, and Pawar, Jyoti D., editor
- Published
- 2017
- Full Text
- View/download PDF
22. Assamese Word Sense Disambiguation using Cuckoo Search Algorithm.
- Author
-
Gogoi, Arjun, Baruah, Nomi, and Nath, Lakhya Jyoti
- Subjects
SEARCH algorithms ,MACHINE translating ,ALGORITHMS ,NATURAL languages ,TABU search algorithm ,PROBLEM solving ,NATURAL language processing - Abstract
Natural language processing is associated with human-computer interaction, where several challenges require natural language understanding. The Word sense disambiguation problem comprises the computational assignment of meaning to a word according to a specific context in which it occurs. There are numerous natural language processing applications, such as machine translation, information retrieval, and information extraction, which require this task which takes place at the semantic level. To solve this problem unsupervised computation proposals can be effective since they have been successfully used for many real-world optimization problems. In this paper, we propose to solve the word sense disambiguation problem using the cuckoo search algorithm in the Assamese language. We illustrate the performance of our algorithm by carrying out experiments on an Assamese corpus. And comparing them against an unsupervised genetic algorithm that is implemented in the Assamese language. Results of the experiment show that the cuckoo algorithm can achieve more precision, recall and F-measure, attaining 87.5, 84, and 85.71 percentages respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
23. Arabic Gloss WSD Using BERT.
- Author
-
El-Razzaz, Mohammed, Fakhr, Mohamed Waleed, Maghraby, Fahima A., and Prati, Andrea
- Subjects
VOCABULARY ,CORPORA - Abstract
Word Sense Disambiguation (WSD) aims to predict the correct sense of a word given its context. This problem is of extreme importance in Arabic, as written words can be highly ambiguous; 43% of diacritized words have multiple interpretations and the percentage increases to 72% for non-diacritized words. Nevertheless, most Arabic written text does not have diacritical marks. Gloss-based WSD methods measure the semantic similarity or the overlap between the context of a target word that needs to be disambiguated and the dictionary definition of that word (gloss of the word). Arabic gloss WSD suffers from a lack of context-gloss datasets. In this paper, we present an Arabic gloss-based WSD technique. We utilize the celebrated Bidirectional Encoder Representation from Transformers (BERT) to build two models that can efficiently perform Arabic WSD. These models can be trained with few training samples since they utilize BERT models that were pretrained on a large Arabic corpus. Our experimental results show that our models outperform two of the most recent gloss-based WSDs when we test them against the same test data used to evaluate our model. Additionally, our model achieves an F1-score of 89% compared to the best-reported F1-score of 85% for knowledge-based Arabic WSD. Another contribution of this paper is introducing a context-gloss benchmark that may help to overcome the lack of a standardized benchmark for Arabic gloss-based WSD. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
24. Word sense disambiguation based on stretchable matching of the semantic template.
- Author
-
Wang, Wei, Huang, Degen, and Yu, Haitao
- Subjects
- *
NATURAL language processing , *VARIATION in language , *NATURAL languages , *VOCABULARY - Abstract
It is evident that the traditional hard matching of a fixed-length template cannot satisfy the nearly indefinite variations in natural language. This issue mainly results from three major problems of the traditional matching mode: 1) in matching with a short template, the context of natural language cannot be effectively captured; 2) in matching with a long template, serious data sparsity will lead to a low success rate of template matching (i.e., low recall); and 3) due to a lack of flexible matching ability, traditional hard matching is more prone to failure. Therefore, this paper proposed a novel method of stretchable matching of the semantic template (SMOST) to deal with the above problems. We have applied this method to word sense disambiguation in the natural language processing field. In the same case of using only the SemCor corpus, the result of our system is very close to the best result of existing systems, which shows the effectiveness of new proposed method. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
25. Dataset of white spot disease affected shrimp farmers disaggregated by the variables of farm site, environment, disease history, operational practices, and saline zones
- Author
-
Neaz A. Hasan and Mohammad Mahfujul Haque
- Subjects
Disaggregated data ,Shrimp farming ,Risk factors ,WSD ,Bangladesh ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Science (General) ,Q1-390 - Abstract
The article presents the summary of a dataset related to the risks factors of white spot disease (WSD) of farmed shrimp (Penaeus monodon) in Khulna, Bagerhat and Satkhira districts of Bangladesh. This dataset was developed following two consecutive steps. In the first step, participatory rural appraisal tools were applied to get the conceptual framework for data collection regarding lists of farmers and the variables of the risk factors of WSD. In the second step, sampling of farmers, google featured questionnaire development, and mobile phone-assisted survey were carried out. The total surveyed farms were 233 consisting of 21 and 212 semi-intensive and extensive farms, respectively. The data were collected in the form of continuous, nominal and binary variables disaggregated by saline zones. The dataset contains some basic socio-economic data of shrimp farmers, farm characteristics, environmental attributes and disease history of shrimp farms. The dataset also has GPS coordinates of all the surveyed farms individually which are very useful for spatial analysis. In total, the dataset in MS Excel has 46 variables and attached as the supplementary material with this article.
- Published
- 2020
- Full Text
- View/download PDF
26. Evaluation of Disambiguation Strategies on Biomedical Text Categorization
- Author
-
Rais, Mohammed, Lachkar, Abdelmonaime, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Ortuño, Francisco, editor, and Rojas, Ignacio, editor
- Published
- 2016
- Full Text
- View/download PDF
27. Multilingual Projections
- Author
-
Bhattacharyya, Pushpak, Ide, Nancy, Series editor, Gala, Núria, editor, Rapp, Reinhard, editor, and Bel-Enguix, Gemma, editor
- Published
- 2015
- Full Text
- View/download PDF
28. DURHAM : a word sense disambiguation system
- Author
-
Hawkins, Paul Martin
- Subjects
005 ,SEMCOR ,WSD - Abstract
Ever since the 1950's when Machine Translation first began to be developed, word sense disambiguation (WSD) has been considered a problem to developers. In more recent times, all NLP tasks which are sensitive to lexical semantics potentially benefit from WSD although to what extent is largely unknown. The thesis presents a novel approach to the task of WSD on a large scale. In particular a novel knowledge source is presented named contextual information. This knowledge source adopts a sub-symbolic training mechanism to learn information from the context of a sentence which is able to aid disambiguation. The system also takes advantage of frequency information and these two knowledge sources are combined. The system is trained and tested on SEMCOR. A novel disambiguation algorithm is also developed. The algorithm must tackle the problem of a large possible number of sense combinations in a sentence. The algorithm presented aims to make an appropriate choice between accuracy and efficiency. This is performed by directing the search at a word level. The performance achieved on SEMCOR is reported and an analysis of the various components of the system is performed. The results achieved on this test data are pleasing, but are difficult to compare with most of the other work carried out in the field. For this reason the system took part in the SENSEVAL evaluation which provided an excellent opportunity to extensively compare WSD systems. SENSEVAL is a small scale WSD evaluation using the HECTOR lexicon. Despite this, few adaptations to the system were required. The performance of the system on the SENSEVAL task are reported and have also been presented in [Hawkins, 2000].
- Published
- 1999
29. Improving stemming for Assamese information retrieval
- Author
-
Gogoi, Arjun, Baruah, Nomi, Sarma, Sikhar Kr., and Phukan, Rakhee D.
- Published
- 2021
- Full Text
- View/download PDF
30. Disambiguation of Biomedical Acronyms Based on a Bidirectional Recurrent Neural Network of Character-level Features.
- Author
-
Ren Kai, Li Na, Xiong Wei, and Wang Shi-Wen
- Subjects
- *
RECURRENT neural networks , *ACRONYMS , *MODEL railroads - Abstract
Polysemic acronyms are very common in the field of biomedicine. These acronyms have different senses in different contexts. The ambiguity of acronyms may cause significant negative impact on the understanding of the full text by machine learning. To address the disambiguation of acronyms in the biomedical domain, most associated studies are based on methods using word-level contextual features. These methods require abundant relevant external resources for model training, and the accuracy of their disambiguation of acronyms may decrease greatly upon the lack of external resources. In this study, disambiguation of biomedical acronyms was investigated on the basis of the character-level feature model to realize the disambiguation of biomedical acronyms with largely limited external corpora. First, sentences containing ambiguous acronyms were extracted through retrieval and the feature vector of the context were initialized by using the character-level features. Second, these initial vectors were input into the bidirectional long shortterm memory neutral network model for training. Finally, the disambiguation of acronyms was realized by the outputs of the neutral network model through the Softmax classification approach. The results of acronym disambiguation based on character-level feature model were also compared with those based on word-level feature models. Results demonstrate that the average accuracy of the character-level feature neutral network algorithm reaches 85.82% on the dataset of 106 common biomedical acronyms. Thus, the character-level feature neutral network algorithm is superior to the traditional methods, which use a large number of external resources. This study confirms that the disambiguation method based on character-level features is applicable to the disambiguation of biomedical acronyms under limited relevant data. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
31. Arabic Gloss WSD Using BERT
- Author
-
Mohammed El-Razzaz, Mohamed Waleed Fakhr, and Fahima A. Maghraby
- Subjects
WSD ,BERT ,Arabic ,context gloss ,Technology ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Biology (General) ,QH301-705.5 ,Physics ,QC1-999 ,Chemistry ,QD1-999 - Abstract
Word Sense Disambiguation (WSD) aims to predict the correct sense of a word given its context. This problem is of extreme importance in Arabic, as written words can be highly ambiguous; 43% of diacritized words have multiple interpretations and the percentage increases to 72% for non-diacritized words. Nevertheless, most Arabic written text does not have diacritical marks. Gloss-based WSD methods measure the semantic similarity or the overlap between the context of a target word that needs to be disambiguated and the dictionary definition of that word (gloss of the word). Arabic gloss WSD suffers from a lack of context-gloss datasets. In this paper, we present an Arabic gloss-based WSD technique. We utilize the celebrated Bidirectional Encoder Representation from Transformers (BERT) to build two models that can efficiently perform Arabic WSD. These models can be trained with few training samples since they utilize BERT models that were pretrained on a large Arabic corpus. Our experimental results show that our models outperform two of the most recent gloss-based WSDs when we test them against the same test data used to evaluate our model. Additionally, our model achieves an F1-score of 89% compared to the best-reported F1-score of 85% for knowledge-based Arabic WSD. Another contribution of this paper is introducing a context-gloss benchmark that may help to overcome the lack of a standardized benchmark for Arabic gloss-based WSD.
- Published
- 2021
- Full Text
- View/download PDF
32. A Linguistic Approach to Opinion Mining
- Author
-
Tuveri, Franco, Angioni, Manuela, Lai, Cristian, editor, Semeraro, Giovanni, editor, and Vargiu, Eloisa, editor
- Published
- 2013
- Full Text
- View/download PDF
33. Semantic Rule Filtering for Web-Scale Relation Extraction
- Author
-
Moro, Andrea, Li, Hong, Krause, Sebastian, Xu, Feiyu, Navigli, Roberto, Uszkoreit, Hans, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Alani, Harith, editor, Kagal, Lalana, editor, Fokoue, Achille, editor, Groth, Paul, editor, Biemann, Chris, editor, Parreira, Josiane Xavier, editor, Aroyo, Lora, editor, Noy, Natasha, editor, Welty, Chris, editor, and Janowicz, Krzysztof, editor
- Published
- 2013
- Full Text
- View/download PDF
34. WSD algorithm based on a new method of vector-word contexts proximity calculation via epsilon-filtration
- Author
-
Andrew Krizhanovsky, Alexander Kirillov, and Natalia Krizhanovskaya
- Subjects
synonym ,synset ,corpus linguistics ,word2vec ,wikisource ,wsd ,rusvectores ,wiktionary ,Science - Abstract
The problem of word sense disambiguation (WSD) is considered in the article. Set of synonyms (synsets) and sentences with these synonyms are taken. It is necessary to automatically select the meaning of the word in the sentence. 1285 sentences were tagged by experts, namely, one of the dictionary meanings was selected by experts for target words. To solve the WSD problem, an algorithm based on a new method of vector-word contexts proximity calculation is proposed. A preliminary epsilon-filtering of words is performed, both in the sentence and in the set of synonyms, in order to achieve higher accuracy. An extensive program of experiments was carried out. Four algorithms are implemented, including the new algorithm. Experiments have shown that in some cases the new algorithm produces better results. The developed software and the tagged corpus have an open license and are available online. Wiktionary and Wikisource are used. A brief description of this work can be viewed as slides (https://goo.gl/9ak6Gt). A video lecture in Russian about this research is available online (https://youtu.be/-DLmRkepf58).
- Published
- 2018
- Full Text
- View/download PDF
35. Annotating Words Using WordNet Semantic Glosses
- Author
-
Szymański, Julian, Duch, Włodzisław, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Huang, Tingwen, editor, Zeng, Zhigang, editor, Li, Chuandong, editor, and Leung, Chi Sing, editor
- Published
- 2012
- Full Text
- View/download PDF
36. A Cognitive Approach to Word Sense Disambiguation
- Author
-
Dutta, Sudakshina, Basu, Anupam, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, and Gelbukh, Alexander, editor
- Published
- 2012
- Full Text
- View/download PDF
37. An Automatic Service Classification Approach
- Author
-
Zhao, Haiyan, Chen, Qingkui, Kacprzyk, Janusz, editor, Wang, Yinglin, editor, and Li, Tianrui, editor
- Published
- 2011
- Full Text
- View/download PDF
38. Automatic Word Sense Disambiguation and Construction Identification Based on Corpus Multilevel Annotation
- Author
-
Lyashevskaya, Olga, Mitrofanova, Olga, Grachkova, Maria, Romanov, Sergey, Shimorina, Anastasia, Shurygina, Alexandra, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, Habernal, Ivan, editor, and Matoušek, Václav, editor
- Published
- 2011
- Full Text
- View/download PDF
39. Evaluating Tagsets for Sanskrit
- Author
-
Gopal, Madhav, Mishra, Diwakar, Singh, Devi Priyanka, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, and Jha, Girish Nath, editor
- Published
- 2010
- Full Text
- View/download PDF
40. Selecting an Appropriate Web-Scale Discovery Service: A Study of the Big 4's.
- Author
-
Kumar, Vinit
- Subjects
- *
FEDERATED searching , *MARKETPLACES , *USER interfaces , *ALGORITHMS - Abstract
Seeing the dynamic user demands the libraries are changing their approach to the user services. The web-scale discovery (WSD) service is the latest attempt in this direction. There are several players in the marketplace providing solutions for WSD with products having basic features and subtle features as well. As more players are entering the marketplace, it becomes challenging to select appropriate WSD system. It is also paramount for the library managers to be aware of the myriad range of features and underlying technology of WSD. This understanding will be for making informed purchase decisions. This paper attempts to explain in detail the components of a typical WSD system. Further, the paper evaluates the features of the Big 4's in WSD. The paper concludes by discussing some of the parameters to consider while evaluating the WSD system. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
41. Statistical Word Sense Disambiguation in Contexts for Russian Nouns Denoting Physical Objects
- Author
-
Mitrofanova, Olga, Lashevskaya, Olga, Panicheva, Polina, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Sojka, Petr, editor, Horák, Aleš, editor, Kopeček, Ivan, editor, and Pala, Karel, editor
- Published
- 2008
- Full Text
- View/download PDF
42. Micro and nanometric wear evaluation of metal discs used on determination of biodiesel fuel lubricity
- Author
-
Aline Cristina Mendes de Farias, João Telésforo Nóbrega de Medeiros, and Salete Martins Alves
- Subjects
AFM ,WSD ,roughness ,biodiesel ,lubricity ,HFRR ,Materials of engineering and construction. Mechanics of materials ,TA401-492 - Abstract
The contact of diesel fuel with engine subsystems demands a good wear resistance. Lubricity is an important feature for integrity of injection system and the sulphur composites are primarily responsible for lubrication of the injector nozzle. Biodiesel is responsible for partially restoring the lubricity of diesel fuel that presents low levels of sulphur composites and, furthermore, it causes less pollution than diesel fuel. The lubricity is measured through the wear scar diameter following the ASTM D 975 standards. However, the friction and wear with light loads of micro/nanocomponents are highly dependent on surface interactions that can be evaluated by microscopy techniques. This study aimed to measure and to analyze the biodiesel lubricity and their blends (B5, B20) with diesel by observing the wear scars of discs using the scanning electronic microscopy (SEM), atomic force microscopy (AFM) and micro roughness techniques. The fuels performance was evaluated using HFRR tribometer. The tests conditions were based on standard ADTM D-6079-04. The coefficient of friction was measure during the test. After the test, the worn ball and disc were analyzed by SEM, AFM and profilometer. The results showed that the addition of biodiesel in diesel improve the tribological performance of fuel. Also, the just WSD value is not sufficient to evaluate the lubrication ability of a fuel. Analysis of the worn disc surfaces proved to be compatible with WSD number and also more sensitive to these kinds of fuels, showing mainly the form and intensity of the wear.
- Published
- 2014
43. PHORA: A NLP System for Spanish
- Author
-
Palomar, Manuel, Saiz-Noeda, Maximiliano, Muñoz, Rafael, Suárez, Armando, Martínez-Barco, Patricio, Montoyo, Andrés, and Gelbukh, Alexander, editor
- Published
- 2001
- Full Text
- View/download PDF
44. MODELING SEMANTIC DISTANCE IN THE PATTERN DICTIONARY OF ENGLISH VERBS.
- Author
-
CINKOVÁ, SILVIE and HLÁVKA, ZDENĚK
- Subjects
- *
SEMANTICS , *VERBS , *LAMMA language , *KWIC (Indexing system) , *ENCYCLOPEDIAS & dictionaries - Abstract
We explore human judgments on how well individual patterns of 29 target verbs from the Pattern Dictionary of English Verbs describe their random KWICs. We focus on cases where more than one pattern is judged as highly appropriate for a given KWIC and seek to estimate the effect of event participants (arguments) being denotatively similar in two patterns, considering all pair combinations in a given lemma. We compare this effect to the effect of several contextual features of the KWICs, the effect of paired PDEV implicatures implying each other, and the effect of belonging to a given lemma. We show that the lemma effect is still stronger than any feature going across lemmas we have examined so far, so that each verb appears to be a little universe in its own right. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
45. Monitoring of low voltage grids with the use of SAIDI indexes.
- Author
-
ŁUKASIK, Zbigniew, KOZYRA, Jacek, and KUŚMIŃSKA-FIJAŁKOWSKA, Aldona
- Subjects
LOW voltage systems ,ELECTRIC power distribution grids ,ELECTRIC utility costs ,ELECTRIC power production ,ELECTRIC power consumption - Abstract
Copyright of Przegląd Elektrotechniczny is the property of Przeglad Elektrotechniczny and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2017
- Full Text
- View/download PDF
46. Generating the missing links for semantic relations within Wiktionary.
- Author
-
Bawakid, Abdullah
- Subjects
- *
SEMANTICS , *COMPARATIVE linguistics - Abstract
In many cases, a single presentation of a term may carry multiple meanings. Wiktionary provides a way for viewing the meanings of the different terms it stores in the form of senses. It also provides semantic relations. However, Wiktionary, in its current form, contains semantic relations linking Wiktionary entries at the term level. Links for semantic relations connecting entries at the word sense level do not currently exist in Wiktionary. In this paper, we propose a novel method for generating a new type of links for semantic relations within Wiktionary. This is effectively applied to aligning the source words senses for semantic relations in Wiktionary with their corresponding target word senses. We use surface-level features that rely only on the structure and content of Wiktionary for completing this task without the aid of any external lexical or knowledge bases. We present the details of the method and how it was implemented. Additionally, we describe the evaluations that we performed and illustrate the competitive results we obtained, especially when compared to other systems. Our findings indicate that our system outperforms the baselines and performs similar to state-of-art systems without requiring access to external online resources or training data to run. [ABSTRACT FROM AUTHOR]
- Published
- 2017
47. Word Sense Disambiguation using Aggregated Similarity based on WordNet Graph Representation
- Author
-
Mădălina ZURINI
- Subjects
WSD ,Similarity Measure ,WordNet ,Ontology ,Synset ,Computer engineering. Computer hardware ,TK7885-7895 ,Bibliography. Library science. Information resources - Abstract
The term of word sense disambiguation, WSD, is introduced in the context of text document processing. A knowledge based approach is conducted using WordNet lexical ontology, describing its structure and components used for the process of identification of context related senses of each polysemy words. The principal distance measures using the graph associated to WordNet are presented, analyzing their advantages and disadvantages. A general model for aggregation of distances and probabilities is proposed and implemented in an application in order to detect the context senses of each word. For the non-existing words from WordNet, a similarity measure is used based on probabilities of co-occurrences. The module of WSD is proposed for integration in the step of processing documents such as supervised and unsupervised classification in order to maximize the correctness of the classification. Future work is related to the implementation of different domain oriented ontologies.
- Published
- 2013
- Full Text
- View/download PDF
48. СИНДРОМ БЕЛОГО ПЯТНА У КРЕВЕТОК (WSD). ВЛИЯНИЕ ТЕМПЕРАТУРЫ ВОДЫ НА АКТИВНОСТЬ ВИРУСА. ИСПЫТАНИЯ ПО РАЗРАБОТКЕ ВАКЦИНЫ
- Subjects
white spot disease ,viral diseases of crustaceans ,Whispovirus ,WSD ,shrimp ,синдром белого пятна ,вирусные болезни ракообразных ,креветка - Abstract
В статье рассматривается синдром белого пятна креветок WSD, вызываемый двуцепочечным ДНК-вирусом семейства Nimaviridae рода Whispovirus, WSSV. Этот вид вируса вызывает повсеместные вспышки данного заболевания, наносящего огромный экономический ущерб хозяйствам по разведению креветок. Попадая в организм, вирус ассоциируется в тканях экто– и мезодермального происхождения, провоцируя возникновение покраснений под хитиновым покровом тела, а затем образование белых пятен по самому покрову. В случае возникновения эпидемии смертность ракообразных составляет 100%. Представляется необходимым изучить возможные способы лечения и профилактики распространения данного вируса для обеспечения полного функционирования креветочных ферм и снижения экономических потерь для производителя., The article studies the white spot disease in shrimp WSD, caused by the double-stranded DNA virus of the Nimaviridae family of the Whispovirus genus, WSSV. This type of virus causes widespread outbreaks of the disease, leading to enormous economic damage to shrimp farming. Once in the body, the virus manifests in the tissues of ectodermal and mesodermic origin, causing redness under the chitin cover of the integument, and white spots on the cover itself. In the event of an epidemic, the mortality of crustaceans would be 100%. It deems to be necessary to study treatment and prevention of the virus spread in order to ensure the full functioning of shrimp farms and to reduce the economic losses to the farmer., Международный научно-исследовательский журнал, Выпуск 6 (120) 2022, Pages 40-45
- Published
- 2022
- Full Text
- View/download PDF
49. Word Sense Disambiguation Based on Large Scale Polish CLARIN Heterogeneous Lexical Resources
- Author
-
Paweł Kędzia, Maciej Piasecki, and Marlena Orlińska
- Subjects
word sense disambiguation ,WSD ,page rank ,plWordNet ,graphs ,lexical resources ,Computational linguistics. Natural language processing ,P98-98.5 ,Semantics ,P325-325.5 ,Lexicography ,P327-327.5 - Abstract
Word Sense Disambiguation Based on Large Scale Polish CLARIN Heterogeneous Lexical Resources Lexical resources can be applied in many different Natural Language Engineering tasks, but the most fundamental task is the recognition of word senses used in text contexts. The problem is difficult, not yet fully solved and different lexical resources provided varied support for it. Polish CLARIN lexical semantic resources are based on the plWordNet — a very large wordnet for Polish — as a central structure which is a basis for linking together several resources of different types. In this paper, several Word Sense Disambiguation (henceforth WSD) methods developed for Polish that utilise plWordNet are discussed. Textual sense descriptions in the traditional lexicon can be compared with text contexts using Lesk’s algorithm in order to find best matching senses. In the case of a wordnet, lexico-semantic relations provide the main description of word senses. Thus, first, we adapted and applied to Polish a WSD method based on the Page Rank. According to it, text words are mapped on their senses in the plWordNet graph and Page Rank algorithm is run to find senses with the highest scores. The method presents results lower but comparable to those reported for English. The error analysis showed that the main problems are: fine grained sense distinctions in plWordNet and limited number of connections between words of different parts of speech. In the second approach plWordNet expanded with the mapping onto the SUMO ontology concepts was used. Two scenarios for WSD were investigated: two step disambiguation and disambiguation based on combined networks of plWordNet and SUMO. In the former scenario, words are first assigned SUMO concepts and next plWordNet senses are disambiguated. In latter, plWordNet and SUMO are combined in one large network used next for the disambiguation of senses. The additional knowledge sources used in WSD improved the performance. The obtained results and potential further lines of developments were discussed.
- Published
- 2015
- Full Text
- View/download PDF
50. An Integration Model of Semantic Annotation Based on Synergetic Neural Network.
- Author
-
Huang, Zhehuang and Chen, Yidong
- Subjects
SEMANTIC computing ,ANNOTATIONS ,SYNERGETICS ,NEURAL circuitry ,NATURAL language processing - Abstract
Correct and automatical semantic analysis has always been one of major goals in natural language understanding. However, due to the difficulties in deep semantic analysis, at present, the mainstream studies of semantic analysis are focused on semantic role labeling (SRL) and word sense disambiguation (WSD). Nowadays, these two issues are mostly considered as separate tasks. However, this approach ignores possible dependencies between them. In order to address the issue, an integrative semantic analysis model based on synergetic neural network (SNN) is proposed in this paper, which can easily express useful logic constraints between SRL and WSD. The semantic analysis process can be viewed as the competition process of semantic order parameters. The strongest order parameter will win by competition and desired semantic patterns will be recognized. There are three main innovations in this paper. First, an integrative semantic analysis model is proposed that jointly models word sense disambiguationand semantic role labeling. Second, integrative order parameter is reconstructed to reflect the relation among semantic patterns. Finally, integrative network parameters and integrative evolution equation are reconstructed, which can reflect the relationship of guiding and driving each other between word sense and semantic roles. The experiment results on OntoNotes 2.0 corpus shows the integrative method in this paper has a higher performance for semantic role labeling and word sense disambiguation, and provides a good practicability and a promising future for other natural language processing tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.