Descriptor: "attention-based BiLSTM" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"attention-based BiLSTM"' showing total 6 results

Start Over Descriptor "attention-based BiLSTM"

6 results on '"attention-based BiLSTM"'

1. A hybrid transformer and attention based recurrent neural network for robust and interpretable sentiment analysis of tweets

Author: Md Abrar Jahin, Md Sakib Hossain Shovon, M. F. Mridha, Md Rashedul Islam, and Yutaka Watanobe
Subjects: RoBERTa Transformer, Attention-based BiLSTM, Unsupervised Labeling, Tweet Sentiment Analysis, XAI, SHAP, Medicine, Science
Abstract: Abstract Sentiment analysis is a pivotal tool in understanding public opinion, consumer behavior, and social trends, underpinning applications ranging from market research to political analysis. However, existing sentiment analysis models frequently encounter challenges related to linguistic diversity, model generalizability, explainability, and limited availability of labeled datasets. To address these shortcomings, we propose the Transformer and Attention-based Bidirectional LSTM for Sentiment Analysis (TRABSA) model, a novel hybrid sentiment analysis framework that integrates transformer-based architecture, attention mechanism, and recurrent neural networks like BiLSTM. The TRABSA model leverages the powerful RoBERTa-based transformer model for initial feature extraction, capturing complex linguistic nuances from a vast corpus of tweets. This is followed by an attention mechanism that highlights the most informative parts of the text, enhancing the model’s focus on critical sentiment-bearing elements. Finally, the BiLSTM networks process these refined features, capturing temporal dependencies and improving the overall sentiment classification into positive, neutral, and negative classes. Leveraging the latest RoBERTa-based transformer model trained on a vast corpus of 124M tweets, our research bridges existing gaps in sentiment analysis benchmarks, ensuring state-of-the-art accuracy and relevance. Furthermore, we contribute to data diversity by augmenting existing datasets with 411,885 tweets from 32 English-speaking countries and 7,500 tweets from various US states. This study also compares six word-embedding techniques, identifying the most robust preprocessing and embedding methodologies crucial for accurate sentiment analysis and model performance. We meticulously label tweets into positive, neutral, and negative classes using three distinct lexicon-based approaches and select the best one, ensuring optimal sentiment analysis outcomes and model efficacy. Here, we demonstrate that the TRABSA model outperforms the current seven traditional machine learning models, four stacking models, and four hybrid deep learning models, yielding notable gain in accuracy (94%) and effectiveness with a macro average precision of 94%, recall of 93%, and F1-score of 94%. Our further evaluation involves two extended and four external datasets, demonstrating the model’s consistent superiority, robustness, and generalizability across diverse contexts and datasets. Finally, by conducting a thorough study with SHAP and LIME explainable visualization approaches, we offer insights into the interpretability of the TRABSA model, improving comprehension and confidence in the model’s predictions. Our study results make it easier to analyze how citizens respond to resources and events during pandemics since they are integrated into a decision-support system. Applications of this system provide essential assistance for efficient pandemic management, such as resource planning, crowd control, policy formation, vaccination tactics, and quick reaction programs.
Published: 2024
Full Text: View/download PDF

2. Extraction of data events from the computational biology literature

Author: Albahlal, Manal, Nenadic, Goran, and Stevens, Robert
Subjects: microarray analysis literature, Machine learning, text mining, natural language processing, entity linking, normalisation, named entity recognition, attention-based BiLSTM, relation extraction, database, BioBERT, Methods section, workflow, tools, software, operation, process, data, data events, methods
Abstract: With the current rate of research activities, it is widely accepted that scientists face a challenge of keeping up-to-date with new findings, even within a sub-field of a discipline. This difficulty extends to methods that have been used in the research. Understanding reported methods gives us confidence that the findings have resulted from an appropriate, rigorous and sound scientific process. However, the modern dynamic of science is also characterised with ever-changing methods, so scientists need to be able to learn about new ones and identify the common or most appropriate methods to use in a given situation. One of the best sources of information about methods is the scientific literature. In this thesis, we developed a computational model to automatically represent the text that describes reported methods as an abstract method workflow. We focus on computational sciences, which centre on data processing. Specifically, we consider data events as a representation of processes and changes that happen to data. A data event contains the main components of each step in computational experiments, such as input/output data, processes and operations on data, databases where the data is stored and software and tools that are used in these processes. An abstract method workflow then models relationships between data events, ordering them in a way that represents the methodology as reported in the literature. This thesis introduces ODNoRFlow, a text mining method that extracts and represents an abstract method workflow from a Methods section of a publication. It relies on a hybrid text mining approach (ODNoR) that combines machine learning and a rule-based method to recognise data event components, normalise them to existing ontologies and identify the links and relations between them. Specifically, we fine-tuned a pre-trained transformer model (BioBERT) to extract mentions of data and operations, and used an existing named entity recognition system (bioNerDS) to extract software and database mentions. Mentions were normalised to the EDAM ontology. We used a combination of syntactic rules and a pre-trained attention-based BiLSTM model to identify relations and links between components, and considered whether an automated discourse analysis tool can be used to improve the outcomes. We used the microarray analysis literature as a case study to demonstrate the feasibility of the proposed approaches. At the data event level, the approach achieved F-scores for the identification and normalisation of components between 78% (for data) and 92% (for operations), whereas the relationship extraction F-scores were between 62% and 92.5%. At the workflow level, we manually analysed automatically reconstructed workflows from 25 papers, with the F-score between 61% and 93.5%. We also applied ODNoRFlow to a large corpus of the microarray analysis literature to identify and analyse the distribution of data events components, the differences in their usage and the associations between them. Overall, the thesis provides a new computational framework that contributes to the automated extraction, representation and analysis of methods used in the computational biology literature.
Published: 2022

3. A hybrid transformer and attention based recurrent neural network for robust and interpretable sentiment analysis of tweets

Author: Jahin, Md Abrar, Shovon, Md Sakib Hossain, Mridha, M. F., Islam, Md Rashedul, and Watanobe, Yutaka
Published: 2024
Full Text: View/download PDF

4. A Multi-modal Time Series Intelligent Prediction Model

Author: Xian, Qingyu, Liang, Wenxuan, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Qian, Zhihong, editor, Jabbar, M.A., editor, and Li, Xiaolong, editor
Published: 2022
Full Text: View/download PDF

5. A cognitive brain model for multimodal sentiment analysis based on attention neural networks.

Author: Li, Yuanqing, Zhang, Ke, Wang, Jingyu, and Gao, Xinbo
Subjects: *SENTIMENT analysis, *LIMBIC system, *RANDOM forest algorithms, *COGNITIVE science, *ARTIFICIAL intelligence
Abstract: Multimodal sentiment analysis is one of the most attractive interdisciplinary research topics in artificial intelligence (AI). Different from other classification issues, multimodal sentiment analysis of human is a much finer classification problem. However, most current work accept all multimodalities as the input together and then output final results at one time after fusion and decision processes. Rare models try to divide their models into more than one fusion modules with different fusion strategies for better adaption of different tasks. Additionally, most recent multimodal sentiment analysis methods pay great focuses on binary classification, but the accuracy of multi-classification still remains difficult to improve. Inspired by the emotional processing procedure in cognitive science, both binary and multi-classification abilities are improved in our method by dividing the complicated problem into smaller issues which are easier to be handled. In this paper, we propose a Hierarchal Attention-BiLSTM (Bidirectional Long-Short Term Memory) model based on Cognitive Brain limbic system (HALCB). HALCB splits the multimodal sentiment analysis into two modules responsible for two tasks, the binary classification and the multi-classification. The former module divides the input items into two categories by recognizing their polarity and then sends them to the latter module separately. In this module, Hash algorithm is utilized to improve the retrieve accuracy and speed. Correspondingly, the latter module contains a positive sub-net dedicated for positive inputs and a negative sub-nets dedicated for negative inputs. Each of these binary module and two sub-nets in multi-classification module possesses different fusion strategy and decision layer for matching its respective function. We also add a random forest at the final link to collect outputs from all modules and fuse them at the decision-level at last. Experiments are conducted on three datasets and compare the results with baselines on both binary classification and multi-classification tasks. Our experimental results surpass the state-of-the-art multimodal sentiment analysis methods on both binary and multi-classification by a big margin. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

6. A novel approach to fake news classification using LSTM-based deep learning models.

Author: Padalko H, Chomko V, and Chumachenko D
Abstract: The rapid dissemination of information has been accompanied by the proliferation of fake news, posing significant challenges in discerning authentic news from fabricated narratives. This study addresses the urgent need for effective fake news detection mechanisms. The spread of fake news on digital platforms has necessitated the development of sophisticated tools for accurate detection and classification. Deep learning models, particularly Bi-LSTM and attention-based Bi-LSTM architectures, have shown promise in tackling this issue. This research utilized Bi-LSTM and attention-based Bi-LSTM models, integrating an attention mechanism to assess the significance of different parts of the input data. The models were trained on an 80% subset of the data and tested on the remaining 20%, employing comprehensive evaluation metrics including Recall, Precision, F1-Score, Accuracy, and Loss. Comparative analysis with existing models revealed the superior efficacy of the proposed architectures. The attention-based Bi-LSTM model demonstrated remarkable proficiency, outperforming other models in terms of accuracy (97.66%) and other key metrics. The study highlighted the potential of integrating advanced deep learning techniques in fake news detection. The proposed models set new standards in the field, offering effective tools for combating misinformation. Limitations such as data dependency, potential for overfitting, and language and context specificity were acknowledged. The research underscores the importance of leveraging cutting-edge deep learning methodologies, particularly attention mechanisms, in fake news identification. The innovative models presented pave the way for more robust solutions to counter misinformation, thereby preserving the veracity of digital information. Future research should focus on enhancing data diversity, model efficiency, and applicability across various languages and contexts., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision., (Copyright © 2024 Padalko, Chomko and Chumachenko.)
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"attention-based BiLSTM"'

1. A hybrid transformer and attention based recurrent neural network for robust and interpretable sentiment analysis of tweets

2. Extraction of data events from the computational biology literature

3. A hybrid transformer and attention based recurrent neural network for robust and interpretable sentiment analysis of tweets

4. A Multi-modal Time Series Intelligent Prediction Model

5. A cognitive brain model for multimodal sentiment analysis based on attention neural networks.

6. A novel approach to fake news classification using LSTM-based deep learning models.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

6 results on '"attention-based BiLSTM"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources