Back to Search
Start Over
Sequence Classification of Tweets with Transfer Learning via BERT in the Field of Disaster Management
- Source :
- EAI Endorsed Transactions on Scalable Information Systems, Vol 8, Iss 31 (2021)
- Publication Year :
- 2021
- Publisher :
- European Alliance for Innovation (EAI), 2021.
-
Abstract
- Twitter is extensively used as an information-sharing platform during any kind of emergency like disasters etc. People tweet useful information about disaster-related events such as evacuations, volunteer need, help, warnings etc. This data is sometimes very useful for rescue teams, NGOs, military and various other government and private organisations who are tasked with responsibilities to save lives and provide volunteers. This data can also be used to analyze disaster behaviour. In this paper, we have collected labelled tweets from crisisLexT26 and crisisNLP and classified them into seven labels on the basis of information provided by them. The data was heavily skewed. So to improve the accuracy of classifiers, we have applied various techniques as a result of which we have created two datasets (Imbalanced and Balanced). We have compared the performance of various BERT-based models on these two datasets. For sequence classification, a balanced dataset performs better than an imbalanced dataset. We can improve accuracy of classifiers to great extent by adopting good data preprocessing and data splitting techniques.
- Subjects :
- Sequence
Government
Data splitting
Emergency management
Computer science
business.industry
balanced dataset
bert (bidirectional encoder representation from transformers)
Machine learning
computer.software_genre
Field (computer science)
ComputingMethodologies_PATTERNRECOGNITION
T58.6-58.62
imbalanced dataset
disaster management
tweet classification
Data pre-processing
Artificial intelligence
Management information systems
natural language processing
Transfer of learning
business
computer
Subjects
Details
- Language :
- English
- ISSN :
- 20329407
- Volume :
- 8
- Issue :
- 31
- Database :
- OpenAIRE
- Journal :
- EAI Endorsed Transactions on Scalable Information Systems
- Accession number :
- edsair.doi.dedup.....d2043340b3365602d6e2e093e0f52012
- Full Text :
- https://doi.org/10.4108/eai.23-3-2021.169071