Back to Search
Start Over
DM_NLP at SemEval-2018 Task 8: neural sequence labeling with linguistic features
- Source :
- SemEval@NAACL-HLT
- Publication Year :
- 2018
- Publisher :
- Association for Computational Linguistics, 2018.
-
Abstract
- This paper describes our submissions for SemEval-2018 Task 8: Semantic Extraction from CybersecUrity REports using NLP. The DM_NLP participated in two subtasks: SubTask 1 classifies if a sentence is useful for inferring malware actions and capabilities, and SubTask 2 predicts token labels (“Action”, “Entity”, “Modifier” and “Others”) for a given malware-related sentence. Since we leverage results of Subtask 2 directly to infer the result of Subtask 1, the paper focus on the system solving Subtask 2. By taking Subtask 2 as a sequence labeling task, our system relies on a recurrent neural network named BiLSTM-CNN-CRF with rich linguistic features, such as POS tags, dependency parsing labels, chunking labels, NER labels, Brown clustering. Our system achieved the highest F1 score in both token level and phrase level.
- Subjects :
- Phrase
Brown clustering
Computer science
02 engineering and technology
Sequence labeling
SemEval
Linguistics
03 medical and health sciences
ComputingMethodologies_PATTERNRECOGNITION
0302 clinical medicine
Recurrent neural network
Dependency grammar
Chunking (psychology)
030221 ophthalmology & optometry
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
F1 score
Sentence
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of The 12th International Workshop on Semantic Evaluation
- Accession number :
- edsair.doi...........f95a2254904bb59ab370ab28f734f518
- Full Text :
- https://doi.org/10.18653/v1/s18-1114