1. NER in Cyber Threat Intelligence Domain Using Transformer with TSGL.
- Author
-
Huang, Yuhuang, su, Mang, Xu, Yuting, and Liu, Tian
- Subjects
CYBERTERRORISM ,CYBER intelligence (Computer security) - Abstract
In response to the continuous sophistication of cyber threat actors, it is imperative to make the best use of cyber threat intelligence converted from structured or semi-structured data and Named Entity Recognition (NER) techniques that contribute to extracting critical cyber threat intelligence. To promote the NER research in Cyber Threat Intelligence (CTI) domain, we provide a Large Dataset for NER in Cyber Threat Intelligence (LDNCTI). On the LDNCTI corpus, we investigated the feasibility of mainstream transformer-based models in CTI domain. To settle the problem of unbalanced label distribution, we introduce a transformer-based model with a Triplet Loss based on metric learning and Sorted Gradient harmonizing mechanism (TSGL). Our experimental results show that the LDNCTI well represents critical threat intelligence and that our transformer-based model with the new loss function outperforms previous schemes on the Dataset for NER in Threat Intelligence (DNRTI) and the dataset for NER in Advanced Persistent Threats (APTNER). [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF