计算机科学
领域(数学)
任务(项目管理)
命名实体识别
接头(建筑物)
人工智能
工程类
建筑工程
数学
系统工程
纯数学
作者
Xuren Wang,Songheng He,Zihan Xiong,Xinxin Wei,Zhengwei Jiang,Sihan Chen,Jun Jiang
标识
DOI:10.1109/cscwd54268.2022.9776031
摘要
This paper provides a new dataset for Named Entity Recognition (NER) missions in cyber threat intelligence (CTI) studying. To the best of our knowledge, the proposed dataset is the biggest and challenging one in the field to comply with the STIX 2.1 specification. We collected the APT (Advanced Persistent Threats) reports from different network security companies and manually annotated them. Then we constructed a dataset named APTNER, which can be used for NER joint and multi-task learning tasks in CTI. Apart from common labels like IP, URL, mal-ware, location and so on, APTNER contains 21 categories, which make APTNER more challenging than other NER datasets in CTI field and we have proved the rationality of the dataset. For ease of comparison studies, we realize several state-of-the-art baselines and report their analysis. To facilitate future work on fine-grained NER for CTI, we make APTNER public at https://github.com/wangxuren/APTNER.
科研通智能强力驱动
Strongly Powered by AbleSci AI