计算机科学
恶意软件
鉴定(生物学)
执法
领域(数学分析)
搜索引擎索引
情报检索
情报分析
非结构化数据
万维网
网络智能
数据科学
计算机安全
Web服务
大数据
数据挖掘
Web建模
数学分析
植物
数学
政治学
法学
生物
作者
Hyeonseong Jo,Jinwoo Kim,Phillip Porras,Vinod Yegneswaran,Seungwon Shin
标识
DOI:10.1109/tifs.2020.3003570
摘要
Textual data mining of open source intelligence on the Web has become an increasingly important topic across a wide range of domains such as business, law enforcement, military, and cybersecurity. Text mining efforts utilize natural language processing to transform unstructured web content into structured forms that can drive various machine learning applications and data indexing services. For example, applications for text mining in cybersecurity have produced a range of threat intelligence services that serve the IT industry. However, a less studied problem is that of automating the identification of semantic inconsistencies among various text input sources. In this paper, we introduce GapFinder, a new inconsistency checking system for identifying semantic inconsistencies within the cybersecurity domain. Specifically, we examine the problem of identifying technical inconsistencies that arise in the functional descriptions of open source malware threat reporting information. Our evaluation, using tens of thousands of relations derived from web-based malware threat reports, demonstrates the ability of GapFinder to identify the presence of inconsistencies.
科研通智能强力驱动
Strongly Powered by AbleSci AI