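Computer science
Artificial intelligence
Streaming data
Discriminant
Event (particle physics)
Representation (politics)
Information retrieval
Data mining
Political science
Quantum mechanics
Politics
Physics
Law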
Authors
Chaodong Tong,Huailiang Peng,Xu Bai,Qiong Dai,Ruitong Zhang,Yangyang Li,Hanjie Xu,Xian-Ming Gu
Source
Journal: IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
Date: 2023-12-01
Volume (issue), pages: 35 (12): 12295-12309
Citations: 2
Identifier
DOI:10.1109/tkde.2021.3119686
Abstract
Event detection on social platforms can help people perceive essential events and make actionable decisions. Existing document-pivot streaming social event detection methods generally embed documents and perform text clustering. They face the challenges of constantly changing context and unknown event categories and struggle by designing compound text representation methods and various similarity measures. However, phased, well-designed methods are excessively fragile and unable to utilize the potential of text representations fully. Meanwhile, their complex threshold settings result in clustering-based event detection suffering the pain of ever-changing environments. We propose a text representation learning method namely Text Similarity Contrastive Learning Neural Network (Text-SimCLNN) to tackle these challenges. Text-SimCLNN uses smaller parts to learn the similarity probability of text pairs from semantic and structural perspectives, effectively bridging the gap between text representation learning and similarity measure in streaming event detection. Event discovery and merging in streams can be easily performed based on the learned representations, and we use various techniques to speed up such processes. Furthermore, we introduce an online update mechanism that uses heterogeneous graphs to generate high-quality samples to enable stable and reliable inductive learning. Extensive experiments on two real-world datasets demonstrate that our method far exceeds state-of-the-art (SOTA).