Semantic Feature Graph Consistency with Contrastive Cluster Assignments for Multilingual Document Clustering

计算机科学 一致性(知识库) 聚类分析 自然语言处理 图形 特征(语言学) 文档聚类 人工智能 情报检索 星团(航天器) 语言学 理论计算机科学 程序设计语言 哲学
作者
Teng Sun,Zhenqiu Shu,Yuxin Huang,Hongbin Wang,Zhengtao Yu
出处
期刊:ACM Transactions on Asian and Low-Resource Language Information Processing
标识
DOI:10.1145/3708887
摘要

Multilingual document clustering (MDC) aims to partition multilingual documents into distinct clusters based on topic categories in an unsupervised manner. However, existing MDC methods still suffer from several limitations in practice tasks. Firstly, most of them optimize multiple objectives within the same feature space, thereby leading to the conflict between learning consistently shared semantics and reconstructing inconsistent view-specific information. Secondly, several methods directly integrate information from multilingual documents during the fusion stage, thereby overlooking the semantic differences between different language features. To address the aforementioned problems, we propose a novel multi-view learning method, called Semantic Feature Graph Consistency with Contrastive Cluster Assignments (SFGC 3 A), for multilingual document clustering. Specifically, the proposed SFGC 3 A method implements consistency objective and reconstruction objective in different feature spaces, thus effectively avoiding conflicts between consistency learning and inconsistency reconstruction. Subsequently, we design the semantic feature graph consistency and semantic label consistency modules to further explore consistent semantic information among multilingual documents, thereby reducing the semantic differences among different language views. Extensive experiments on several multilingual document datasets have shown the effectiveness of the proposed SFGC 3 A method in MDC tasks. The source codes for this work will be released later.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI6.4应助slp123456采纳,获得10
1秒前
嘻嘻发布了新的文献求助10
2秒前
青槐发布了新的文献求助10
3秒前
4秒前
4秒前
kcl发布了新的文献求助10
4秒前
xxyh完成签到,获得积分10
4秒前
6秒前
单身的钧完成签到,获得积分10
6秒前
bkagyin应助Prospect采纳,获得10
6秒前
半颗橙子完成签到 ,获得积分10
7秒前
雨下整夜完成签到,获得积分10
9秒前
上官若男应助优雅柏柳采纳,获得10
9秒前
稷下听风完成签到,获得积分10
9秒前
12秒前
我不爱池鱼应助DZ采纳,获得10
12秒前
12秒前
真实的火车完成签到,获得积分10
12秒前
顺心的傲柔完成签到,获得积分10
12秒前
bamboo发布了新的文献求助10
13秒前
13秒前
轨道交通振动与噪声小白完成签到,获得积分10
13秒前
科研通AI6.4应助袅袅采纳,获得10
16秒前
JamesPei应助孙周采纳,获得10
16秒前
hyt发布了新的文献求助30
17秒前
海不扬波发布了新的文献求助10
17秒前
猫和老鼠发布了新的文献求助10
17秒前
pengchen发布了新的文献求助30
18秒前
JamesPei应助ggeng采纳,获得10
18秒前
小壳儿完成签到 ,获得积分10
20秒前
wtzzzh发布了新的文献求助10
20秒前
诚心的小x应助风里等你采纳,获得10
22秒前
22秒前
0805zz应助蓝天采纳,获得10
22秒前
mumu完成签到,获得积分10
23秒前
23秒前
24秒前
24秒前
硫酸亚铬应助1111采纳,获得10
25秒前
26秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Cronologia da história de Macau 1600
Decentring Leadership 1000
Lloyd's Register of Shipping's Approach to the Control of Incidents of Brittle Fracture in Ship Structures 1000
BRITTLE FRACTURE IN WELDED SHIPS 1000
Intentional optical interference with precision weapons (in Russian) Преднамеренные оптические помехи высокоточному оружию 1000
Atlas of Anatomy 5th original digital 2025的PDF高清电子版(非压缩版,大小约400-600兆,能更大就更好了) 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 计算机科学 化学工程 生物化学 物理 复合材料 内科学 催化作用 物理化学 光电子学 细胞生物学 基因 电极 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6184391
求助须知:如何正确求助?哪些是违规求助? 8011685
关于积分的说明 16664077
捐赠科研通 5283697
什么是DOI,文献DOI怎么找? 2816584
邀请新用户注册赠送积分活动 1796376
关于科研通互助平台的介绍 1660883