锆石
计算机科学
聚类分析
钥匙(锁)
语义学(计算机科学)
相似性(几何)
特征(语言学)
人工智能
自然语言处理
模式识别(心理学)
地质学
图像(数学)
古生物学
语言学
哲学
计算机安全
程序设计语言
作者
Xiangben Hu,Zhichen Hu,Jielin Jiang,Weiwei Xue,Xiumian Hu,Xiaolong Xu
标识
DOI:10.1007/s12145-022-00847-y
摘要
Similarity calculations for zircons are vital to topical issues in sedimentology, such as provenance analysis, dating of sediment and identification of geotectonic effects. In general, zircon data is stored in a table where each column represents a key-value pair. According to the semantics of the keys, multiple tables are merged to extract data for analyzing the variability of single feature. However, there are conflicts between the different indicators due to sedimentation, which leads to inaccuracy of similarity. Moreover, unknown and semantically ambiguous keys are not recognized by the knowledge base, which results in the inefficiency of aggregating key-value pairs. Therefore, this paper proposed a Fast Much zircon (FM-zircon) framework that combines natural language processing (NLP) and multidimensional scaling (MDS) for calculating the similarity of zircons. First, NLP classifies keys by extracting semantic features. After the key-value pairs with the same key are fused, MDS is implemented to calculate multiple features. Ultimately, the results are represented in a visual representation To evaluate the performance, experiments were performed with zircon tables, that showed the good performance of FM-zircon.
科研通智能强力驱动
Strongly Powered by AbleSci AI