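Matching (statistics)
Sequence (biology)
Similarity (geometry)
Computer science
Optimal matching
Test (biology)
Measure (data warehouse)
Value (mathematics)
Index
Econometrics
Data mining
Mathematics
Statistics
Artificial intelligence
Machine learning
Paleontology
Biochemistry
Genetics
Chemistry
Genotype
Single-nucleotide polymorphism
Gene
Image (mathematics)
Biology
Identifier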
DOI:10.1177/0049124109346164
Abstract
Optimal matching (OM) is a method for measuring the similarity between pairs of sequences (e.g., work histories). This article discusses two problems with optimal matching. First, the author identifies a flaw in OM "indel costs" and proposes a solution to this flaw. Second, the author discusses the need for benchmarks to measure the added value of OM and to test competing versions. To that end, the author conducts an empirical test of traditional OM, the alternative localized OM, and sequence comparison. The test documents the problem with traditional OM and shows that it is solved by localized OM. The test also demonstrates the value of OM and sequence comparison in examining occupational sequences; both methods capture variation beyond traditional human capital and status attainment measures, although the marginal improvements of OM over sequence comparison may not justify its computational complexity. These results point to the need for more systematic approaches to sequence analysis methods.
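For readers unfamiliar with how traditional OM scores a pair of sequences, the following is a minimal sketch of the standard dynamic-programming formulation, in which the distance is the cheapest series of substitutions and insertions/deletions ("indels") that turns one sequence into the other. It is not the article's localized OM, whose details the abstract does not give. The function name `om_distance`, the constant costs `sub_cost` and `indel_cost`, and the single-letter state codes are assumptions chosen for illustration.

```python
def om_distance(a, b, sub_cost=2.0, indel_cost=1.0):
    """Traditional optimal-matching distance with constant costs (illustrative).

    a, b       : sequences of hashable states, e.g. strings of state codes
    sub_cost   : assumed cost of substituting one state for a different one
    indel_cost : assumed cost of inserting or deleting a single state
    """
    n, m = len(a), len(b)
    # dp[i][j] holds the minimal cost of transforming a[:i] into b[:j].
    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * indel_cost  # delete every state of a[:i]
    for j in range(1, m + 1):
        dp[0][j] = j * indel_cost  # insert every state of b[:j]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Matching states cost nothing; differing states cost sub_cost.
            substitution = 0.0 if a[i - 1] == b[j - 1] else sub_cost
            dp[i][j] = min(
                dp[i - 1][j - 1] + substitution,  # match or substitute
                dp[i - 1][j] + indel_cost,        # delete a[i-1]
                dp[i][j - 1] + indel_cost,        # insert b[j-1]
            )
    return dp[n][m]


# Hypothetical work histories: E = employed, U = unemployed, M = manager.
history_1 = "EEUU"
history_2 = "EEU"
distance = om_distance(history_1, history_2)
```

In this sketch every indel costs the same no matter which state is inserted or deleted or what surrounds it. The result therefore depends heavily on the chosen ratio of `indel_cost` to `sub_cost`: with the defaults above, one substitution costs exactly as much as a deletion plus an insertion.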