时间轴
自动汇总
计算机科学
判决
代表(政治)
自然语言处理
语义学(计算机科学)
意义(存在)
人工智能
情报检索
特征(语言学)
任务(项目管理)
语言学
程序设计语言
数学
心理学
统计
哲学
管理
政治
政治学
法学
经济
心理治疗师
作者
Behrooz Mansouri,Ricardo Campos,Adam Jatowt
标识
DOI:10.1145/3543873.3587670
摘要
Timeline summarization (TLS) is a challenging research task that requires researchers to distill extensive and intricate temporal data into a concise and easily comprehensible representation. This paper proposes a novel approach to timeline summarization using Abstract Meaning Representations (AMRs), a graphical representation of the text where the nodes are semantic concepts and the edges denote relationships between concepts. With AMR, sentences with different wordings, but similar semantics, have similar representations. To make use of this feature for timeline summarization, a two-step sentence selection method that leverages features extracted from both AMRs and the text is proposed. First, AMRs are generated for each sentence. Sentences are then filtered out by removing those with no named-entities and keeping the ones with the highest number of named-entities. In the next step, sentences to appear in the timeline are selected based on two scores: Inverse Document Frequency (IDF) of AMR nodes combined with the score obtained by applying a keyword extraction method to the text. Our experimental results on the TLS-Covid19 test collection demonstrate the potential of the proposed approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI