计算机科学
关键词提取
主题(计算)
自然语言处理
萃取(化学)
情报检索
人工智能
万维网
化学
色谱法
作者
Reza Yousefi Maragheh,Chenhao Fang,Charan Chand Irugu,Parth Parikh,Jason Cho,Jianpeng Xu,S. Sukumar,Malay Patel,Evren Körpeoğlu,Sushant Kumar,Kannan Achan
标识
DOI:10.1109/bigdata59044.2023.10386476
摘要
Keyword extraction is one of the core tasks in natural language processing. Classic extraction models are notorious for having a short attention span which make it hard for them to conclude relational connections among the words and sentences that are far from each other. This, in turn, makes their usage prohibitive for generating keywords that are inferred from the context of the whole text. In this paper, we explore using Large Language Models (LLMs) in generating keywords for items that are inferred from the items’ textual metadata. Our modeling framework includes several stages to fine grain the results by avoiding outputting keywords that are non-informative or sensitive and reduce hallucinations common in LLM’s. We call our LLM-based framework Theme-Aware Keyword Extraction (LLM-TAKE). We propose two variations of framework for generating extractive and abstractive themes for products in an E-commerce setting. We perform an extensive set of experiments on three real data sets and show that our modeling framework can enhance accuracy-based and diversity-based metrics when compared with benchmark models.
科研通智能强力驱动
Strongly Powered by AbleSci AI