计算机科学
情报检索
概率潜在语义分析
排名(信息检索)
语义搜索
人工智能
任务(项目管理)
语义计算
语义匹配
相关性(法律)
语义网
自然语言处理
匹配(统计)
统计
数学
管理
政治学
法学
经济
作者
Po-Sen Huang,Xiaodong He,Jianfeng Gao,Li Deng,Alex Acero,Larry Heck
出处
期刊:Conference on Information and Knowledge Management
日期:2013-10-27
卷期号:: 2333-2338
被引量:1889
标识
DOI:10.1145/2505515.2505665
摘要
Latent semantic models, such as LSA, intend to map a query to its relevant documents at the semantic level where keyword-based matching often fails. In this study we strive to develop a series of new latent semantic models with a deep structure that project queries and documents into a common low-dimensional space where the relevance of a document given a query is readily computed as the distance between them. The proposed deep structured semantic models are discriminatively trained by maximizing the conditional likelihood of the clicked documents given a query using the clickthrough data. To make our models applicable to large-scale Web search applications, we also use a technique called word hashing, which is shown to effectively scale up our semantic models to handle large vocabularies which are common in such tasks. The new models are evaluated on a Web document ranking task using a real-world data set. Results show that our best model significantly outperforms other latent semantic models, which were considered state-of-the-art in the performance prior to the work presented in this paper.
科研通智能强力驱动
Strongly Powered by AbleSci AI