计算机科学
人工智能
卷积神经网络
联营
自然语言处理
情报检索
概率潜在语义分析
排名(信息检索)
词(群论)
集合(抽象数据类型)
哲学
语言学
程序设计语言
作者
Yelong Shen,Xiaodong He,Jianfeng Gao,Li Deng,Grégoire Mesnil
标识
DOI:10.1145/2567948.2577348
摘要
This paper presents a series of new latent semantic models based on a convolutional neural network (CNN) to learn low-dimensional semantic vectors for search queries and Web documents. By using the convolution-max pooling operation, local contextual information at the word n-gram level is modeled first. Then, salient local fea-tures in a word sequence are combined to form a global feature vector. Finally, the high-level semantic information of the word sequence is extracted to form a global vector representation. The proposed models are trained on clickthrough data by maximizing the conditional likelihood of clicked documents given a query, us-ing stochastic gradient ascent. The new models are evaluated on a Web document ranking task using a large-scale, real-world data set. Results show that our model significantly outperforms other se-mantic models, which were state-of-the-art in retrieval performance prior to this work.
科研通智能强力驱动
Strongly Powered by AbleSci AI