计算机科学
自然语言处理
人工智能
判决
多任务学习
一般化
卷积神经网络
自然语言理解
建筑
自然语言
任务(项目管理)
数学分析
艺术
数学
管理
经济
视觉艺术
作者
Ronan Collobert,Jason Weston
标识
DOI:10.1145/1390156.1390177
摘要
We describe a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, semantic roles, semantically similar words and the likelihood that the sentence makes sense (grammatically and semantically) using a language model. The entire network is trained jointly on all these tasks using weight-sharing, an instance of multitask learning. All the tasks use labeled data except the language model which is learnt from unlabeled text and represents a novel form of semi-supervised learning for the shared tasks. We show how both multitask learning and semi-supervised learning improve the generalization of the shared tasks, resulting in state-of-the-art-performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI