
Non-Linguistic Constraints on the Acquisition of Phrase Structure

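Keywords: Phrase; Linguistics; Rule-based machine translation; Set (abstract data type); Computer science; Phrase structure rules; Artificial intelligence; Natural language processing; Determiner phrase; Language acquisition; Psychology; Generative grammar; Philosophy; Programming language
Author
Jenny R. Saffran
Source
Journal: Proceedings of the Annual Meeting of the Cognitive Science Society; Volume (Issue): 22 (22); Citations: 4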
Abstract

Jenny R. Saffran (jsaffran@facstaff.wisc.edu), Department of Psychology; 1202 W. Johnson Street, Madison, WI 53706 USA

To what extent is linguistic structure learnable from statistical information in the input? One set of cues which might assist in the discovery of hierarchical phrase structure given serially presented input are the dependencies, or predictive relationships, present within phrases. In order to determine whether adult learners can use this statistical information, subjects were exposed to artificial languages which either contained or violated the kinds of dependencies which characterize natural languages. The results suggest that adults possess learning mechanisms which detect and utilize statistical cues to phrase and hierarchical structure. A second experiment contrasted the acquisition of these linguistic systems with the same grammars implemented as non-linguistic input (sequences of non-linguistic sounds or shapes). These findings suggest that constraints on the mechanisms which highlight the statistical cues which are most characteristic of human languages are not specifically tailored for language learning.

Introduction

While the idea that surface distributional patterns point to pertinent linguistic structures holds a distinguished place in linguistic history (e.g., Bloomfield, 1933; Harris, 1951), statistical learning has only recently re-emerged as a potential contributing force in language acquisition (though see Maratsos & Chalkley, 1980). This renewed interest in statistical learning has been fueled by developments in computational modeling, by the widespread availability of large corpora of child-directed speech, and most recently by empirical research demonstrating that human subjects can perform statistical language learning tasks in laboratory experiments.
For example, computational algorithms can use the co-occurrence environments of words to discover form classes in large corpora (e.g., Cartwright & Brent, 1997; Finch & Chater, 1994; Mintz, 1996; Mintz, Newport, & Bever, 1995). Similarly, individual verb argument structures can be induced by models which track the co-occurrences of verbs and their arguments in the input (e.g., Schutze, 1994; Seidenberg & MacDonald, 1999). Extensive modeling work has also examined the statistical cues available for the discovery of word boundaries in continuous speech (e.g., Aslin, Woodward, LaMendola, & Bever, 1996; Brent & Cartwright, 1996; Cairns, Shillcock, Chater, & Levy, 1997; Christiansen, Allen, & Seidenberg, 1998; Perruchet & Vintner,

These models provide invaluable explorations of the extent to which statistical information is available, in principle, to language learners equipped with the right distributional tools. But are humans such learners? A wealth of statistical cues are useless unless humans can detect and use them. In fact, recent research suggests that humans are extremely good at some statistical language learning tasks, such as word segmentation (e.g., Aslin, Saffran, & Newport, 1998; Goodsitt, Morgan & Kuhl, 1993; Saffran, Aslin, & Newport, 1996; Saffran, Newport, & Aslin, 1996).

These results suggest that humans possess powerful statistical language learning mechanisms, which are likely to provide important contributions to the language learning process. At the same time, it is important to recognize that these mechanisms would not be useful in language acquisition unless they are somehow constrained or biased to perform only certain kinds of computations over certain kinds of input. The pertinent generalizations to be drawn from a linguistic corpus are awash in irrelevant information.
Any learning device without the right architectural, representational, or computational constraints risks being sidetracked by the massive number of misleading generalizations available in the input (e.g., Gleitman & Wanner, 1982; Pinker, 1984). There are an infinite number of linguistically irrelevant statistics that an overly powerful statistical learner could compute: for example, which words are presented third in sentences, or which words follow words whose second syllable begins with th (e.g., Pinker, 1989).

One way to avoid this combinatorial explosion would be to impose constraints on statistical learning which perform only a subset of the logically possible computations. It is clear that learning in biological systems is limited by internal factors; there are species differences in which specific types of stimuli serve as privileged input (e.g., Garcia & Koelling, 1966; Marler, 1991). External factors also strongly bias learning, because input from structured domains consists of non-random information. In order for statistical learning accounts to succeed, learners must be similarly constrained: humans must be just the type of statistical learners who are best suited to acquire the type of input exemplified by natural languages, focusing on linguistically relevant statistics while ignoring the wealth of available irrelevant computations. Such constraints might arise from various sources, either specific to language or from more general cognitive and/or perceptual constraints on human learning.
