亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Non-Linguistic Constraints on the Acquisition of Phrase Structure

短语 语言学 基于规则的机器翻译 集合(抽象数据类型) 计算机科学 短语结构规则 人工智能 自然语言处理 限定词短语 语言习得 心理学 生成语法 哲学 程序设计语言
作者
Jenny R. Saffran
出处
期刊:Proceedings of the Annual Meeting of the Cognitive Science Society 卷期号:22 (22) 被引量:4
摘要

Non-Linguistic Constraints on the Acquisition of Phrase Structure Jenny R. Saffran (jsaffran@facstaff.wisc.edu) Department of Psychology; 1202 W. Johnson Street Madison, WI 53706 USA Abstract To what extent is linguistic structure learnable from statisti- cal information in the input? One set of cues which might as- sist in the discovery of hierarchical phrase structure given se- rially presented input are the dependencies, or predictive rela- tionships, present within phrases. In order to determine whether adult learners can use this statistical information, subjects were exposed to artificial languages which either contained or violated the kinds of dependencies which charac- terize natural languages. The results suggest that adults pos- sess learning mechanisms which detect and utilize statistical cues to phrase and hierarchical structure. A second experiment contrasted the acquisition of these linguistic systems with the same grammars implemented as non-linguistic input (se- quences of non-linguistic sounds or shapes). These findings suggest that constraints on the mechanisms which highlight the statistical cues which are most characteristic of human languages are not specifically tailored for language learning. Introduction While the idea that surface distributional patterns point to pertinent linguistic structures holds a distinguished place in linguistic history (e.g., Bloomfield, 1933; Harris, 1951), statistical learning has only recently re-emerged as a poten- tial contributing force in language acquisition (though see Maratsos & Chalkley, 1980). This renewed interest in sta- tistical learning has been fueled by developments in compu- tational modeling, by the widespread availability of large corpora of child-directed speech, and most recently by em- pirical research demonstrating that human subjects can per- form statistical language learning tasks in laboratory ex- periments. For example, computational algorithms can use the co-occurrence environments of words to discover form classes in large corpora (e.g., Cartwright & Brent, 1997; Finch & Chater, 1994; Mintz, 1996; Mintz, Newport, & Bever, 1995). Similarly, individual verb argument structures can be induced by models which tracks the co-occurrences of verbs and their arguments in the input (e.g., Schutze, 1994; Seidenberg & MacDonald, 1999). Extensive modeling work has also examined the statistical cues available for the dis- covery of word boundaries in continuous speech (e.g., Aslin, Woodward, LaMendola, & Bever, 1996; Brent & Cartwright, 1996; Cairns, Shillcock, Chater, & Levy, 1997; Christian- sen, Allen, & Seidenberg, 1998; Perruchet & Vintner, These models provide invaluable explorations of the ex- tent to which statistical information is available, in princi- ple, to language learners equipped with the right distribu- tional tools. But are humans such learners? A wealth of sta- tistical cues are useless unless humans can detect and use them. In fact, recent research suggests that humans are ex- tremely good at some statistical language learning tasks, such as word segmentation (e.g., Aslin, Saffran, & New- port, 1998; Goodsitt, Morgan & Kuhl, 1993; Saffran, Aslin, & Newport, 1996; Saffran, Newport, & Aslin, 1996) These results suggest that humans possess powerful sta- tistical language learning mechanisms, which are likely to provide important contributions to the language learning process. At the same time, it is important to recognize that these mechanisms would not be useful in language acquisi- tion unless they are somehow constrained or biased to per- form only certain kinds of computations over certain kinds of input. The pertinent generalizations to be drawn from a linguistic corpus are awash in irrelevant information. Any learning device without the right architectural, representa- tional, or computational constraints risks being sidetracked by the massive number of misleading generalizations avail- able in the input (e.g., Gleitman & Wanner, 1982; Pinker, 1984). There are an infinite number of linguistically irrele- vant statistics that an overly powerful statistical learner could compute: for example, which words are presented third in sentences, or which words follow words whose second syllable begins with th (e.g., Pinker, 1989). One way to avoid this combinatorial explosion would be to impose constraints on statistical learning which perform only a subset of the logically possible computations. It is clear that learning in biological systems is limited by inter- nal factors; there are species differences in which specific types of stimuli serve as privileged input (e.g., Garcia & Koelling, 1966; Marler, 1991). External factors also strongly bias learning, because input from structured do- mains consists of non-random information. In order for sta- tistical learning accounts to succeed, learners must be simi- larly constrained: humans must be just the type of statistical learners who are best suited to acquire the type of input ex- emplified by natural languages, focusing on linguistically relevant statistics while ignoring the wealth of available irrelevant computations. Such constraints might arise from various sources, either specific to language or from more general cognitive and/or perceptual constraints on human learning.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
4秒前
23秒前
科研雪瑞发布了新的文献求助10
27秒前
45秒前
50秒前
59秒前
handong发布了新的文献求助10
1分钟前
Willow完成签到,获得积分10
1分钟前
斯文宛秋发布了新的文献求助10
1分钟前
颜靖仇完成签到,获得积分10
1分钟前
1分钟前
科研通AI6.1应助astg采纳,获得10
1分钟前
1分钟前
handong完成签到,获得积分10
1分钟前
调皮大娘完成签到 ,获得积分10
1分钟前
烟花应助晗安采纳,获得10
1分钟前
1分钟前
1分钟前
1分钟前
ccc发布了新的文献求助10
2分钟前
晗安发布了新的文献求助10
2分钟前
潘善若发布了新的文献求助100
2分钟前
2分钟前
田様应助科研通管家采纳,获得10
2分钟前
顾矜应助ccc采纳,获得10
2分钟前
科目三应助稳重马里奥采纳,获得10
2分钟前
yyds完成签到,获得积分20
2分钟前
2分钟前
马克完成签到,获得积分10
2分钟前
6666完成签到,获得积分10
2分钟前
ai zs发布了新的文献求助10
2分钟前
xlj完成签到 ,获得积分10
2分钟前
科研通AI2S应助自觉无心采纳,获得10
2分钟前
wmd完成签到,获得积分10
2分钟前
2分钟前
2分钟前
mirrovo完成签到 ,获得积分10
2分钟前
2分钟前
daggeraxe发布了新的文献求助10
3分钟前
领导范儿应助斯文宛秋采纳,获得10
3分钟前
高分求助中
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Bounds for Statistical Estimation in Semiparametric Models 500
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Ideology and Meaning-Making under the Putin Regime 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6471733
求助须知:如何正确求助?哪些是违规求助? 8275908
关于积分的说明 17646123
捐赠科研通 5550429
什么是DOI,文献DOI怎么找? 2909363
邀请新用户注册赠送积分活动 1886148
关于科研通互助平台的介绍 1736926