Non-Linguistic Constraints on the Acquisition of Phrase Structure

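Phrase; Linguistics; Rule-based machine translation; Set (abstract data type); Computer science; Phrase structure rules; Artificial intelligence; Natural language processing; Determiner phrase; Language acquisition; Psychology; Generative grammar; Philosophy; Programming language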
Author
Jenny R. Saffran
Source
Journal: Proceedings of the Annual Meeting of the Cognitive Science Society; Volume (issue): 22 (22); Citations: 4
Abstract

Jenny R. Saffran (jsaffran@facstaff.wisc.edu), Department of Psychology; 1202 W. Johnson Street, Madison, WI 53706 USA

To what extent is linguistic structure learnable from statistical information in the input? One set of cues which might assist in the discovery of hierarchical phrase structure given serially presented input are the dependencies, or predictive relationships, present within phrases. In order to determine whether adult learners can use this statistical information, subjects were exposed to artificial languages which either contained or violated the kinds of dependencies which characterize natural languages. The results suggest that adults possess learning mechanisms which detect and utilize statistical cues to phrase and hierarchical structure. A second experiment contrasted the acquisition of these linguistic systems with the same grammars implemented as non-linguistic input (sequences of non-linguistic sounds or shapes). These findings suggest that constraints on the mechanisms which highlight the statistical cues which are most characteristic of human languages are not specifically tailored for language learning.

Introduction

While the idea that surface distributional patterns point to pertinent linguistic structures holds a distinguished place in linguistic history (e.g., Bloomfield, 1933; Harris, 1951), statistical learning has only recently re-emerged as a potential contributing force in language acquisition (though see Maratsos & Chalkley, 1980). This renewed interest in statistical learning has been fueled by developments in computational modeling, by the widespread availability of large corpora of child-directed speech, and most recently by empirical research demonstrating that human subjects can perform statistical language learning tasks in laboratory experiments.
For example, computational algorithms can use the co-occurrence environments of words to discover form classes in large corpora (e.g., Cartwright & Brent, 1997; Finch & Chater, 1994; Mintz, 1996; Mintz, Newport, & Bever, 1995). Similarly, individual verb argument structures can be induced by models that track the co-occurrences of verbs and their arguments in the input (e.g., Schutze, 1994; Seidenberg & MacDonald, 1999). Extensive modeling work has also examined the statistical cues available for the discovery of word boundaries in continuous speech (e.g., Aslin, Woodward, LaMendola, & Bever, 1996; Brent & Cartwright, 1996; Cairns, Shillcock, Chater, & Levy, 1997; Christiansen, Allen, & Seidenberg, 1998; Perruchet & Vintner,

These models provide invaluable explorations of the extent to which statistical information is available, in principle, to language learners equipped with the right distributional tools. But are humans such learners? A wealth of statistical cues are useless unless humans can detect and use them. In fact, recent research suggests that humans are extremely good at some statistical language learning tasks, such as word segmentation (e.g., Aslin, Saffran, & Newport, 1998; Goodsitt, Morgan & Kuhl, 1993; Saffran, Aslin, & Newport, 1996; Saffran, Newport, & Aslin, 1996).

These results suggest that humans possess powerful statistical language learning mechanisms, which are likely to provide important contributions to the language learning process. At the same time, it is important to recognize that these mechanisms would not be useful in language acquisition unless they are somehow constrained or biased to perform only certain kinds of computations over certain kinds of input. The pertinent generalizations to be drawn from a linguistic corpus are awash in irrelevant information.
Any learning device without the right architectural, representational, or computational constraints risks being sidetracked by the massive number of misleading generalizations available in the input (e.g., Gleitman & Wanner, 1982; Pinker, 1984). There are an infinite number of linguistically irrelevant statistics that an overly powerful statistical learner could compute: for example, which words are presented third in sentences, or which words follow words whose second syllable begins with th (e.g., Pinker, 1989).

One way to avoid this combinatorial explosion would be to impose constraints on statistical learning which perform only a subset of the logically possible computations. It is clear that learning in biological systems is limited by internal factors; there are species differences in which specific types of stimuli serve as privileged input (e.g., Garcia & Koelling, 1966; Marler, 1991). External factors also strongly bias learning, because input from structured domains consists of non-random information. In order for statistical learning accounts to succeed, learners must be similarly constrained: humans must be just the type of statistical learners who are best suited to acquire the type of input exemplified by natural languages, focusing on linguistically relevant statistics while ignoring the wealth of available irrelevant computations. Such constraints might arise from various sources, either specific to language or from more general cognitive and/or perceptual constraints on human learning.
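To make the notion of a "predictive relationship" between adjacent items concrete, the following is a minimal sketch of one statistic a learner could in principle track: the conditional probability of the next word given the current word. This is an illustrative assumption, not the measure or procedure used in the paper; the function name `transitional_probabilities` and the toy corpus are hypothetical.

```python
from collections import Counter


def transitional_probabilities(sentences):
    """Estimate P(next word | current word) from adjacent pairs within each sentence."""
    pair_counts = Counter()   # how often (current, following) occurs
    first_counts = Counter()  # how often `current` is followed by any word
    for sentence in sentences:
        words = sentence.split()
        for current, following in zip(words, words[1:]):
            pair_counts[(current, following)] += 1
            first_counts[current] += 1
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}


# Hypothetical toy corpus, invented for illustration only.
corpus = [
    "the dog runs",
    "the cat runs",
    "the dog sleeps",
    "a cat sleeps",
]
tp = transitional_probabilities(corpus)

for (current, following), p in sorted(tp.items()):
    print(f"P({following} | {current}) = {p:.2f}")
```

A statistic like this is only one of indefinitely many that could be computed over the same input, which is the point of the passage above: the open question is which computations human learners are constrained to perform.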
