清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Non-Linguistic Constraints on the Acquisition of Phrase Structure

短语 语言学 基于规则的机器翻译 集合(抽象数据类型) 计算机科学 短语结构规则 人工智能 自然语言处理 限定词短语 语言习得 心理学 生成语法 哲学 程序设计语言
作者
Jenny R. Saffran
出处
期刊:Proceedings of the Annual Meeting of the Cognitive Science Society 卷期号:22 (22) 被引量:4
摘要

Non-Linguistic Constraints on the Acquisition of Phrase Structure Jenny R. Saffran (jsaffran@facstaff.wisc.edu) Department of Psychology; 1202 W. Johnson Street Madison, WI 53706 USA Abstract To what extent is linguistic structure learnable from statisti- cal information in the input? One set of cues which might as- sist in the discovery of hierarchical phrase structure given se- rially presented input are the dependencies, or predictive rela- tionships, present within phrases. In order to determine whether adult learners can use this statistical information, subjects were exposed to artificial languages which either contained or violated the kinds of dependencies which charac- terize natural languages. The results suggest that adults pos- sess learning mechanisms which detect and utilize statistical cues to phrase and hierarchical structure. A second experiment contrasted the acquisition of these linguistic systems with the same grammars implemented as non-linguistic input (se- quences of non-linguistic sounds or shapes). These findings suggest that constraints on the mechanisms which highlight the statistical cues which are most characteristic of human languages are not specifically tailored for language learning. Introduction While the idea that surface distributional patterns point to pertinent linguistic structures holds a distinguished place in linguistic history (e.g., Bloomfield, 1933; Harris, 1951), statistical learning has only recently re-emerged as a poten- tial contributing force in language acquisition (though see Maratsos & Chalkley, 1980). This renewed interest in sta- tistical learning has been fueled by developments in compu- tational modeling, by the widespread availability of large corpora of child-directed speech, and most recently by em- pirical research demonstrating that human subjects can per- form statistical language learning tasks in laboratory ex- periments. For example, computational algorithms can use the co-occurrence environments of words to discover form classes in large corpora (e.g., Cartwright & Brent, 1997; Finch & Chater, 1994; Mintz, 1996; Mintz, Newport, & Bever, 1995). Similarly, individual verb argument structures can be induced by models which tracks the co-occurrences of verbs and their arguments in the input (e.g., Schutze, 1994; Seidenberg & MacDonald, 1999). Extensive modeling work has also examined the statistical cues available for the dis- covery of word boundaries in continuous speech (e.g., Aslin, Woodward, LaMendola, & Bever, 1996; Brent & Cartwright, 1996; Cairns, Shillcock, Chater, & Levy, 1997; Christian- sen, Allen, & Seidenberg, 1998; Perruchet & Vintner, These models provide invaluable explorations of the ex- tent to which statistical information is available, in princi- ple, to language learners equipped with the right distribu- tional tools. But are humans such learners? A wealth of sta- tistical cues are useless unless humans can detect and use them. In fact, recent research suggests that humans are ex- tremely good at some statistical language learning tasks, such as word segmentation (e.g., Aslin, Saffran, & New- port, 1998; Goodsitt, Morgan & Kuhl, 1993; Saffran, Aslin, & Newport, 1996; Saffran, Newport, & Aslin, 1996) These results suggest that humans possess powerful sta- tistical language learning mechanisms, which are likely to provide important contributions to the language learning process. At the same time, it is important to recognize that these mechanisms would not be useful in language acquisi- tion unless they are somehow constrained or biased to per- form only certain kinds of computations over certain kinds of input. The pertinent generalizations to be drawn from a linguistic corpus are awash in irrelevant information. Any learning device without the right architectural, representa- tional, or computational constraints risks being sidetracked by the massive number of misleading generalizations avail- able in the input (e.g., Gleitman & Wanner, 1982; Pinker, 1984). There are an infinite number of linguistically irrele- vant statistics that an overly powerful statistical learner could compute: for example, which words are presented third in sentences, or which words follow words whose second syllable begins with th (e.g., Pinker, 1989). One way to avoid this combinatorial explosion would be to impose constraints on statistical learning which perform only a subset of the logically possible computations. It is clear that learning in biological systems is limited by inter- nal factors; there are species differences in which specific types of stimuli serve as privileged input (e.g., Garcia & Koelling, 1966; Marler, 1991). External factors also strongly bias learning, because input from structured do- mains consists of non-random information. In order for sta- tistical learning accounts to succeed, learners must be simi- larly constrained: humans must be just the type of statistical learners who are best suited to acquire the type of input ex- emplified by natural languages, focusing on linguistically relevant statistics while ignoring the wealth of available irrelevant computations. Such constraints might arise from various sources, either specific to language or from more general cognitive and/or perceptual constraints on human learning.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
有人应助oleskarabach采纳,获得10
15秒前
所所应助oleskarabach采纳,获得10
15秒前
大意的晓亦完成签到 ,获得积分10
49秒前
52秒前
阿巴阿巴发布了新的文献求助10
57秒前
可爱的函函应助阿巴阿巴采纳,获得10
1分钟前
mashibeo发布了新的文献求助30
1分钟前
1分钟前
阿巴阿巴发布了新的文献求助10
1分钟前
天天快乐应助阿巴阿巴采纳,获得10
1分钟前
2分钟前
阿巴阿巴发布了新的文献求助10
2分钟前
Akim应助阿巴阿巴采纳,获得10
2分钟前
充电宝应助BaGGiO采纳,获得10
3分钟前
wwe完成签到,获得积分10
3分钟前
FashionBoy应助wwe采纳,获得10
3分钟前
suibianba完成签到,获得积分10
3分钟前
无花果应助科研通管家采纳,获得10
3分钟前
IlIIlIlIIIllI应助科研通管家采纳,获得20
3分钟前
3分钟前
lt0217发布了新的文献求助10
3分钟前
3分钟前
阿巴阿巴发布了新的文献求助10
3分钟前
herpes完成签到 ,获得积分10
3分钟前
平常山河完成签到 ,获得积分10
3分钟前
BaGGiO完成签到,获得积分10
4分钟前
超男完成签到 ,获得积分10
4分钟前
4分钟前
BaGGiO发布了新的文献求助10
4分钟前
4分钟前
pan发布了新的文献求助10
4分钟前
发仔完成签到,获得积分10
4分钟前
5分钟前
WTT发布了新的文献求助10
5分钟前
阿巴阿巴发布了新的文献求助10
5分钟前
ummmmm发布了新的文献求助10
5分钟前
烟花应助阿巴阿巴采纳,获得10
5分钟前
Ethan完成签到 ,获得积分0
5分钟前
WTT完成签到,获得积分20
5分钟前
思源应助liudy采纳,获得10
5分钟前
高分求助中
Production Logging: Theoretical and Interpretive Elements 2500
Healthcare Finance: Modern Financial Analysis for Accelerating Biomedical Innovation 2000
Applications of Emerging Nanomaterials and Nanotechnology 1111
Agaricales of New Zealand 1: Pluteaceae - Entolomataceae 1040
Les Mantodea de Guyane Insecta, Polyneoptera 1000
Neuromuscular and Electrodiagnostic Medicine Board Review 700
지식생태학: 생태학, 죽은 지식을 깨우다 600
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 纳米技术 内科学 物理 化学工程 计算机科学 复合材料 基因 遗传学 物理化学 催化作用 细胞生物学 免疫学 电极
热门帖子
关注 科研通微信公众号,转发送积分 3466837
求助须知:如何正确求助?哪些是违规求助? 3059644
关于积分的说明 9067346
捐赠科研通 2750142
什么是DOI,文献DOI怎么找? 1509065
科研通“疑难数据库(出版商)”最低求助积分说明 697124
邀请新用户注册赠送积分活动 696913