Generating data as a proxy for unavailable corpus data: the contextualized sentence completion task

Loading...
Thumbnail Image
File version

Version of Record (VoR)

Author(s)
Ford, Marilyn
Bresnan, Joan
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2015
Size
File type(s)
Location
License
Abstract

There is much interest in using large corpora to explore predictors of the probability of higher level linguistic structures, but suitable corpora are not available for all languages and their varieties. We explore a task that uses discourse contexts from an existing corpus as prompts for sentence completion to investigate the usefulness of the method for generating data as a proxy for unavailable corpus data. Mini databases of dative and genitive structures were obtained with the method using American and Australian participants. It is shown that the databases are indeed a good proxy for corpus data.

Journal Title

Corpus Linguistics and Linguistic Theory

Conference Title
Book Title
Edition
Volume

11

Issue

1

Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2015 Walter de Gruyter & Co. KG Publishers. The attached file is reproduced here in accordance with the copyright policy of the publisher. Please refer to the journal's website for access to the definitive, published version.

Item Access Status
Note
Access the data
Related item(s)
Subject

Linguistics not elsewhere classified

Artificial Intelligence and Image Processing

Cognitive Sciences

Linguistics

Persistent link to this record
Citation
Collections