Abstract
We present the GIVE-2 Corpus, a new corpus of human instruction giving. The corpus was collected by asking one person in each pair of subjects to guide the other towards completing a task in a virtual 3D environment using typed instructions. This is the same setting as that of the recent GIVE Challenge, so the corpus can serve both as a source of data and as a point of comparison for NLG systems that participate in the GIVE Challenge. The instruction-giving data we collected is multilingual (45 German and 63 English dialogues) and can easily be extended to further languages using our software, which we have made available. We analyze the corpus to study the effects of learning through repeated participation in the task and the effects of the participants' spatial navigation abilities. Finally, we present a novel annotation scheme for situated referring expressions and compare the referring expressions in the German and English data.
Original language | English |
---|---|
Title of host publication | Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010 |
Publisher | European Language Resources Association (ELRA) |
Pages | 2401-2406 |
Number of pages | 6 |
ISBN (Electronic) | 2951740867, 9782951740860 |
Publication status | Published - Jan 1 2010 |
Externally published | Yes |
Event | 7th International Conference on Language Resources and Evaluation, LREC 2010 - Valletta, Malta (Duration: May 17 2010 → May 23 2010) |
Other
Other | 7th International Conference on Language Resources and Evaluation, LREC 2010 |
---|---|
Country/Territory | Malta |
City | Valletta |
Period | May 17 2010 → May 23 2010 |
ASJC Scopus subject areas
- Education
- Library and Information Sciences
- Linguistics and Language
- Language and Linguistics