Abstract
In this paper, we present the datasets used in the Shallow and Deep Tracks of the First Multilingual Surface Realisation Shared Task (SR’18).
For the Shallow Track, data in ten languages has been re- leased: Arabic, Czech, Dutch, English, Finnish, French, Italian, Portuguese, Russian and Spanish. For the Deep Track, data in three languages is made available: English, French and Spanish. We describe in detail how the datasets were derived from
the Universal Dependencies V2.0, and report on an evaluation of the Deep Track input quality. In addition, we examine the motivation for, and likely usefulness of, deriving NLG inputs from annotations in resources originally developed for Natural Language Understanding (NLU), and
assess whether the resulting inputs supply enough information of the right kind for the final stage in the NLG process.
For the Shallow Track, data in ten languages has been re- leased: Arabic, Czech, Dutch, English, Finnish, French, Italian, Portuguese, Russian and Spanish. For the Deep Track, data in three languages is made available: English, French and Spanish. We describe in detail how the datasets were derived from
the Universal Dependencies V2.0, and report on an evaluation of the Deep Track input quality. In addition, we examine the motivation for, and likely usefulness of, deriving NLG inputs from annotations in resources originally developed for Natural Language Understanding (NLU), and
assess whether the resulting inputs supply enough information of the right kind for the final stage in the NLG process.
Original language | English |
---|---|
Title of host publication | Proceedings of the 11th International Natural Language Generation Conference |
Publisher | The Association for Computational Linguistics |
DOIs | |
Publication status | Published - 1 Nov 2018 |
Event | 11th International Conference on Natural Language Generation - Tilburg University, Tilburg, Netherlands Duration: 5 Nov 2018 → 8 Nov 2018 https://inlg2018.uvt.nl/ |
Conference
Conference | 11th International Conference on Natural Language Generation |
---|---|
Abbreviated title | INLG2018 |
Country/Territory | Netherlands |
City | Tilburg |
Period | 5/11/18 → 8/11/18 |
Internet address |