System building cost vs. output quality in data-to-text generation

Anja Belz; Eric Kow

System building cost vs. output quality in data-to-text generation

Anja Belz, Eric Kow

University of Brighton

Research output: Chapter in Book/Conference proceeding with ISSN or ISBN › Conference contribution with ISSN or ISBN › peer-review

Abstract

Data-to-text generation systems tend tobe knowledge-based and manually built,which limits their reusability and makes them time and cost-intensive to createand maintain. Methods for automating(part of) the system building process exist,but do such methods risk a loss inoutput quality? In this paper, we investigatethe cost/quality trade-off in generation system building. We comparefour new data-to-text systems which were created by predominantly automatic techniques against six existing systems for the same domain which were created by predominantlymanual techniques. We evaluate the ten systems using intrinsic automatic metrics and human quality ratings.We find that increasing the degree towhich system building is automated doesnot necessarily result in a reduction in outputquality. We find furthermore that standardautomatic evaluation metrics underestimate the quality of handcrafted systems and over-estimate the quality of automatically created systems.

Original language	English
Title of host publication	Proceedings of the 12th European workshop on natural language generation (ENLG 2009)
Place of Publication	Athens, Greece
Publisher	Technografia Digital Press
Pages	16-24
Number of pages	9
Publication status	Published - 30 Mar 2009
Event	Proceedings of the 12th European workshop on natural language generation (ENLG 2009) - Athens, Greece, 30-31 March, 2009 Duration: 30 Mar 2009 → …

Workshop

Workshop	Proceedings of the 12th European workshop on natural language generation (ENLG 2009)
Period	30/03/09 → …

Access to Document

http://www.itri.brighton.ac.uk/~Anja.Belz/Publications/belz-kow-enlg09.pdfLicence: Unspecified

Cite this

@inproceedings{8fcff6c636c44a5fa82de66317dd35e5,

title = "System building cost vs. output quality in data-to-text generation",

abstract = "Data-to-text generation systems tend tobe knowledge-based and manually built,which limits their reusability and makes them time and cost-intensive to createand maintain. Methods for automating(part of) the system building process exist,but do such methods risk a loss inoutput quality? In this paper, we investigatethe cost/quality trade-off in generation system building. We comparefour new data-to-text systems which were created by predominantly automatic techniques against six existing systems for the same domain which were created by predominantlymanual techniques. We evaluate the ten systems using intrinsic automatic metrics and human quality ratings.We find that increasing the degree towhich system building is automated doesnot necessarily result in a reduction in outputquality. We find furthermore that standardautomatic evaluation metrics underestimate the quality of handcrafted systems and over-estimate the quality of automatically created systems.",

author = "Anja Belz and Eric Kow",

year = "2009",

month = mar,

day = "30",

language = "English",

pages = "16--24",

booktitle = "Proceedings of the 12th European workshop on natural language generation (ENLG 2009)",

publisher = "Technografia Digital Press",

note = "Proceedings of the 12th European workshop on natural language generation (ENLG 2009) ; Conference date: 30-03-2009",

}

Belz, A & Kow, E 2009, System building cost vs. output quality in data-to-text generation. in Proceedings of the 12th European workshop on natural language generation (ENLG 2009). Technografia Digital Press, Athens, Greece, pp. 16-24, Proceedings of the 12th European workshop on natural language generation (ENLG 2009), 30/03/09. <http://www.itri.brighton.ac.uk/~Anja.Belz/Publications/belz-kow-enlg09.pdf>

System building cost vs. output quality in data-to-text generation. / Belz, Anja; Kow, Eric.
Proceedings of the 12th European workshop on natural language generation (ENLG 2009). Athens, Greece: Technografia Digital Press, 2009. p. 16-24.

Research output: Chapter in Book/Conference proceeding with ISSN or ISBN › Conference contribution with ISSN or ISBN › peer-review

TY - GEN

T1 - System building cost vs. output quality in data-to-text generation

AU - Belz, Anja

AU - Kow, Eric

PY - 2009/3/30

Y1 - 2009/3/30

N2 - Data-to-text generation systems tend tobe knowledge-based and manually built,which limits their reusability and makes them time and cost-intensive to createand maintain. Methods for automating(part of) the system building process exist,but do such methods risk a loss inoutput quality? In this paper, we investigatethe cost/quality trade-off in generation system building. We comparefour new data-to-text systems which were created by predominantly automatic techniques against six existing systems for the same domain which were created by predominantlymanual techniques. We evaluate the ten systems using intrinsic automatic metrics and human quality ratings.We find that increasing the degree towhich system building is automated doesnot necessarily result in a reduction in outputquality. We find furthermore that standardautomatic evaluation metrics underestimate the quality of handcrafted systems and over-estimate the quality of automatically created systems.

AB - Data-to-text generation systems tend tobe knowledge-based and manually built,which limits their reusability and makes them time and cost-intensive to createand maintain. Methods for automating(part of) the system building process exist,but do such methods risk a loss inoutput quality? In this paper, we investigatethe cost/quality trade-off in generation system building. We comparefour new data-to-text systems which were created by predominantly automatic techniques against six existing systems for the same domain which were created by predominantlymanual techniques. We evaluate the ten systems using intrinsic automatic metrics and human quality ratings.We find that increasing the degree towhich system building is automated doesnot necessarily result in a reduction in outputquality. We find furthermore that standardautomatic evaluation metrics underestimate the quality of handcrafted systems and over-estimate the quality of automatically created systems.

M3 - Conference contribution with ISSN or ISBN

SP - 16

EP - 24

BT - Proceedings of the 12th European workshop on natural language generation (ENLG 2009)

PB - Technografia Digital Press

CY - Athens, Greece

T2 - Proceedings of the 12th European workshop on natural language generation (ENLG 2009)

Y2 - 30 March 2009

ER -

System building cost vs. output quality in data-to-text generation

Abstract

Workshop

Access to Document

Fingerprint

Cite this