Grammar learning by partition search

Anja Belz

University of Brighton

Research output: Chapter in Book/Conference proceeding with ISSN or ISBN › Conference contribution with ISSN or ISBN › peer-review

Abstract

This paper describes Grammar Learning by Partition Search, a general method for automatically constructing grammars for a range of parsing tasks. Given a base grammar, a training corpus, and a parsing task, Partition Search constructs an optimised probabilistic context-free grammar by searching a space of nonterminal set partitions, looking for a partition that maximises parsing performance and minimises grammar size. The method can be used to optimise grammars in terms of size and performance, or to adapt existing grammars to new parsing tasks and new domains. This paper reports an example application to optimising a base grammar extracted from the Wall Street Journal Corpus. Partition Search improves parsing performance by up to 5.29%, and reduces grammar size by up to 16.89%. Parsing results are better than in existing treebank grammar research, and compared to other grammar compression methods, Partition Search has the advantage of achieving compression without loss of grammar coverage.
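The core operation the abstract implies, turning one candidate partition of the nonterminal set into a smaller probabilistic grammar, can be illustrated with a minimal sketch. It is not the paper's implementation. The function name `apply_partition`, the rule-count dictionary format, the `+`-joined block labels, and the relative-frequency re-estimation are all assumptions made for this illustration. The search strategy and the objective that trades off parsing performance against grammar size are not specified in this record and are not shown.

```python
from collections import defaultdict


def apply_partition(rule_counts, partition):
    """Merge nonterminals block by block and re-estimate rule probabilities.

    rule_counts: dict mapping (lhs, rhs_tuple) -> observed count (hypothetical format)
    partition:   list of sets of nonterminal names; symbols in no block are kept as-is
    """
    # Give every nonterminal in a block the same (hypothetical) merged label.
    rename = {}
    for block in partition:
        label = "+".join(sorted(block))
        for nonterminal in block:
            rename[nonterminal] = label

    # Rewrite both sides of each rule; rules that become identical pool their counts.
    merged_counts = defaultdict(int)
    for (lhs, rhs), count in rule_counts.items():
        new_lhs = rename.get(lhs, lhs)
        new_rhs = tuple(rename.get(symbol, symbol) for symbol in rhs)
        merged_counts[(new_lhs, new_rhs)] += count

    # Relative-frequency estimate per left-hand side (an assumption of this sketch).
    lhs_totals = defaultdict(int)
    for (lhs, _), count in merged_counts.items():
        lhs_totals[lhs] += count
    return {rule: count / lhs_totals[rule[0]] for rule, count in merged_counts.items()}


# Toy rule counts, invented for illustration only.
rule_counts = {
    ("NP-SBJ", ("DT", "NN")): 3,
    ("NP-OBJ", ("DT", "NN")): 1,
    ("NP-OBJ", ("NN",)): 2,
    ("S", ("NP-SBJ", "VP")): 4,
    ("VP", ("VB", "NP-OBJ")): 4,
}
merged = apply_partition(rule_counts, [{"NP-SBJ", "NP-OBJ"}])
for (lhs, rhs), prob in sorted(merged.items()):
    print(f"{lhs} -> {' '.join(rhs)}  [{prob:.3f}]")
```

In this toy case the two NP rules with the same right-hand side collapse into one, so the grammar shrinks from five rules to four. A search procedure would generate many such candidate partitions and score each resulting grammar on the parsing task.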

Original language	English
Title of host publication	Proceedings of the LREC 2002 workshop on event modelling for multilingual document linking
Place of Publication	Amsterdam/Philadelphia
Publisher	John Benjamins Publishing Company
Pages	0-0
Number of pages	1
Publication status	Published - 1 Jan 2002
Event	Proceedings of the LREC 2002 workshop on event modelling for multilingual document linking - Las Palmas, Spain; Duration: 1 Jan 2002 → …

Workshop

Workshop	Proceedings of the LREC 2002 workshop on event modelling for multilingual document linking
Period	1/01/02 → …

Keywords

Natural language generation
Partition searching

Cite this

@inproceedings{6ea0f480ac334b0b9a42087a789130b3,

title = "Grammar learning by partition search",

abstract = "This paper describes Grammar Learning by Partition Search, a general method for automatically constructing grammars for a range of parsing tasks. Given a base grammar, a training corpus, and a parsing task, Partition Search constructs an optimised probabilistic context-free grammar by searching a space of nonterminal set partitions, looking for a partition that maximises parsing performance and minimises grammar size. The method can be used to optimise grammars in terms of size and performance, or to adapt existing grammars to new parsing tasks and new domains. This paper reports an example application to optimising a base grammar extracted from the Wall Street Journal Corpus. Partition Search improves parsing performance by up to 5.29%, and reduces grammar size by up to 16.89%. Parsing results are better than in existing treebank grammar research, and compared to other grammar compression methods, Partition Search has the advantage of achieving compression without loss of grammar coverage.",

keywords = "Natural language generation, Partition searching",

author = "Anja Belz",

year = "2002",

month = jan,

day = "1",

language = "English",

pages = "0--0",

booktitle = "Proceedings of the LREC 2002 workshop on event modelling for multilingual document linking",

publisher = "John Benjamins Publishing Company",

note = "Proceedings of the LREC 2002 workshop on event modelling for multilingual document linking ; Conference date: 01-01-2002",

}

TY - GEN

T1 - Grammar learning by partition search

AU - Belz, Anja

PY - 2002/1/1

Y1 - 2002/1/1

N2 - This paper describes Grammar Learning by Partition Search, a general method for automatically constructing grammars for a range of parsing tasks. Given a base grammar, a training corpus, and a parsing task, Partition Search constructs an optimised probabilistic context-free grammar by searching a space of nonterminal set partitions, looking for a partition that maximises parsing performance and minimises grammar size. The method can be used to optimise grammars in terms of size and performance, or to adapt existing grammars to new parsing tasks and new domains. This paper reports an example application to optimising a base grammar extracted from the Wall Street Journal Corpus. Partition Search improves parsing performance by up to 5.29%, and reduces grammar size by up to 16.89%. Parsing results are better than in existing treebank grammar research, and compared to other grammar compression methods, Partition Search has the advantage of achieving compression without loss of grammar coverage.

AB - This paper describes Grammar Learning by Partition Search, a general method for automatically constructing grammars for a range of parsing tasks. Given a base grammar, a training corpus, and a parsing task, Partition Search constructs an optimised probabilistic context-free grammar by searching a space of nonterminal set partitions, looking for a partition that maximises parsing performance and minimises grammar size. The method can be used to optimise grammars in terms of size and performance, or to adapt existing grammars to new parsing tasks and new domains. This paper reports an example application to optimising a base grammar extracted from the Wall Street Journal Corpus. Partition Search improves parsing performance by up to 5.29%, and reduces grammar size by up to 16.89%. Parsing results are better than in existing treebank grammar research, and compared to other grammar compression methods, Partition Search has the advantage of achieving compression without loss of grammar coverage.

KW - Natural language generation

KW - Partition searching

M3 - Conference contribution with ISSN or ISBN

SP - 0

EP - 0

BT - Proceedings of the LREC 2002 workshop on event modelling for multilingual document linking

PB - John Benjamins Publishing Company

CY - Amsterdam/Philadelphia

T2 - Proceedings of the LREC 2002 workshop on event modelling for multilingual document linking

Y2 - 1 January 2002

ER -
