Abstract
This paper describes a comparative application of Grammar Learning by Partition Search to four different learning tasks: deep parsing, NP identification, flat phrase chunking (XP chunking) and NP chunking. In the experiments, base grammars were extracted from a treebank corpus. From this starting point, new grammars optimised for the different parsing tasks were learnt by Partition Search. No lexical information was used. In half of the experiments, local structural context in the form of parent phrase category information was incorporated into the grammars. Results show that grammars containing this information outperform grammars without it by large margins in all tests for all parsing tasks. Parent category information makes the biggest difference for deep parsing, typically corresponding to an improvement of around 5%. Overall, Partition Search with parent phrase category information is shown to be a successful method for learning grammars optimised for a given parsing task, and for minimising grammar size. The biggest margin of improvement over a base grammar was a 5.4% increase in F-Score for deep parsing. The biggest size reductions were 93.5% fewer nonterminals (for NP identification) and 31.3% fewer rules (for XP chunking).
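As a rough illustration of the parent phrase category information described above, the Python sketch below reads CFG rules off toy treebank trees and optionally annotates each category with its parent (e.g. an NP under S becomes NP^S). This is not the authors' implementation: the tree encoding, the toy treebank, the `extract_rules` helper and the `^`-notation are illustrative assumptions only.

```python
# Minimal sketch, assuming a simple (label, children) tree encoding:
# read grammar rules off treebank trees, optionally adding the parent
# phrase category to each nonterminal (e.g. NP^S for an NP under S).
from collections import Counter

# A tree is (label, [children]); a leaf is a plain string (a POS tag here).
toy_treebank = [
    ("S", [("NP", ["DT", "NN"]), ("VP", ["VBD", ("NP", ["DT", "NN"])])]),
]

def extract_rules(node, parent=None, annotate_parent=False, rules=None):
    """Collect CFG rules; optionally suffix each category with its parent."""
    if rules is None:
        rules = Counter()
    label, children = node
    lhs = f"{label}^{parent}" if annotate_parent and parent else label
    rhs = []
    for child in children:
        if isinstance(child, str):  # preterminal (POS tag): left unannotated
            rhs.append(child)
        else:
            child_label = child[0]
            rhs.append(f"{child_label}^{label}" if annotate_parent else child_label)
            extract_rules(child, parent=label,
                          annotate_parent=annotate_parent, rules=rules)
    rules[(lhs, tuple(rhs))] += 1
    return rules

plain = extract_rules(toy_treebank[0])
annotated = extract_rules(toy_treebank[0], annotate_parent=True)
print(sorted(plain))      # ('NP', ('DT', 'NN')), ('S', ('NP', 'VP')), ('VP', ('VBD', 'NP'))
print(sorted(annotated))  # ('NP^S', ('DT', 'NN')), ('NP^VP', ('DT', 'NN')), ...
```

In this toy example the two NP rules collapse into one category in the plain grammar but remain distinct (NP^S vs NP^VP) in the parent-annotated grammar, which is the kind of local structural distinction the abstract credits with the performance gains.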
Original language | English |
---|---|
Title of host publication | COLING '02 Proceedings of the 19th international conference on computational linguistics |
Place of Publication | Stroudsburg, PA, USA |
Publisher | Association for Computational Linguistics |
Pages | 78-84 |
Number of pages | 7 |
Volume | 1 |
ISBN (Print) | 1558608958 |
DOIs | |
Publication status | Published - 1 Jan 2002 |
Event | COLING '02 Proceedings of the 19th international conference on computational linguistics - Taipei, Taiwan. Duration: 1 Jan 2002 → … |