Abstract
This paper describes a comparative application of Grammar Learning by Partition Search to four different learning tasks: deep parsing, NP identification, flat phrase chunking (XP chunking) and NP chunking. In the experiments, base grammars were extracted from a treebank corpus. From this starting point, new grammars optimised for the different parsing tasks were learnt by Partition Search. No lexical information was used. In half of the experiments, local structural context in the form of parent phrase category information was incorporated into the grammars. Results show that grammars containing this information outperform grammars without it by large margins in all tests for all parsing tasks. Parent category information makes the biggest difference for deep parsing, typically corresponding to an improvement of around 5%. Overall, Partition Search with parent phrase category information is shown to be a successful method for learning grammars optimised for a given parsing task, and for minimising grammar size. The biggest margin of improvement over a base grammar was a 5.4% increase in F-Score for deep parsing. The biggest size reductions were 93.5% fewer nonterminals (for NP identification) and 31.3% fewer rules (for XP chunking).
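As a rough illustration of the parent phrase category information described above, the Python sketch below reads CFG rules off toy treebank trees and optionally annotates each category with its parent (e.g. an NP under S becomes NP^S). This is not the authors' implementation: the tree encoding, the toy treebank, the `extract_rules` helper and the `^`-notation are illustrative assumptions only.

```python
# Minimal sketch, assuming a simple (label, children) tree encoding:
# read grammar rules off treebank trees, optionally adding the parent
# phrase category to each nonterminal (e.g. NP^S for an NP under S).
from collections import Counter

# A tree is (label, [children]); a leaf is a plain string (a POS tag here).
toy_treebank = [
    ("S", [("NP", ["DT", "NN"]), ("VP", ["VBD", ("NP", ["DT", "NN"])])]),
]

def extract_rules(node, parent=None, annotate_parent=False, rules=None):
    """Collect CFG rules; optionally suffix each category with its parent."""
    if rules is None:
        rules = Counter()
    label, children = node
    lhs = f"{label}^{parent}" if annotate_parent and parent else label
    rhs = []
    for child in children:
        if isinstance(child, str):  # preterminal (POS tag): left unannotated
            rhs.append(child)
        else:
            child_label = child[0]
            rhs.append(f"{child_label}^{label}" if annotate_parent else child_label)
            extract_rules(child, parent=label,
                          annotate_parent=annotate_parent, rules=rules)
    rules[(lhs, tuple(rhs))] += 1
    return rules

plain = extract_rules(toy_treebank[0])
annotated = extract_rules(toy_treebank[0], annotate_parent=True)
print(sorted(plain))      # ('NP', ('DT', 'NN')), ('S', ('NP', 'VP')), ('VP', ('VBD', 'NP'))
print(sorted(annotated))  # ('NP^S', ('DT', 'NN')), ('NP^VP', ('DT', 'NN')), ...
```

In this toy example the two NP rules collapse into one category in the plain grammar but remain distinct (NP^S vs NP^VP) in the parent-annotated grammar, which is the kind of local structural distinction the abstract credits with the performance gains.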
Original language | English |
---|---|
Title of host publication | COLING '02 Proceedings of the 19th international conference on computational linguistics |
Place of Publication | Stroudsburg, PA, USA |
Publisher | Association for Computational Linguistics |
Pages | 78-84 |
Number of pages | 7 |
Volume | 1 |
ISBN (Print) | 1558608958 |
DOIs | |
Publication status | Published - 1 Jan 2002 |
Event | COLING '02 Proceedings of the 19th international conference on computational linguistics - Taipei, Taiwan. Duration: 1 Jan 2002 → … |