Morphological complexity and unsupervised learning: validating Russian inflectional classes using high frequency data

Dunstan Brown, Roger Evans

Research output: Chapter in Book/Conference proceeding with ISSN or ISBNConference contribution with ISSN or ISBN

Abstract

This paper addresses the question of whether it is possible to use machine learning techniques on linguistic data to validate linguistic theory. We determine how readily inflectional classes recognized by linguists can be inferred by an unsupervised learning method when it is presented with the paradigms of a small number (80) of high frequency Russian noun lexemes. We interpret this as a measure of the validity of the linguistic theory. Inflectional classes are of particular interest, because they constitute a kind of autonomous morphological complexity which has no direct relationship to other levels of linguistic description, and hence there is no other objective way of assessing a theoretical characterisation of them. Using the same method, we also examine the status of principal parts and defaults in inflectional classes, and the relationship between inflectional classes and stress in Russian nominal morphology. Our experiments suggest that this is an effective and interesting technique for shedding additional light on theoretical claims.
Original languageEnglish
Title of host publicationCurrent issues in Morphological Theory: (Ir)regularity, analogy and frequency. Selected papers from the 14th International Morphology Meeting
EditorsKiefer Ference, Mária Ladányi, Péter Siptár
Place of PublicationAmsterdam
PublisherJohn Benjamins Publishing Co.
Pages135-162
Number of pages28
ISBN (Electronic)9789027273833
ISBN (Print)9789027248404
Publication statusPublished - 1 May 2012
EventCurrent issues in Morphological Theory: (Ir)regularity, analogy and frequency. Selected papers from the 14th International Morphology Meeting - Budapest, 13–16 May, 2010
Duration: 1 May 2012 → …

Publication series

NameCurrent Issues in Morphological Theory

Conference

ConferenceCurrent issues in Morphological Theory: (Ir)regularity, analogy and frequency. Selected papers from the 14th International Morphology Meeting
Period1/05/12 → …

Fingerprint Dive into the research topics of 'Morphological complexity and unsupervised learning: validating Russian inflectional classes using high frequency data'. Together they form a unique fingerprint.

Cite this