Cultural heritage digital resources: from extraction to querying

M. Genereux

Research output: Chapter in Book/Conference proceeding with ISSN or ISBNConference contribution with ISSN or ISBN

Abstract

This article presents a method to extract and query Cultural Heritage (CH) textual digital resources. The extraction and querying phases are linked by a common ontological representation (CIDOC-CRM). A transport format (RDF) allows the ontology to be queried in a suitable query language (SPARQL), on top of which an interface makes it possible to formulate queries in Natural Language (NL). The extraction phase exploits the propositional nature of the ontology. The query interface is based on the Generate and Select principle, where potentially suitable queries are generated to match the user input, only for the most semantically similar candidate to be selected. In the process we evaluate data extracted from the description of a medieval city (Wolfenbuttel), transform and develop two methods of computing similarity between sentences based on WordNet. Experiments are described that compare the pros and cons of the similarity measures and evaluate them.
Original languageEnglish
Title of host publicationACL 2007 Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007)
Publication statusPublished - 2007
EventACL 2007 Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007) - Prague, Czech Republic
Duration: 1 Jan 2007 → …

Workshop

WorkshopACL 2007 Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007)
Period1/01/07 → …

Keywords

  • Cultural heritage
  • Natural language generation

Fingerprint Dive into the research topics of 'Cultural heritage digital resources: from extraction to querying'. Together they form a unique fingerprint.

  • Cite this

    Genereux, M. (2007). Cultural heritage digital resources: from extraction to querying. In ACL 2007 Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007) http://ilk.uvt.nl/latech07/