A game-based approach to transcribing images of text

Khalil Dahab, Anja Belz

Research output: Chapter in Book/Conference proceeding with ISSN or ISBN; Conference contribution with ISSN or ISBN; peer-review

Abstract

We present a methodology that takes as input scanned documents of typed or hand-written text, and produces transcriptions of the text as output. Instead of using OCR technology, the methodology is game-based and produces such transcriptions as a by-product of game play. The approach is intended particularly for languages for which language technology and resources are scarce and reliable OCR technology may not exist. It can be used in place of OCR for transcribing individual documents, or to create the corpora of paired images and transcriptions required to train OCR tools. We present Minefield, a prototype implementation of the approach, which is currently collecting Arabic transcriptions.
Original language: English
Title of host publication: 7th International Conference on Language Resources and Evaluation
Place of Publication: France
Publisher: European Language Resources Association (ELRA)
Pages: 0-0
Number of pages: 1
Publication status: Published - 1 Jan 2010
Event: 7th International Conference on Language Resources and Evaluation - Valletta, Malta
Duration: 1 Jan 2010 → …

Conference

Conference: 7th International Conference on Language Resources and Evaluation
Period: 1/01/10 → …
