PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Active Learning Strategies in Handwritten Text Recognition
Nicolás Serrano, Adrià Giménez Pastor, Alberto Sanchis and Alfons Juan
In: ICMI-MLMI 2010, November 8-10, Beijing, China.

Abstract

Active learning strategies are being increasingly used in a variety of real-world tasks, though their application to handwritten text transcription in old manuscripts remains nearly unexplored. The basic idea is to follow a sequential, line-by-line transcription of the whole manuscript in which a continuously retrained system interacts with the user to efficiently transcribe each new line. This approach has been recently explored using a conventional strategy by which the user is only asked to supervise words that are not recognized with high confidence. In this paper, the conventional strategy is improved by also letting the system to recompute most probable hypotheses with the constraints imposed by user supervisions. In particular, two strategies are studied which differ in the frequency of hypothesis recomputation on the current line: after each (iterative) or all (delayed) user corrections. Empirical results are reported on two real tasks showing that these strategies outperform the conventional approach.

EPrint Type:Conference or Workshop Item (Poster)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:User Modelling for Computer Human Interaction
Natural Language Processing
Multimodal Integration
ID Code:7435
Deposited By:Alfons Juan
Deposited On:17 March 2011