Close menu

SURE

Sunderland Repository records the research produced by the University of Sunderland including practice-based research and theses.

On the Use of Neural Text Generation for the Task of Optical Character Recognition

Mohammadi, Mahnaz, Jaf, Sardar, McGough, Andrew Stephen, Breckon, Toby P., Matthews, Peter, Theodoropoulos, Georgios and Obara, Boguslaw (2019) On the Use of Neural Text Generation for the Task of Optical Character Recognition. In: 16th ACS/IEEE International Conference on Computer Systems and Applications AICCSA 2019, 3-7 Nov. 2019, Abu Dhabi - UAE.

Item Type: Conference or Workshop Item (Paper)

Abstract

Optical Character Recognition (OCR), is extraction of textual data from scanned text documents to facilitate
their indexing, searching, editing and to reduce storage space. Although OCR systems have improved significantly in recent years, they still suffer in situations where the OCR output does not match the text in the original document. Deep learning models have contributed positively to many problems but their full potential to many other problems are yet to be explored. In this paper we propose a post-processing approach based on the application deep learning to improve the accuracy of OCR system (minimizing the error rate).We report on the use of neural network language models to accomplish the task of correcting incorrectly predicted characters/words by OCR systems. We applied our approach to the IAM handwriting database. Our proposed approach delivers significant accuracy improvement of 20:41% in F-score, 10:86% in character level comparison using Levenshtein distance and 20:69% in document level comparison over previously reported context based OCR empirical results of IAM handwriting database.

[img]
Preview
PDF
On the Use of Neural Text Generation for the Task.pdf - Accepted Version

Download (145kB) | Preview
[img] PDF (Administrator use only)
On the Use of Neural Text Generation for the Task of Optical Character Recognition.pdf - Published Version
Restricted to Repository staff only

Download (188kB) | Request a copy

More Information

Uncontrolled Keywords: Neural text generation, Optical character recognition, OCR, OCR post-processing, language models, neural language model, text generation, text prediction, IAM database, handwritten character recognition
Depositing User: Sardar Jaf

Identifiers

Item ID: 11095
URI: http://sure.sunderland.ac.uk/id/eprint/11095
Official URL: http://www.aiccsa.net/AICCSA2019/home-5

Users with ORCIDS

ORCID for Sardar Jaf: ORCID iD orcid.org/0000-0002-5620-0277

Catalogue record

Date Deposited: 06 Sep 2019 12:58
Last Modified: 30 Sep 2020 11:00

Contributors

Author: Sardar Jaf ORCID iD
Author: Mahnaz Mohammadi
Author: Andrew Stephen McGough
Author: Toby P. Breckon
Author: Peter Matthews
Author: Georgios Theodoropoulos
Author: Boguslaw Obara

University Divisions

Faculty of Technology > School of Computer Science

Subjects

Computing > Data Science
Computing > Artificial Intelligence
Computing > Information Systems
Computing > Programming
Computing > Software Engineering

Actions (login required)

View Item View Item