Automatic Coding of Printed Materials

Item Type Journal paper
Abstract

The paper presents a complete method for using automatic techniques to code printed text pages. It involves three automatic steps and one or two steps of manual corrections to obtain fully accurate results. We discovered that present-day consumer digital cameras are much better than high-end scanners to obtain pictures of printed pages quickly and without the wear and tear associated with scanners. We also found that high-end ($370) OCR software is much more cost-effective to achieve accurate text recognition and to process large amounts of data. We also describe how researchers can write a computer program for classifying automatically non-uniform data. We provide detailed instructions for each step in the automatic coding method so that other researchers can readily copy it.

Authors Murmann, Johann P.; Homburg, Ernst; Geven, Ruud; Bermiss, Y. Sekou & Forgione, Alfonzo
Journal or Publication Title International Journal of Humanities and Arts Computing
Language English
Subjects business studies
HSG Classification contribution to scientific community
Refereed Yes
Date 15 August 2008
Publisher Edinburgh University Press
Place of Publication Edinburgh
Volume 1
Number 2
Page Range 151-185
Number of Pages 34
Publisher DOI 10.3366/E1753854808000244
Official URL https://www.euppublishing.com/doi/abs/10.3366/E175...
Contact Email Address peter.murmann@unisg.ch
Depositing User Michele K├Âlbener
Date Deposited 18 Oct 2018 09:07
Last Modified 22 Oct 2018 09:43
URI: https://www.alexandria.unisg.ch/publications/255381

Download

[img] Text
murmann_2007.pdf

Download (3MB)
[img] Text
murmann_2007.pdf

Download (3MB)

Citation

Murmann, Johann P.; Homburg, Ernst; Geven, Ruud; Bermiss, Y. Sekou & Forgione, Alfonzo (2008) Automatic Coding of Printed Materials. International Journal of Humanities and Arts Computing, 1 (2). 151-185.

Statistics

https://www.alexandria.unisg.ch/id/eprint/255381
Edit item Edit item
Feedback?