Degree Programs
List of Courses
Search
Navigation
Training-Less Optical Recognition of Printed Characters Using Equation Fitting
R.M. Matugas and J.P. PABICO. 2008. Transactions of the National Academy of Science and Technology of the Philippines 30(1):163-164.
Abstract
We present a novel computational method for optical recognition of printed characters based on our observation that a character image can be divided into several partitions, where each represents a simple mathematical equation. In this paper, we derived a set of mathematical equations to respectively fit parts of an alphanumeric character. Using these equations in tandem with digital image processing, we can identify characters without going through tedious supervised machine training sequences where large samples of learning patterns is required. We digitized ten samples each of printed alphanumeric characters, and store each image in a matrix M, where the (i,j)th matrix element mi,j = I+. To remove noise, we converted M into binary using a thresholding value h, setting all mi,j > h to 1, and to 0 otherwise. We “thinned” M by reducing to 0 some matrix elements in the Moore neighborhood of a chosen mi,j=1, following the Rutovitz and Hilditch thinning procedures, to make sure that a continuous thin line is visible in M. We divided the thinned M into six partitions: two columns and three rows. In each partition, we extracted the coordinates (i,j) of each mi,j=1 and subjected them to equation fitting. We determined which equation the image part fits best using a simple correlation analysis. We developed a table of patterns for each character, wherein a table entry that matches the 6-part pattern determines the alphanumeric character. We validated our method using 360 images where our method correctly classified 94% of Arial fonts. Of the 6% error, our method interchangeably identified C as G, 9 as S, and I as T. Based on the promising result from our validation method, we conclude that our method is a potential alternative optical recognition scheme, specifically on machine printed characters.
Keywords: optical character recognition, training-less, equation fitting
Rev M. Matugas is an undergraduate alumnus and instructor of ICS.
Submitted 04 February 2008; Received 04 March 2008; Accepted 19 June 2008.
Suggested citation for this online article:
_______. Training-Less Optical Recognition Of Printed Characters Using Equation Fitting. Accessed 08 September 2008. UPLB-ICS webpage (http://www.ics.uplb.edu.ph/node/283).







