Segmentation of Malayalam Handwritten Characters into Pattern Primitives and Recognition using SVM
Baiju.K.B1, Sabna.T.S2, Lajish.V.L3
1Baiju.K.B*, Department of Computer Science, University of Calicut, Kerala, India.
2Sabna.T.S, Department of Computer Science, University of Calicut, Kerala, India.
3Lajish.V.L, Department of Computer Science, University of Calicut, Kerala, India.
Manuscript received on May 06, 2020. | Revised Manuscript received on May 15, 2020. | Manuscript published on June 30, 2020. | PP: 1817-1822 | Volume-9 Issue-5, June 2020. | Retrieval Number: C4820029320/2020©BEIESP | DOI: 10.35940/ijeat.C4820.029320
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: This paper describes a lexical analysis (segmentation) approach in Pattern Recognition for Online Handwritten Character Recognition (OHCR) in Malayalam. The subunits (pattern primitives) of the single-stroke vowel characters in Malayalam are identified and labeled to obtain a reference set of characters. Handwritten character samples are then segmented into pattern primitives, following the reference set, by combining the Ramer-Douglas-Peucker (RDP) algorithm with the Eight Direction Freeman Code (EDFC). Features that are unique to the primitives of a character are extracted. The discriminating features identified are the direction of the first primitive, the segment count, a cusp in the second primitive, a crossing in the third primitive, and a cusp in the seventh primitive. Experiments were conducted on 100 samples per character whose segmentation exactly matched the reference set. With this five-dimensional feature set, the study achieved a recognition rate of 95.77% under five-fold cross-validation using a Support Vector Machine (SVM) with an RBF kernel. The study shows that segmenting characters into pattern primitives is an effective way to build accurate Malayalam OHCR systems for real-time applications.
Keywords: OHCR, SPR, Pattern Primitives, RDP, EDFC.
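As a rough illustration of the segmentation idea summarized in the abstract, the sketch below simplifies a pen stroke with the Ramer-Douglas-Peucker algorithm and then labels each remaining segment with an eight-direction Freeman code. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the tolerance `epsilon`, the direction convention (0 = east, counting counter-clockwise, y axis pointing up), and the way the two steps are chained are all hypothetical choices made for this example.

```python
import math

def perpendicular_distance(p, a, b):
    """Distance from point p to the line through a and b (or to a if a == b)."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    length = math.hypot(dx, dy)
    if length == 0.0:
        return math.hypot(px - ax, py - ay)
    return abs(dy * (px - ax) - dx * (py - ay)) / length

def rdp(points, epsilon):
    """Ramer-Douglas-Peucker simplification of a polyline."""
    if len(points) < 3:
        return list(points)
    first, last = points[0], points[-1]
    # Find the interior point farthest from the chord first-last.
    index, dmax = 0, 0.0
    for i in range(1, len(points) - 1):
        d = perpendicular_distance(points[i], first, last)
        if d > dmax:
            index, dmax = i, d
    if dmax <= epsilon:
        return [first, last]
    # Keep the farthest point and simplify both halves recursively.
    left = rdp(points[:index + 1], epsilon)
    right = rdp(points[index:], epsilon)
    return left[:-1] + right

def freeman_code(a, b):
    """Eight-direction code of the segment a -> b (assumed convention: 0 = east, CCW)."""
    angle = math.atan2(b[1] - a[1], b[0] - a[0])
    return round(angle / (math.pi / 4)) % 8

def segment_stroke(points, epsilon=1.0):
    """Simplify a stroke, then return one direction code per remaining segment."""
    simplified = rdp(points, epsilon)
    return [freeman_code(a, b) for a, b in zip(simplified, simplified[1:])]

# A slightly noisy L-shaped stroke: right along the x axis, then up.
stroke = [(0, 0), (1, 0.1), (2, 0), (3, 0), (3, 1), (3.1, 2), (3, 3)]
codes = segment_stroke(stroke, epsilon=0.5)
print(codes)       # [0, 2]
print(len(codes))  # 2
```

In this toy example the first code and the number of codes loosely correspond to two of the features named in the abstract (direction of the first primitive and segment count). The cusp and crossing features would need additional logic that is not sketched here.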