Please use this identifier to cite or link to this item:
Title: Detection and segmentation of text in handwritten Gurmukhi scripts
Authors: Modi, Namisha
Supervisor: Jindal, Khushneet
Keywords: Segmentation;handwritten punjabi OCR;Punjabi Line Segmentation;Handwritten Punjabi OCR;Handwritten line segmentation;Handwritten word segmentation;Handwritten character segmentation;Gurmukhi handwritten segmentation
Issue Date: 16-Aug-2013
Abstract: Computerized character recognition has been an intensive and challenging research in the area of computer vision for many years. Such automated character recognition systems provide a solution for processing large volumes of data automatically. Character recognition could be broadly categorized into two sub-fields: Machine Printed Character Recognition and Handwritten Character Recognition (HCR). First, deals with the recognition of machine printed characters where second deals with the recognition of handwritten characters. HCR can be further divided into two sub-fields, namely, online HCR and offline HCR. Offline handwritten character recognition involves the development of computational method that can generate descriptions of the handwritten objects from scanned digital images. This is a challenging computational task, due to the vast impreciseness associated with the handwritten patterns of different individuals. To extract this imprecise knowledge, efficient algorithms of preprocessing, segmentation, classification are required for better recognition of characters, words and text lines. In the present work, we have focused on Punjabi handwritten text segmentation. Gurumukhi script is a two dimensional composition of symbols with connected and disconnected diacritics. Handwritten Gurumukhi script has some complexities like connected, overlapped text lines, words and characters. It is one of the major reasons for errors during the recognition process. In offline HCR, segmentation is a challenging job for writer independent handwritten document image. In this thesis, an algorithm is proposed for text line, word and character segmentation in handwritten Punjabi document. The proposed algorithm deals with the problems like overlapped and connected components in text lines and extracts text lines from handwritten document image. Later, words and characters are extracted.
Description: Master of Technology (Computer Science and Applications)
Appears in Collections:Masters Theses@SOM

Files in This Item:
File Description SizeFormat 
2304.PDF2.39 MBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.