Please use this identifier to cite or link to this item: http://hdl.handle.net/10266/1828
Title: Extracting Person Name, Date and Place from Text Document using LEX Tool
Authors: Sharma, Roohi
Supervisor: Garhwal, Sunita
Keywords: LEX Tool
Issue Date: 7-Aug-2012
Abstract: Automata theory is closely related to formal language theory. Automata are used in designing and checking the behavior of digital circuits, in lexical analysis, in software for scanning large bodies of text, and in software verification and communications protocols. Finite automata have a finite number of states and provide efficient and convenient tools for representing linguistic phenomena. Searching a text document manually for important information is slow, tedious and error-prone; it commonly misses important information and wastes time. Techniques based on finite state automata have been widely used in natural language processing and information extraction for various natural languages. This thesis studies various text extraction approaches. Based on these approaches, it presents a technique that extracts person names, places and dates from a text document using the Lex tool. The regular expressions used to extract the required information are also discussed.
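Illustration (not part of the thesis record): the regular-expression extraction summarized in the abstract might look roughly like the sketch below. It uses Python's re module rather than Lex so that it runs without a generated scanner. The three patterns, the honorific list, the preposition cue for places, and the function name extract_entities are all assumptions made for this example; the thesis's actual Lex rules are not reproduced in this record.

```python
import re

# Hypothetical patterns for illustration only; they are NOT the thesis's Lex rules.

# Assumption: dates look like 7-Aug-2012, 07/08/2012 or 7 August 2012.
MONTH = r"(?:Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)[a-z]*"
DATE_RE = re.compile(
    r"\b\d{1,2}[-/ ](?:" + MONTH + r"|\d{1,2})[-/ ]\d{4}\b"
)

# Assumption: a person name is an honorific followed by capitalized words.
NAME_RE = re.compile(r"\b(?:Mr|Mrs|Ms|Dr)\.? [A-Z][a-z]+(?: [A-Z][a-z]+)*")

# Assumption: a place is a run of capitalized words after "in", "at" or "from".
PLACE_RE = re.compile(r"\b(?:in|at|from) ([A-Z][a-z]+(?: [A-Z][a-z]+)*)")


def extract_entities(text):
    """Return the names, places and dates matched by the sketch patterns."""
    return {
        "names": NAME_RE.findall(text),
        "places": PLACE_RE.findall(text),  # findall returns the captured group
        "dates": DATE_RE.findall(text),
    }


if __name__ == "__main__":
    sample = "Dr. Sunita Garhwal met Ms. Roohi Sharma in Patiala on 7-Aug-2012."
    print(extract_entities(sample))
```

In a Lex specification, each pattern would instead be a rule whose action prints or stores the matched text. Cue-word heuristics like these are simple but can mislabel any capitalized word that follows "in" or "at", so a practical rule set would need to be considerably richer.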
URI: http://hdl.handle.net/10266/1828
Appears in Collections: Masters Theses@CSED

Files in This Item:
File: 1828.pdf
Size: 1.72 MB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.