Please use this identifier to cite or link to this item:
http://hdl.handle.net/10266/1828
Title: | Extracting Person Name, Date and Place from Text Document using LEX Tool |
Authors: | Sharma, Roohi |
Supervisor: | Garhwal, Sunita |
Keywords: | LEX Tool |
Issue Date: | 7-Aug-2012 |
Abstract: | Automata theory is closely related to formal language theory. Automata is used in designing and checking the behavior of digital circuits, lexical analysis, software for scanning large bodies of text and software verification, communications protocols. Finite automata have finite number of states. Finite automata provide efficient and convenient tools to represent the linguistic phenomena. If a text document is searched manually to get some important information, the process is slow, tedious and error prone. Manual searching commonly misses important information in the document and it causes wastage of time. It has been noted that finite state automata based techniques have been widely used in natural language processing and information extraction in various natural languages. In this thesis report various text extracting approaches are studied. Based on these approaches, a technique is given which extracts person name, place and date from text document using Lex tool. Regular expressions through which required information is extracted are also discussed. |
URI: | http://hdl.handle.net/10266/1828 |
Appears in Collections: | Masters Theses@CSED |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.