English to Hindi Statistical Machine Translation System

dc.contributor.authorSharma, Nakul
dc.contributor.supervisorBhatia, Parteek
dc.contributor.supervisorSingh, Varinderpal
dc.date.accessioned2011-08-02T10:53:24Z
dc.date.available2011-08-02T10:53:24Z
dc.date.issued2011-08-02T10:53:24Z
dc.descriptionM.E. (Software Engineering - CSED)en
dc.description.abstractMachine Translation is an important part of Natural Language Processing. It refers to using machine to convert one natural language to another. Statistical Machine Translation is a part of Machine Translation that strives to use machine learning paradigm towards translating text. Statistical Machine Translation consists of Language Model (LM), Translation Model (TM) and decoder. In this thesis, English to Hindi Statistical Machine Translation system has been developed. The development of Language Model, Translation Model and decoder is done by making use of software’s available in Linux environment. SR International’s Language Model (SRILM) for Language Model, GIZA++ and mkcls for Translation Model, Moses for decoding, has been used in this system. LM computes the probability of target language sentences. TM calculates the probability of target sentences given the source sentence and the decoder maximizes the probability of translated text of target language. A parallel corpus of 5000 sentences in English and Hindi has been used in training of the system. The system was evaluated using manual evaluation method and a geometric average score of 2.693, 2.93 on the parameters of fluency and adequacy respectively, were found.en
dc.format.extent1251678 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10266/1449
dc.language.isoenen
dc.subjectSMTen
dc.subjectTranslationen
dc.titleEnglish to Hindi Statistical Machine Translation Systemen
dc.typeThesisen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1449 Nakul Sharma (800931012).pdf
Size:
1.19 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.79 KB
Format:
Item-specific license agreed upon to submission
Description: