Prosodically Guided Phonetic Engine for Punjabi Language

Agarwal, Neeshu

Prosodically Guided Phonetic Engine for Punjabi Language

dc.contributor.author	Agarwal, Neeshu
dc.contributor.supervisor	Sharma, R. K.
dc.date.accessioned	2015-08-03T09:43:17Z
dc.date.available	2015-08-03T09:43:17Z
dc.date.issued	2015-08-03T09:43:17Z
dc.description	ME, CSED	en
dc.description.abstract	This thesis deals with prosodically guided phonetic engine for Automatic Speech Recognition (ASR) for Punjabi language using trainable systems. The goal of this work is the development of a phonetic Engine using Phonetic Transcription and so to build acoustic models for Punjabi language. This is done by employing Hidden Markov Models (HMMs) that provide statistical representation of each of the distinct sounds that make up a word and to train the parameters of the models developed. The speech recognition modelling can be done in two ways: Acoustic modelling and Language Modelling. Out of these two, Acoustic Modelling has been used. In this work, Acoustic modelling has been worked out at a phonetic level, allowing general speech recognition applications. For this purpose, the tool HTK is employed, stated as Hidden Markov Model toolkit. HTK uses different commands to produce HMM models for each phone and computing various parameters for mixture. Different phone models have been developed and tested. This thesis is divided into six chapters. A brief review of these chapters is given below. Chapter 1 includes the definitions of the terms and the brief idea of the concepts, description of tools used in this work. Chapter 2 includes the literature survey which depicts the ideology, and previously proposed models and methodologies to train a phonetic engine. Chapter 3 depicts the problem statement and its cause that motivates to work in this domain. Chapter 4 describes the process of data collection of read, lecture, and conversational modes of Punjabi Speech, experimental set up and methodology which depicts the process of development of prosodically guided phonetic engine. For this purpose, Ubuntu Linux (14.04 64 bit) has been considered as environment. Wave Surfer tool has been used for manual transcription and Hidden Markov Model toolkit has been used to train and test the phonetic engine for various modes of speech. Chapter 5 presents the results of phonetic engine developed in this work for two categories, 30 phones and 34 phones. Gender-wise and transcription-wise performance of the developed engine have also been presented in this chapter. Chapter 6 includes the conclusion and future scope of this work.	en
dc.format.extent	10283553 bytes
dc.format.extent	10283553 bytes
dc.format.mimetype	application/pdf
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/10266/3478
dc.language.iso	en	en
dc.subject	Prosody	en
dc.subject	Phonetic Engine	en
dc.subject	CSED	en
dc.title	Prosodically Guided Phonetic Engine for Punjabi Language	en
dc.type	Thesis	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 3478.pdf
Size:: 9.81 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.79 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Masters Theses@CSED