Text-Independent Robust Speaker Identification

dc.contributor.authorSharma, Vishu
dc.contributor.supervisorBhardwaj, Saurabh
dc.date.accessioned2016-09-02T06:10:31Z
dc.date.available2016-09-02T06:10:31Z
dc.date.issued2016-09-02
dc.description.abstractAs an underlying topic in speaker recognition, speaker identification aims to identify the speaker in a speech sample. Although identification of speaker is a complex problem, with the development of signal processing algorithms the problem has been simplified and the system performs better if the training and testing conditions are identical. However, the real world environment like background noise, room reverberation, crosstalk, etc., degrades the system performance. Achieving robustness in speaker identification has become a major concern ubiquitously. There are number of existing approaches to this problem such as proposing a robust feature set, introducing noise to models created by clean speech and using methods for speech enhancement to restore the clean speech characteristics. This dissertation aims to address the robustness problem in identification of speakers by proposing a new feature set and then introducing noise in the clean speech models. A new methodology is proposed in which root compression based features are extracted and the order of root is increased for analysing the apt value of root that gives the best identification results. Another methodology extracts the useful information from these feature set, making another set of features which performs substantially better than the conventional speaker features under the influence of noise. The system has been tested under a wide range of signal-tonoise ratio (SNR). The two methods of classification has been used namely, gaussian mixture models and support vector machine. To justify the results of identification, the proposed methodology have been verified on NIST-2003 and Vox-forge 2015 database in presence of noise. The analysis has been done with improving the order of root in the feature compression stage and the evaluation results have shown that features extracted by taking the square root have outperformed others in a noise dominant system while the cube root compression based features have shown best identification accuracy in case of signal dominant systemen_US
dc.identifier.urihttp://hdl.handle.net/10266/4228
dc.language.isoenen_US
dc.subjectSpeaker Identification,en_US
dc.titleText-Independent Robust Speaker Identificationen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
4228.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.03 KB
Format:
Item-specific license agreed upon to submission
Description: