Please use this identifier to cite or link to this item: http://hdl.handle.net/10266/6943
Title: Development of a Biometric System Using Ear
Authors: Mahajan, Anshul
Supervisor: Singla, Sunil K
Keywords: Ear Biometrics;Convolutional Neural Network;Deep Learning;Gated Recurring Unit;Bidirectional Long Short-Term Memory
Issue Date: 10-Jan-2025
Abstract: Ear biometrics is gaining recognition as a promising biometric modality due to the stability and constancy of the outer ear over a person’s lifetime. Unlike facial features, which can change due to aging, emotional states, or other factors, the structure of the outer ear remains largely unchanged, making it a reliable feature for identification. This consistency makes ear biometrics an attractive option for biometric systems. Recent advances in machine learning, particularly in deep learning techniques such as Convolutional Neural Networks (CNNs), have accelerated research in this field. CNNs are especially effective for biometric applications because of their ability to automatically extract distinguishing features from raw ear image data, leading to more accurate and efficient identification systems. This study aims to leverage deep learning approaches to advance human identification using ear biometrics, focusing on developing new models and techniques to improve system precision and effectiveness. The primary objective of this research is to explore the use of deep learning techniques for ear biometrics and develop robust models that surpass traditional methods in accuracy. This involves not only the development of new models but also the use of data augmentation techniques to improve generalization, especially in cases where datasets are limited. In addition, this study examines multi-modal fusion techniques, which combine both ear and profile face images to enhance the identification process. The performance of the proposed models is evaluated using several key metrics, including accuracy, precision, recall, F1-score, and Cumulative Match Curves (CMC). A thorough review of existing research in ear biometrics provides the foundation for this study. While ear biometrics has historically been less researched than other biometric modalities, recent developments in deep learning have opened new possibilities. Multi-modal fusion techniques, for example, have shown promise by combining ear and facial data to improve identification accuracy. However, a significant challenge in this field is the lack of large, high-quality datasets dedicated to ear biometrics, which limits the generalizability of models. To address this issue, data augmentation strategies are employed to introduce variations into the training data, making the models more robust to real-world variations. This research presents three deep learning models designed to address the challenges of person identification using ear biometrics: DeepBio, CSA-GRU, and EMF-CNN. The DeepBio model combines CNNs with Bi-directional Long Shortv Term Memory (BI-LSTM) networks in a hybrid deep learning approach. CNN layers are used to extract meaningful features from ear images, while BI-LSTM networks capture sequential dependencies within the data. Data augmentation techniques such as flipping, rotation, and noise injection are applied to enhance the model’s robustness. DeepBio is evaluated using recognition rate and F1-score as performance metrics, demonstrating improved accuracy and robustness. The second model, CSA-GRU, integrates CNNs with Gated Recurrent Units (GRUs) and self-attention mechanisms. The CNN layers extract spatial features from the ear images, while the GRU layers process temporal information. GRUs are computationally efficient and well-suited for sequential data processing. Selfattention mechanisms allow the network to focus on the most relevant parts of the ear images, improving the model’s ability to distinguish subtle differences. Data augmentation techniques, including Gaussian noise, brightness adjustments, and color jittering, are used to improve the model’s generalization and performance. The final model, EMF-CNN, employs a multi-modal fusion framework that combines ear biometrics with additional features to enhance identification accuracy. This model uses pre-trained CNN architectures for feature extraction and enhances them with adaptive Local Phase Quantization (a-LPQ) and weighted Local Directional Patterns (w-LDP) to capture fine-grained texture information from ear images. The multi-modal approach, which fuses ear and facial data, improves the overall accuracy and robustness of the identification system by leveraging complementary biometric information. The use of these diverse datasets, such as IITD-I, IITD-II, AWE, AMI, EARVN1, and UND, ensures that the models are trained on a comprehensive set of ear images, each captured under varying lighting conditions, angles, and environmental factors. This variety allows the models to generalize well across different scenarios and improves their robustness to real-world conditions. In addition to traditional performance metrics like accuracy, precision, recall, and F1-score, Cumulative Match Curves (CMC) offer a deeper understanding of the models’ ranking abilities, especially in security and forensic applications where identifying the top candidate matches is crucial. These evaluations highlight the strengths and weaknesses of each model, providing insights into areas for further improvement. Overall, this testing framework ensures that the proposed models are not only accurate but also versatile and reliable across diverse biometric applications. In conclusion, this research presents innovative approaches to ear biometrics, contributing significantly to the field of biometric identification. The proposed models—DeepBio, CSA-GRU, and EMF-CNN—demonstrate the potential to envi hance the accuracy and reliability of ear-based identification systems. By integrating advanced deep learning techniques, such as CNNs, GRUs, and self-attention mechanisms, these models outperform traditional methods. Future research could further explore larger datasets, real-time applications, and additional augmentation techniques to refine these models. The proposed approaches hold potential for use in various applications, including security, healthcare, and personalized identity solutions, paving the way for more effective biometric systems.
URI: http://hdl.handle.net/10266/6943
Appears in Collections:Doctoral Theses@EIED

Files in This Item:
File Description SizeFormat 
Thesis Anshul Mahajan.pdf7.92 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.