Please use this identifier to cite or link to this item:
Title: Log Files Based Prediction of Failure Types using Deep Learning Models
Authors: Patel, Nidhi
Supervisor: Sharma, R.K.
Keywords: Deep Learning;Failure Classification;CNN;RNN-LSTM;Log Files
Issue Date: 26-Jul-2018
Abstract: In order to verify the correct working of a firmware, a firmware engineer writes various test cases, out of which some tests can get failed due to various issues. Firmware engineer finds the reasons behind the failure of test cases based on logs generated against the test cases, so that they can be resolved and proper verification of firmware can be done. Thus this log files based prediction of a failure types system is developed which is capable of identifying the patterns in the log files that are generated for a test across regression by a firmware engineer and able to classify it in a particular failure type. Error logs are an important source of information both for diagnosis as well as for proactive test failure handling. This log file based prediction includes different stages, namely, Pre-processing, Feature Extraction and Prediction. In Pre-processing phase, regular expressions are written to remove all the non- alphanumeric characters (noise) from logs and to extract test statistics and error description as this is the valuable piece of information or pattern that will help in prediction of failure type. Feature extraction is a process in which we try to extract only that relevant information from the pre-processed data. It can be a low level feature or high level feature. Different classifiers have been used to predict the failure type. This thesis presents a Deep Learning model based prediction system which is trained using Adaptive Moment Estimation (Adam), a Stochastic Optimization algorithm for parameters updation. The Deep Learning models, namely, Convolutional Neural Network (CNN), Recurrent Neural Network-LSTM (RNN-LSTM), and Recurrent Convolutional Neural Network (RNN-CNN) are analogous to neural networks having learnable weights and biases which evaluates log vector to final winner class scores. A comprehensive recognition rate of 87.0% using CNN, 84.0% using RNN-LSTM and 84.0% using RNN-CNN is achieved on a set of 10,000 logs.
Appears in Collections:Masters Theses@CSED

Files in This Item:
File Description SizeFormat 
Nidhi Patel 26 July 2018.pdf3.95 MBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.