Applying Machine Learning to Root Cause Analysis in Agile CI/CD Software Testing Environments

 |  Login

Show simple item record

dc.contributor Aalto-yliopisto fi
dc.contributor Aalto University en
dc.contributor.advisor Huuhtanen, Timo
dc.contributor.author Kahles Bastida, Julen
dc.date.accessioned 2019-02-03T16:02:43Z
dc.date.available 2019-02-03T16:02:43Z
dc.date.issued 2019-01-28
dc.identifier.uri https://aaltodoc.aalto.fi/handle/123456789/36347
dc.description.abstract This thesis evaluates machine learning classification and clustering algorithms with the aim of automating the root cause analysis of failed tests in agile software testing environments. The inefficiency of manually categorizing the root causes in terms of time and human resources motivates this work. The development and testing environments of an agile team at Ericsson Finland are used as this work's framework. The author of the thesis extracts relevant features from the raw log data after interviewing the team's testing engineers (human experts). The author puts his initial efforts into clustering the unlabeled data, and despite obtaining qualitative correlations between several clusters and failure root causes, the vagueness in the rest of the clusters leads to the consideration of labeling. The author then carries out a new round of interviews with the testing engineers, which leads to the conceptualization of ground-truth categories for the test failures. With these, the human experts label the dataset accordingly. A collection of artificial neural networks that either classify the data or pre-process it for clustering is then optimized by the author. The best solution comes in the form of a classification multilayer perceptron that correctly assigns the failure category to new examples, on average, 88.9\% of the time. The primary outcome of this thesis comes in the form of a methodology for the extraction of expert knowledge and its adaptation to machine learning techniques for test failure root cause analysis using test log data. The proposed methodology constitutes a prototype or baseline approach towards achieving this objective in a corporate environment. en
dc.format.extent 79
dc.format.mimetype application/pdf en
dc.language.iso en en
dc.title Applying Machine Learning to Root Cause Analysis in Agile CI/CD Software Testing Environments en
dc.type G2 Pro gradu, diplomityö fi
dc.contributor.school Perustieteiden korkeakoulu fi
dc.subject.keyword root cause analysis en
dc.subject.keyword software testing en
dc.subject.keyword log data analysis en
dc.subject.keyword machine learning en
dc.subject.keyword neural networks en
dc.subject.keyword automation en
dc.identifier.urn URN:NBN:fi:aalto-201902031516
dc.programme.major Acoustics and Audio Technology fi
dc.programme.mcode ELEC3030 fi
dc.type.ontasot Master's thesis en
dc.type.ontasot Diplomityö fi
dc.contributor.supervisor Jung, Alexander
dc.programme Master’s Programme in Computer, Communication and Information Sciences fi
local.aalto.electroniconly yes
local.aalto.openaccess yes


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search archive


Advanced Search

article-iconSubmit a publication

Browse

My Account