loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Daniel Atzberger 1 ; Tim Cech 2 ; Willy Scheibel 1 ; Rico Richter 2 and Jürgen Döllner 1

Affiliations: 1 University of Potsdam, Digital Engineering Faculty, Hasso Plattner Institute, Germany ; 2 University of Potsdam, Digital Engineering Faculty, Germany

Keyword(s): Log Analysis, Anomaly Detection, Event-Streaming, Latent Dirichlet Allocation.

Abstract: Continuous Integration and Continuous Delivery are best practices used in the context of DevOps. By using automated pipelines for building and testing small software changes, possible risks are intended to be detected early. Those pipelines continuously generate log events that are collected in semi-structured log files. In practice, these log files can amass 100 000 events and more. However, the relevant sections in these log files must be manually tagged by the user. This paper presents an online learning approach for detecting relevant log events using Latent Dirichlet Allocation. After grouping a fixed number of log events in a document, our approach prunes the vocabulary to eliminate words without semantic meaning. A sequence of documents is then described as a discrete sequence by applying Latent Dirichlet Allocation, which allows the detection of outliers within the sequence. By integrating the latent variables of the model, our approach provides an explanation of its predicti on. Our experiments show that our approach is sensitive to the choice of its hyperparameters in terms of the number and choice of detected anomalies. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 98.81.24.230

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Atzberger, D.; Cech, T.; Scheibel, W.; Richter, R. and Döllner, J. (2023). Detecting Outliers in CI/CD Pipeline Logs Using Latent Dirichlet Allocation. In Proceedings of the 18th International Conference on Evaluation of Novel Approaches to Software Engineering - ENASE; ISBN 978-989-758-647-7; ISSN 2184-4895, SciTePress, pages 461-468. DOI: 10.5220/0011858500003464

@conference{enase23,
author={Daniel Atzberger. and Tim Cech. and Willy Scheibel. and Rico Richter. and Jürgen Döllner.},
title={Detecting Outliers in CI/CD Pipeline Logs Using Latent Dirichlet Allocation},
booktitle={Proceedings of the 18th International Conference on Evaluation of Novel Approaches to Software Engineering - ENASE},
year={2023},
pages={461-468},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011858500003464},
isbn={978-989-758-647-7},
issn={2184-4895},
}

TY - CONF

JO - Proceedings of the 18th International Conference on Evaluation of Novel Approaches to Software Engineering - ENASE
TI - Detecting Outliers in CI/CD Pipeline Logs Using Latent Dirichlet Allocation
SN - 978-989-758-647-7
IS - 2184-4895
AU - Atzberger, D.
AU - Cech, T.
AU - Scheibel, W.
AU - Richter, R.
AU - Döllner, J.
PY - 2023
SP - 461
EP - 468
DO - 10.5220/0011858500003464
PB - SciTePress