Cargando…

Log-based software monitoring: a systematic mapping study

Modern software development and operations rely on monitoring to understand how systems behave in production. The data provided by application logs and runtime environment are essential to detect and diagnose undesired behavior and improve system reliability. However, despite the rich ecosystem arou...

Descripción completa

Detalles Bibliográficos
Autores principales: Cândido, Jeanderson, Aniche, Maurício, van Deursen, Arie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8114802/
https://www.ncbi.nlm.nih.gov/pubmed/34013028
http://dx.doi.org/10.7717/peerj-cs.489
_version_ 1783691120310484992
author Cândido, Jeanderson
Aniche, Maurício
van Deursen, Arie
author_facet Cândido, Jeanderson
Aniche, Maurício
van Deursen, Arie
author_sort Cândido, Jeanderson
collection PubMed
description Modern software development and operations rely on monitoring to understand how systems behave in production. The data provided by application logs and runtime environment are essential to detect and diagnose undesired behavior and improve system reliability. However, despite the rich ecosystem around industry-ready log solutions, monitoring complex systems and getting insights from log data remains a challenge. Researchers and practitioners have been actively working to address several challenges related to logs, e.g., how to effectively provide better tooling support for logging decisions to developers, how to effectively process and store log data, and how to extract insights from log data. A holistic view of the research effort on logging practices and automated log analysis is key to provide directions and disseminate the state-of-the-art for technology transfer. In this paper, we study 108 papers (72 research track papers, 24 journals, and 12 industry track papers) from different communities (e.g., machine learning, software engineering, and systems) and structure the research field in light of the life-cycle of log data. Our analysis shows that (1) logging is challenging not only in open-source projects but also in industry, (2) machine learning is a promising approach to enable a contextual analysis of source code for log recommendation but further investigation is required to assess the usability of those tools in practice, (3) few studies approached efficient persistence of log data, and (4) there are open opportunities to analyze application logs and to evaluate state-of-the-art log analysis techniques in a DevOps context.
format Online
Article
Text
id pubmed-8114802
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-81148022021-05-18 Log-based software monitoring: a systematic mapping study Cândido, Jeanderson Aniche, Maurício van Deursen, Arie PeerJ Comput Sci Emerging Technologies Modern software development and operations rely on monitoring to understand how systems behave in production. The data provided by application logs and runtime environment are essential to detect and diagnose undesired behavior and improve system reliability. However, despite the rich ecosystem around industry-ready log solutions, monitoring complex systems and getting insights from log data remains a challenge. Researchers and practitioners have been actively working to address several challenges related to logs, e.g., how to effectively provide better tooling support for logging decisions to developers, how to effectively process and store log data, and how to extract insights from log data. A holistic view of the research effort on logging practices and automated log analysis is key to provide directions and disseminate the state-of-the-art for technology transfer. In this paper, we study 108 papers (72 research track papers, 24 journals, and 12 industry track papers) from different communities (e.g., machine learning, software engineering, and systems) and structure the research field in light of the life-cycle of log data. Our analysis shows that (1) logging is challenging not only in open-source projects but also in industry, (2) machine learning is a promising approach to enable a contextual analysis of source code for log recommendation but further investigation is required to assess the usability of those tools in practice, (3) few studies approached efficient persistence of log data, and (4) there are open opportunities to analyze application logs and to evaluate state-of-the-art log analysis techniques in a DevOps context. PeerJ Inc. 2021-05-06 /pmc/articles/PMC8114802/ /pubmed/34013028 http://dx.doi.org/10.7717/peerj-cs.489 Text en © 2021 Cândido et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Emerging Technologies
Cândido, Jeanderson
Aniche, Maurício
van Deursen, Arie
Log-based software monitoring: a systematic mapping study
title Log-based software monitoring: a systematic mapping study
title_full Log-based software monitoring: a systematic mapping study
title_fullStr Log-based software monitoring: a systematic mapping study
title_full_unstemmed Log-based software monitoring: a systematic mapping study
title_short Log-based software monitoring: a systematic mapping study
title_sort log-based software monitoring: a systematic mapping study
topic Emerging Technologies
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8114802/
https://www.ncbi.nlm.nih.gov/pubmed/34013028
http://dx.doi.org/10.7717/peerj-cs.489
work_keys_str_mv AT candidojeanderson logbasedsoftwaremonitoringasystematicmappingstudy
AT anichemauricio logbasedsoftwaremonitoringasystematicmappingstudy
AT vandeursenarie logbasedsoftwaremonitoringasystematicmappingstudy