Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication

Nowadays, continuous integration (CI) is indispensable in the software development process. A central promise of adopting CI is that new features or bug fixes can be delivered more quickly. A recent repository mining study by Bernardo, da Costa & Kulesza (2018) found that only about half of the...

Bibliographic Details
Main authors: Guo, Yunfang; Leitner, Philipp
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924497/
https://www.ncbi.nlm.nih.gov/pubmed/33816896
http://dx.doi.org/10.7717/peerj-cs.245
author Guo, Yunfang
Leitner, Philipp
collection PubMed
description Nowadays, continuous integration (CI) is indispensable in the software development process. A central promise of adopting CI is that new features or bug fixes can be delivered more quickly. A recent repository mining study by Bernardo, da Costa & Kulesza (2018) found that only about half of the investigated open source projects actually deliver pull requests (PR) faster after adopting CI, with small effect sizes. However, there are some concerns regarding the methodology used by Bernardo et al., which may potentially limit the trustworthiness of this finding. Particularly, they do not explicitly control for normal changes in the pull request delivery time during a project’s lifetime (independently of CI introduction). Hence, in our work, we conduct a conceptual replication of this study. In a first step, we replicate their study results using the same subjects and methodology. In a second step, we address the same core research question using an adapted methodology. We use a different statistical method (regression discontinuity design, RDD) that is more robust towards the confounding factor of projects potentially getting faster in delivering PRs over time naturally, and we introduce a control group of comparable projects that never applied CI. Finally, we also evaluate the generalizability of the original findings on a set of new open source projects sampled using the same methodology. We find that the results of the study by Bernardo et al. largely hold in our replication. Using RDD, we do not find robust evidence of projects getting faster at delivering PRs without CI, and we similarly do not see a speed-up in our control group that never introduced CI. Further, results obtained from a newly mined set of projects are comparable to the original findings. In conclusion, we consider the replication successful.
format Online
Article
Text
id pubmed-7924497
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-7924497 2021-04-02 Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication Guo, Yunfang Leitner, Philipp PeerJ Comput Sci Software Engineering PeerJ Inc. 2019-12-16 /pmc/articles/PMC7924497/ /pubmed/33816896 http://dx.doi.org/10.7717/peerj-cs.245 Text en ©2019 Guo and Leitner https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
title Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication
topic Software Engineering