Cargando…

Dataset of open-source software developers labeled by their experience level in the project and their associated software metrics

Developers are extracted from 17 open-source projects from GitHub. Projects are chosen that use the java programming language, the Spring framework and Maven/Gradle build tools. Along with these developers, 24 software engineering metrics are extracted for each of them. These metrics are either calc...

Descripción completa

Detalles Bibliográficos
Autores principales: Perez, Quentin, Urtado, Christelle, Vauttier, Sylvain
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9813504/
https://www.ncbi.nlm.nih.gov/pubmed/36619255
http://dx.doi.org/10.1016/j.dib.2022.108842
_version_ 1784863936103841792
author Perez, Quentin
Urtado, Christelle
Vauttier, Sylvain
author_facet Perez, Quentin
Urtado, Christelle
Vauttier, Sylvain
author_sort Perez, Quentin
collection PubMed
description Developers are extracted from 17 open-source projects from GitHub. Projects are chosen that use the java programming language, the Spring framework and Maven/Gradle build tools. Along with these developers, 24 software engineering metrics are extracted for each of them. These metrics are either calculated by analyzing the source code or relative to project management metadata. Each of these developers then are manually searched for in professional social media such as LinkedIn or Twitter to be labeled with their experience level in their project. Outliers are statistically detected and manually re-assigned when needed. The resulting dataset contains 703 anonymized developers qualified by their 24 project-related software engineering metrics and labeled for their experience. It is suitable for empirical software engineering studies that need to connect developers’ level of experience to tangible software engineering metrics.
format Online
Article
Text
id pubmed-9813504
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-98135042023-01-06 Dataset of open-source software developers labeled by their experience level in the project and their associated software metrics Perez, Quentin Urtado, Christelle Vauttier, Sylvain Data Brief Data Article Developers are extracted from 17 open-source projects from GitHub. Projects are chosen that use the java programming language, the Spring framework and Maven/Gradle build tools. Along with these developers, 24 software engineering metrics are extracted for each of them. These metrics are either calculated by analyzing the source code or relative to project management metadata. Each of these developers then are manually searched for in professional social media such as LinkedIn or Twitter to be labeled with their experience level in their project. Outliers are statistically detected and manually re-assigned when needed. The resulting dataset contains 703 anonymized developers qualified by their 24 project-related software engineering metrics and labeled for their experience. It is suitable for empirical software engineering studies that need to connect developers’ level of experience to tangible software engineering metrics. Elsevier 2022-12-19 /pmc/articles/PMC9813504/ /pubmed/36619255 http://dx.doi.org/10.1016/j.dib.2022.108842 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Perez, Quentin
Urtado, Christelle
Vauttier, Sylvain
Dataset of open-source software developers labeled by their experience level in the project and their associated software metrics
title Dataset of open-source software developers labeled by their experience level in the project and their associated software metrics
title_full Dataset of open-source software developers labeled by their experience level in the project and their associated software metrics
title_fullStr Dataset of open-source software developers labeled by their experience level in the project and their associated software metrics
title_full_unstemmed Dataset of open-source software developers labeled by their experience level in the project and their associated software metrics
title_short Dataset of open-source software developers labeled by their experience level in the project and their associated software metrics
title_sort dataset of open-source software developers labeled by their experience level in the project and their associated software metrics
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9813504/
https://www.ncbi.nlm.nih.gov/pubmed/36619255
http://dx.doi.org/10.1016/j.dib.2022.108842
work_keys_str_mv AT perezquentin datasetofopensourcesoftwaredeveloperslabeledbytheirexperiencelevelintheprojectandtheirassociatedsoftwaremetrics
AT urtadochristelle datasetofopensourcesoftwaredeveloperslabeledbytheirexperiencelevelintheprojectandtheirassociatedsoftwaremetrics
AT vauttiersylvain datasetofopensourcesoftwaredeveloperslabeledbytheirexperiencelevelintheprojectandtheirassociatedsoftwaremetrics