Cargando…
The STRidER Report on Two Years of Quality Control of Autosomal STR Population Datasets
STRidER, the STRs for Identity ENFSI Reference Database, is a curated, freely publicly available online allele frequency database, quality control (QC) and software platform for autosomal Short Tandem Repeats (STRs) developed under the endorsement of the International Society for Forensic Genetics....
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7463946/ https://www.ncbi.nlm.nih.gov/pubmed/32784546 http://dx.doi.org/10.3390/genes11080901 |
_version_ | 1783577251048062976 |
---|---|
author | Bodner, Martin Parson, Walther |
author_facet | Bodner, Martin Parson, Walther |
author_sort | Bodner, Martin |
collection | PubMed |
description | STRidER, the STRs for Identity ENFSI Reference Database, is a curated, freely publicly available online allele frequency database, quality control (QC) and software platform for autosomal Short Tandem Repeats (STRs) developed under the endorsement of the International Society for Forensic Genetics. Continuous updates comprise additional STR loci and populations in the frequency database and many further STR-related aspects. One significant innovation is the autosomal STR data QC provided prior to publication of datasets. Such scrutiny was lacking previously, leaving QC to authors, reviewers and editors, which led to an unacceptably high error rate in scientific papers. The results from scrutinizing 184 STR datasets containing >177,000 individual genotypes submitted in the first two years of STRidER QC since 2017 revealed that about two-thirds of the STR datasets were either being withdrawn by the authors after initial feedback or rejected based on a conservative error rate. Almost no error-free submissions were received, which clearly shows that centralized QC and data curation are essential to maintain the high-quality standard required in forensic genetics. While many errors had minor impact on the resulting allele frequencies, multiple error categories were commonly found within single datasets. Several datasets contained serious flaws. We discuss the factors that caused the errors to draw the attention to redundant pitfalls and thus contribute to better quality of autosomal STR datasets and allele frequency reports. |
format | Online Article Text |
id | pubmed-7463946 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-74639462020-09-04 The STRidER Report on Two Years of Quality Control of Autosomal STR Population Datasets Bodner, Martin Parson, Walther Genes (Basel) Article STRidER, the STRs for Identity ENFSI Reference Database, is a curated, freely publicly available online allele frequency database, quality control (QC) and software platform for autosomal Short Tandem Repeats (STRs) developed under the endorsement of the International Society for Forensic Genetics. Continuous updates comprise additional STR loci and populations in the frequency database and many further STR-related aspects. One significant innovation is the autosomal STR data QC provided prior to publication of datasets. Such scrutiny was lacking previously, leaving QC to authors, reviewers and editors, which led to an unacceptably high error rate in scientific papers. The results from scrutinizing 184 STR datasets containing >177,000 individual genotypes submitted in the first two years of STRidER QC since 2017 revealed that about two-thirds of the STR datasets were either being withdrawn by the authors after initial feedback or rejected based on a conservative error rate. Almost no error-free submissions were received, which clearly shows that centralized QC and data curation are essential to maintain the high-quality standard required in forensic genetics. While many errors had minor impact on the resulting allele frequencies, multiple error categories were commonly found within single datasets. Several datasets contained serious flaws. We discuss the factors that caused the errors to draw the attention to redundant pitfalls and thus contribute to better quality of autosomal STR datasets and allele frequency reports. MDPI 2020-08-07 /pmc/articles/PMC7463946/ /pubmed/32784546 http://dx.doi.org/10.3390/genes11080901 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Bodner, Martin Parson, Walther The STRidER Report on Two Years of Quality Control of Autosomal STR Population Datasets |
title | The STRidER Report on Two Years of Quality Control of Autosomal STR Population Datasets |
title_full | The STRidER Report on Two Years of Quality Control of Autosomal STR Population Datasets |
title_fullStr | The STRidER Report on Two Years of Quality Control of Autosomal STR Population Datasets |
title_full_unstemmed | The STRidER Report on Two Years of Quality Control of Autosomal STR Population Datasets |
title_short | The STRidER Report on Two Years of Quality Control of Autosomal STR Population Datasets |
title_sort | strider report on two years of quality control of autosomal str population datasets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7463946/ https://www.ncbi.nlm.nih.gov/pubmed/32784546 http://dx.doi.org/10.3390/genes11080901 |
work_keys_str_mv | AT bodnermartin thestriderreportontwoyearsofqualitycontrolofautosomalstrpopulationdatasets AT parsonwalther thestriderreportontwoyearsofqualitycontrolofautosomalstrpopulationdatasets AT bodnermartin striderreportontwoyearsofqualitycontrolofautosomalstrpopulationdatasets AT parsonwalther striderreportontwoyearsofqualitycontrolofautosomalstrpopulationdatasets |