Cargando…

GRAPE: genomic relatedness detection pipeline

Classifying the degree of relatedness between pairs of individuals has both scientific and commercial applications. As an example, genome-wide association studies (GWAS) may suffer from high rates of false positive results due to unrecognized population structure. This problem becomes especially rel...

Descripción completa

Detalles Bibliográficos
Autores principales: Medvedev, Alexander, Lebedev, Mikhail, Ponomarev, Andrew, Kosaretskiy, Mikhail, Osipenko, Dmitriy, Tischenko, Alexander, Kosaretskiy, Egor, Wang, Hui, Kolobkov, Dmitry, Chamberlain-Evans, Vitalina, Vakhitov, Ruslan, Nikonorov, Pavel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182380/
https://www.ncbi.nlm.nih.gov/pubmed/37224332
http://dx.doi.org/10.12688/f1000research.111658.2
_version_ 1785041759366021120
author Medvedev, Alexander
Lebedev, Mikhail
Ponomarev, Andrew
Kosaretskiy, Mikhail
Osipenko, Dmitriy
Tischenko, Alexander
Kosaretskiy, Egor
Wang, Hui
Kolobkov, Dmitry
Chamberlain-Evans, Vitalina
Vakhitov, Ruslan
Nikonorov, Pavel
author_facet Medvedev, Alexander
Lebedev, Mikhail
Ponomarev, Andrew
Kosaretskiy, Mikhail
Osipenko, Dmitriy
Tischenko, Alexander
Kosaretskiy, Egor
Wang, Hui
Kolobkov, Dmitry
Chamberlain-Evans, Vitalina
Vakhitov, Ruslan
Nikonorov, Pavel
author_sort Medvedev, Alexander
collection PubMed
description Classifying the degree of relatedness between pairs of individuals has both scientific and commercial applications. As an example, genome-wide association studies (GWAS) may suffer from high rates of false positive results due to unrecognized population structure. This problem becomes especially relevant with recent increases in large-cohort studies. Accurate relationship classification is also required for genetic linkage analysis to identify disease-associated loci. Additionally, DNA relatives matching service is one of the leading drivers for the direct-to-consumer genetic testing market. Despite the availability of scientific and research information on the methods for determining kinship and the accessibility of relevant tools, the assembly of the pipeline, which stably operates on a real-world genotypic data, requires significant research and development resources. Currently, there is no open source end-to-end solution for relatedness detection in genomic data, that is fast, reliable and accurate for both close and distant degrees of kinship, combines all the necessary processing steps to work on a real data, and is ready for production integration. To address this, we developed GRAPE: Genomic RelAtedness detection PipelinE. It combines data preprocessing, identity-by-descent (IBD) segments detection, and accurate relationship estimation. The project uses software development best practices, as well as Global Alliance for Genomics and Health (GA4GH) standards and tools. Pipeline efficiency is demonstrated on both simulated and real-world datasets. GRAPE is available from: https://github.com/genxnetwork/grape.
format Online
Article
Text
id pubmed-10182380
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-101823802023-05-14 GRAPE: genomic relatedness detection pipeline Medvedev, Alexander Lebedev, Mikhail Ponomarev, Andrew Kosaretskiy, Mikhail Osipenko, Dmitriy Tischenko, Alexander Kosaretskiy, Egor Wang, Hui Kolobkov, Dmitry Chamberlain-Evans, Vitalina Vakhitov, Ruslan Nikonorov, Pavel F1000Res Software Tool Article Classifying the degree of relatedness between pairs of individuals has both scientific and commercial applications. As an example, genome-wide association studies (GWAS) may suffer from high rates of false positive results due to unrecognized population structure. This problem becomes especially relevant with recent increases in large-cohort studies. Accurate relationship classification is also required for genetic linkage analysis to identify disease-associated loci. Additionally, DNA relatives matching service is one of the leading drivers for the direct-to-consumer genetic testing market. Despite the availability of scientific and research information on the methods for determining kinship and the accessibility of relevant tools, the assembly of the pipeline, which stably operates on a real-world genotypic data, requires significant research and development resources. Currently, there is no open source end-to-end solution for relatedness detection in genomic data, that is fast, reliable and accurate for both close and distant degrees of kinship, combines all the necessary processing steps to work on a real data, and is ready for production integration. To address this, we developed GRAPE: Genomic RelAtedness detection PipelinE. It combines data preprocessing, identity-by-descent (IBD) segments detection, and accurate relationship estimation. The project uses software development best practices, as well as Global Alliance for Genomics and Health (GA4GH) standards and tools. Pipeline efficiency is demonstrated on both simulated and real-world datasets. GRAPE is available from: https://github.com/genxnetwork/grape. F1000 Research Limited 2023-04-13 /pmc/articles/PMC10182380/ /pubmed/37224332 http://dx.doi.org/10.12688/f1000research.111658.2 Text en Copyright: © 2023 Medvedev A et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software Tool Article
Medvedev, Alexander
Lebedev, Mikhail
Ponomarev, Andrew
Kosaretskiy, Mikhail
Osipenko, Dmitriy
Tischenko, Alexander
Kosaretskiy, Egor
Wang, Hui
Kolobkov, Dmitry
Chamberlain-Evans, Vitalina
Vakhitov, Ruslan
Nikonorov, Pavel
GRAPE: genomic relatedness detection pipeline
title GRAPE: genomic relatedness detection pipeline
title_full GRAPE: genomic relatedness detection pipeline
title_fullStr GRAPE: genomic relatedness detection pipeline
title_full_unstemmed GRAPE: genomic relatedness detection pipeline
title_short GRAPE: genomic relatedness detection pipeline
title_sort grape: genomic relatedness detection pipeline
topic Software Tool Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182380/
https://www.ncbi.nlm.nih.gov/pubmed/37224332
http://dx.doi.org/10.12688/f1000research.111658.2
work_keys_str_mv AT medvedevalexander grapegenomicrelatednessdetectionpipeline
AT lebedevmikhail grapegenomicrelatednessdetectionpipeline
AT ponomarevandrew grapegenomicrelatednessdetectionpipeline
AT kosaretskiymikhail grapegenomicrelatednessdetectionpipeline
AT osipenkodmitriy grapegenomicrelatednessdetectionpipeline
AT tischenkoalexander grapegenomicrelatednessdetectionpipeline
AT kosaretskiyegor grapegenomicrelatednessdetectionpipeline
AT wanghui grapegenomicrelatednessdetectionpipeline
AT kolobkovdmitry grapegenomicrelatednessdetectionpipeline
AT chamberlainevansvitalina grapegenomicrelatednessdetectionpipeline
AT vakhitovruslan grapegenomicrelatednessdetectionpipeline
AT nikonorovpavel grapegenomicrelatednessdetectionpipeline