Cargando…

Pedigree based DNA sequencing pipeline for germline genomes of cancer families

BACKGROUND: In the course of our whole-genome sequencing efforts, we have developed a pipeline for analyzing germline genomes from Mendelian types of cancer pedigrees (familial cancer variant prioritization pipeline, FCVPP). RESULTS: The variant calling step distinguishes two types of genomic varian...

Descripción completa

Detalles Bibliográficos
Autores principales: Försti, Asta, Kumar, Abhishek, Paramasivam, Nagarajan, Schlesner, Matthias, Catalano, Calogerina, Dymerska, Dagmara, Lubinski, Jan, Eils, Roland, Hemminki, Kari
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4977614/
https://www.ncbi.nlm.nih.gov/pubmed/27508007
http://dx.doi.org/10.1186/s13053-016-0058-1
_version_ 1782447061136834560
author Försti, Asta
Kumar, Abhishek
Paramasivam, Nagarajan
Schlesner, Matthias
Catalano, Calogerina
Dymerska, Dagmara
Lubinski, Jan
Eils, Roland
Hemminki, Kari
author_facet Försti, Asta
Kumar, Abhishek
Paramasivam, Nagarajan
Schlesner, Matthias
Catalano, Calogerina
Dymerska, Dagmara
Lubinski, Jan
Eils, Roland
Hemminki, Kari
author_sort Försti, Asta
collection PubMed
description BACKGROUND: In the course of our whole-genome sequencing efforts, we have developed a pipeline for analyzing germline genomes from Mendelian types of cancer pedigrees (familial cancer variant prioritization pipeline, FCVPP). RESULTS: The variant calling step distinguishes two types of genomic variants: single nucleotide variants (SNVs) and indels, which undergo technical quality control. Mendelian types of variants are assumed to be rare and variants with frequencies higher that 0.1 % are screened out using human 1000 Genomes (Phase 3) and non-TCGA ExAC population data. Segregation in the pedigree allows variants to be present in affected family members and not in old, unaffected ones. The effectiveness of variant segregation depends on the number and relatedness of the family members: if over 5 third-degree (or more distant) relatives are available, the experience has shown that the number of likely variants is reduced from many hundreds to a few tens. These are then subjected to bioinformatics analysis, starting with the combined annotation dependent depletion (CADD) tool, which predicts the likelihood of the variant being deleterious. Different sets of individual tools are used for further evaluation of the deleteriousness of coding variants, 5’ and 3’ untranslated regions (UTRs), and intergenic variants. CONLUSIONS: The likelihood of success of the present genomic pipeline in finding novel high- or medium-penetrant genes depends on many steps but first and foremost, the pedigree needs to be reasonably large and the assignments and diagnoses among the members need to be correct.
format Online
Article
Text
id pubmed-4977614
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49776142016-08-10 Pedigree based DNA sequencing pipeline for germline genomes of cancer families Försti, Asta Kumar, Abhishek Paramasivam, Nagarajan Schlesner, Matthias Catalano, Calogerina Dymerska, Dagmara Lubinski, Jan Eils, Roland Hemminki, Kari Hered Cancer Clin Pract Research BACKGROUND: In the course of our whole-genome sequencing efforts, we have developed a pipeline for analyzing germline genomes from Mendelian types of cancer pedigrees (familial cancer variant prioritization pipeline, FCVPP). RESULTS: The variant calling step distinguishes two types of genomic variants: single nucleotide variants (SNVs) and indels, which undergo technical quality control. Mendelian types of variants are assumed to be rare and variants with frequencies higher that 0.1 % are screened out using human 1000 Genomes (Phase 3) and non-TCGA ExAC population data. Segregation in the pedigree allows variants to be present in affected family members and not in old, unaffected ones. The effectiveness of variant segregation depends on the number and relatedness of the family members: if over 5 third-degree (or more distant) relatives are available, the experience has shown that the number of likely variants is reduced from many hundreds to a few tens. These are then subjected to bioinformatics analysis, starting with the combined annotation dependent depletion (CADD) tool, which predicts the likelihood of the variant being deleterious. Different sets of individual tools are used for further evaluation of the deleteriousness of coding variants, 5’ and 3’ untranslated regions (UTRs), and intergenic variants. CONLUSIONS: The likelihood of success of the present genomic pipeline in finding novel high- or medium-penetrant genes depends on many steps but first and foremost, the pedigree needs to be reasonably large and the assignments and diagnoses among the members need to be correct. BioMed Central 2016-08-09 /pmc/articles/PMC4977614/ /pubmed/27508007 http://dx.doi.org/10.1186/s13053-016-0058-1 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Försti, Asta
Kumar, Abhishek
Paramasivam, Nagarajan
Schlesner, Matthias
Catalano, Calogerina
Dymerska, Dagmara
Lubinski, Jan
Eils, Roland
Hemminki, Kari
Pedigree based DNA sequencing pipeline for germline genomes of cancer families
title Pedigree based DNA sequencing pipeline for germline genomes of cancer families
title_full Pedigree based DNA sequencing pipeline for germline genomes of cancer families
title_fullStr Pedigree based DNA sequencing pipeline for germline genomes of cancer families
title_full_unstemmed Pedigree based DNA sequencing pipeline for germline genomes of cancer families
title_short Pedigree based DNA sequencing pipeline for germline genomes of cancer families
title_sort pedigree based dna sequencing pipeline for germline genomes of cancer families
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4977614/
https://www.ncbi.nlm.nih.gov/pubmed/27508007
http://dx.doi.org/10.1186/s13053-016-0058-1
work_keys_str_mv AT forstiasta pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies
AT kumarabhishek pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies
AT paramasivamnagarajan pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies
AT schlesnermatthias pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies
AT catalanocalogerina pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies
AT dymerskadagmara pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies
AT lubinskijan pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies
AT eilsroland pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies
AT hemminkikari pedigreebaseddnasequencingpipelineforgermlinegenomesofcancerfamilies