Cargando…

Precise detection of de novo single nucleotide variants in human genomes

The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent–offspring trio to a reference genome, allowing for a...

Descripción completa

Detalles Bibliográficos
Autores principales: Gómez-Romero, Laura, Palacios-Flores, Kim, Reyes, José, García, Delfino, Boege, Margareta, Dávila, Guillermo, Flores, Margarita, Schatz, Michael C., Palacios, Rafael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003530/
https://www.ncbi.nlm.nih.gov/pubmed/29735690
http://dx.doi.org/10.1073/pnas.1802244115
_version_ 1783332373160525824
author Gómez-Romero, Laura
Palacios-Flores, Kim
Reyes, José
García, Delfino
Boege, Margareta
Dávila, Guillermo
Flores, Margarita
Schatz, Michael C.
Palacios, Rafael
author_facet Gómez-Romero, Laura
Palacios-Flores, Kim
Reyes, José
García, Delfino
Boege, Margareta
Dávila, Guillermo
Flores, Margarita
Schatz, Michael C.
Palacios, Rafael
author_sort Gómez-Romero, Laura
collection PubMed
description The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent–offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent–offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use.
format Online
Article
Text
id pubmed-6003530
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-60035302018-06-18 Precise detection of de novo single nucleotide variants in human genomes Gómez-Romero, Laura Palacios-Flores, Kim Reyes, José García, Delfino Boege, Margareta Dávila, Guillermo Flores, Margarita Schatz, Michael C. Palacios, Rafael Proc Natl Acad Sci U S A Biological Sciences The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent–offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent–offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use. National Academy of Sciences 2018-05-22 2018-05-07 /pmc/articles/PMC6003530/ /pubmed/29735690 http://dx.doi.org/10.1073/pnas.1802244115 Text en Copyright © 2018 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Gómez-Romero, Laura
Palacios-Flores, Kim
Reyes, José
García, Delfino
Boege, Margareta
Dávila, Guillermo
Flores, Margarita
Schatz, Michael C.
Palacios, Rafael
Precise detection of de novo single nucleotide variants in human genomes
title Precise detection of de novo single nucleotide variants in human genomes
title_full Precise detection of de novo single nucleotide variants in human genomes
title_fullStr Precise detection of de novo single nucleotide variants in human genomes
title_full_unstemmed Precise detection of de novo single nucleotide variants in human genomes
title_short Precise detection of de novo single nucleotide variants in human genomes
title_sort precise detection of de novo single nucleotide variants in human genomes
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003530/
https://www.ncbi.nlm.nih.gov/pubmed/29735690
http://dx.doi.org/10.1073/pnas.1802244115
work_keys_str_mv AT gomezromerolaura precisedetectionofdenovosinglenucleotidevariantsinhumangenomes
AT palaciosfloreskim precisedetectionofdenovosinglenucleotidevariantsinhumangenomes
AT reyesjose precisedetectionofdenovosinglenucleotidevariantsinhumangenomes
AT garciadelfino precisedetectionofdenovosinglenucleotidevariantsinhumangenomes
AT boegemargareta precisedetectionofdenovosinglenucleotidevariantsinhumangenomes
AT davilaguillermo precisedetectionofdenovosinglenucleotidevariantsinhumangenomes
AT floresmargarita precisedetectionofdenovosinglenucleotidevariantsinhumangenomes
AT schatzmichaelc precisedetectionofdenovosinglenucleotidevariantsinhumangenomes
AT palaciosrafael precisedetectionofdenovosinglenucleotidevariantsinhumangenomes