Cargando…

Identification of functional genetic variation in exome sequence analysis

Recent technological advances have allowed us to study individual genomes at a base-pair resolution and have demonstrated that the average exome harbors more than 15,000 genetic variants. However, our ability to understand the biological significance of the identified variants and to connect these o...

Descripción completa

Detalles Bibliográficos
Autores principales: Jaffe, Andrew, Wojcik, Genevieve, Chu, Audrey, Golozar, Asieh, Maroo, Ankit, Duggal, Priya, Klein, Alison P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287847/
https://www.ncbi.nlm.nih.gov/pubmed/22373437
http://dx.doi.org/10.1186/1753-6561-5-S9-S13
_version_ 1782224757191606272
author Jaffe, Andrew
Wojcik, Genevieve
Chu, Audrey
Golozar, Asieh
Maroo, Ankit
Duggal, Priya
Klein, Alison P
author_facet Jaffe, Andrew
Wojcik, Genevieve
Chu, Audrey
Golozar, Asieh
Maroo, Ankit
Duggal, Priya
Klein, Alison P
author_sort Jaffe, Andrew
collection PubMed
description Recent technological advances have allowed us to study individual genomes at a base-pair resolution and have demonstrated that the average exome harbors more than 15,000 genetic variants. However, our ability to understand the biological significance of the identified variants and to connect these observed variants with phenotypes is limited. The first step in this process is to identify genetic variation that is likely to result in changes to protein structure and function, because detailed studies, either population based or functional, for each of the identified variants are not practicable. Therefore algorithms that yield valid predictions of a variant’s functional significance are needed. Over the past decade, several programs have been developed to predict the probability that an observed sequence variant will have a deleterious effect on protein function. These algorithms range from empirical programs that classify using known biochemical properties to statistical algorithms trained using a variety of data sources, including sequence conservation data, biochemical properties, and functional data. Using data from the pilot3 study of the 1000 Genomes Project available through Genetic Analysis Workshop 17, we compared the results of four programs (SIFT, PolyPhen, MAPP, and VarioWatch) used to predict the functional relevance of variants in 101 genes. Analysis was conducted without knowledge of the simulation model. Agreement between programs was modest ranging from 59.4% to 71.4% and only 3.5% of variants were classified as deleterious and 10.9% as tolerated across all four programs.
format Online
Article
Text
id pubmed-3287847
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32878472012-02-28 Identification of functional genetic variation in exome sequence analysis Jaffe, Andrew Wojcik, Genevieve Chu, Audrey Golozar, Asieh Maroo, Ankit Duggal, Priya Klein, Alison P BMC Proc Proceedings Recent technological advances have allowed us to study individual genomes at a base-pair resolution and have demonstrated that the average exome harbors more than 15,000 genetic variants. However, our ability to understand the biological significance of the identified variants and to connect these observed variants with phenotypes is limited. The first step in this process is to identify genetic variation that is likely to result in changes to protein structure and function, because detailed studies, either population based or functional, for each of the identified variants are not practicable. Therefore algorithms that yield valid predictions of a variant’s functional significance are needed. Over the past decade, several programs have been developed to predict the probability that an observed sequence variant will have a deleterious effect on protein function. These algorithms range from empirical programs that classify using known biochemical properties to statistical algorithms trained using a variety of data sources, including sequence conservation data, biochemical properties, and functional data. Using data from the pilot3 study of the 1000 Genomes Project available through Genetic Analysis Workshop 17, we compared the results of four programs (SIFT, PolyPhen, MAPP, and VarioWatch) used to predict the functional relevance of variants in 101 genes. Analysis was conducted without knowledge of the simulation model. Agreement between programs was modest ranging from 59.4% to 71.4% and only 3.5% of variants were classified as deleterious and 10.9% as tolerated across all four programs. BioMed Central 2011-11-29 /pmc/articles/PMC3287847/ /pubmed/22373437 http://dx.doi.org/10.1186/1753-6561-5-S9-S13 Text en Copyright ©2011 Jaffe et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Jaffe, Andrew
Wojcik, Genevieve
Chu, Audrey
Golozar, Asieh
Maroo, Ankit
Duggal, Priya
Klein, Alison P
Identification of functional genetic variation in exome sequence analysis
title Identification of functional genetic variation in exome sequence analysis
title_full Identification of functional genetic variation in exome sequence analysis
title_fullStr Identification of functional genetic variation in exome sequence analysis
title_full_unstemmed Identification of functional genetic variation in exome sequence analysis
title_short Identification of functional genetic variation in exome sequence analysis
title_sort identification of functional genetic variation in exome sequence analysis
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287847/
https://www.ncbi.nlm.nih.gov/pubmed/22373437
http://dx.doi.org/10.1186/1753-6561-5-S9-S13
work_keys_str_mv AT jaffeandrew identificationoffunctionalgeneticvariationinexomesequenceanalysis
AT wojcikgenevieve identificationoffunctionalgeneticvariationinexomesequenceanalysis
AT chuaudrey identificationoffunctionalgeneticvariationinexomesequenceanalysis
AT golozarasieh identificationoffunctionalgeneticvariationinexomesequenceanalysis
AT marooankit identificationoffunctionalgeneticvariationinexomesequenceanalysis
AT duggalpriya identificationoffunctionalgeneticvariationinexomesequenceanalysis
AT kleinalisonp identificationoffunctionalgeneticvariationinexomesequenceanalysis