Cargando…

FLAGS, frequently mutated genes in public exomes

BACKGROUND: Dramatic improvements in DNA-sequencing technologies and computational analyses have led to wide use of whole exome sequencing (WES) to identify the genetic basis of Mendelian disorders. More than 180 novel rare-disease-causing genes with Mendelian inheritance patterns have been discover...

Descripción completa

Detalles Bibliográficos
Autores principales: Shyr, Casper, Tarailo-Graovac, Maja, Gottlieb, Michael, Lee, Jessica JY, van Karnebeek, Clara, Wasserman, Wyeth W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4267152/
https://www.ncbi.nlm.nih.gov/pubmed/25466818
http://dx.doi.org/10.1186/s12920-014-0064-y
_version_ 1782349109992095744
author Shyr, Casper
Tarailo-Graovac, Maja
Gottlieb, Michael
Lee, Jessica JY
van Karnebeek, Clara
Wasserman, Wyeth W
author_facet Shyr, Casper
Tarailo-Graovac, Maja
Gottlieb, Michael
Lee, Jessica JY
van Karnebeek, Clara
Wasserman, Wyeth W
author_sort Shyr, Casper
collection PubMed
description BACKGROUND: Dramatic improvements in DNA-sequencing technologies and computational analyses have led to wide use of whole exome sequencing (WES) to identify the genetic basis of Mendelian disorders. More than 180 novel rare-disease-causing genes with Mendelian inheritance patterns have been discovered through sequencing the exomes of just a few unrelated individuals or family members. As rare/novel genetic variants continue to be uncovered, there is a major challenge in distinguishing true pathogenic variants from rare benign mutations. METHODS: We used publicly available exome cohorts, together with the dbSNP database, to derive a list of genes (n = 100) that most frequently exhibit rare (<1%) non-synonymous/splice-site variants in general populations. We termed these genes FLAGS for FrequentLy mutAted GeneS and analyzed their properties. RESULTS: Analysis of FLAGS revealed that these genes have significantly longer protein coding sequences, a greater number of paralogs and display less evolutionarily selective pressure than expected. FLAGS are more frequently reported in PubMed clinical literature and more frequently associated with diseased phenotypes compared to the set of human protein-coding genes. We demonstrated an overlap between FLAGS and the rare-disease causing genes recently discovered through WES studies (n = 10) and the need for replication studies and rigorous statistical and biological analyses when associating FLAGS to rare disease. Finally, we showed how FLAGS are applied in disease-causing variant prioritization approach on exome data from a family affected by an unknown rare genetic disorder. CONCLUSIONS: We showed that some genes are frequently affected by rare, likely functional variants in general population, and are frequently observed in WES studies analyzing diverse rare phenotypes. We found that the rate at which genes accumulate rare mutations is beneficial information for prioritizing candidates. We provided a ranking system based on the mutation accumulation rates for prioritizing exome-captured human genes, and propose that clinical reports associating any disease/phenotype to FLAGS be evaluated with extra caution. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-014-0064-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4267152
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42671522014-12-17 FLAGS, frequently mutated genes in public exomes Shyr, Casper Tarailo-Graovac, Maja Gottlieb, Michael Lee, Jessica JY van Karnebeek, Clara Wasserman, Wyeth W BMC Med Genomics Research Article BACKGROUND: Dramatic improvements in DNA-sequencing technologies and computational analyses have led to wide use of whole exome sequencing (WES) to identify the genetic basis of Mendelian disorders. More than 180 novel rare-disease-causing genes with Mendelian inheritance patterns have been discovered through sequencing the exomes of just a few unrelated individuals or family members. As rare/novel genetic variants continue to be uncovered, there is a major challenge in distinguishing true pathogenic variants from rare benign mutations. METHODS: We used publicly available exome cohorts, together with the dbSNP database, to derive a list of genes (n = 100) that most frequently exhibit rare (<1%) non-synonymous/splice-site variants in general populations. We termed these genes FLAGS for FrequentLy mutAted GeneS and analyzed their properties. RESULTS: Analysis of FLAGS revealed that these genes have significantly longer protein coding sequences, a greater number of paralogs and display less evolutionarily selective pressure than expected. FLAGS are more frequently reported in PubMed clinical literature and more frequently associated with diseased phenotypes compared to the set of human protein-coding genes. We demonstrated an overlap between FLAGS and the rare-disease causing genes recently discovered through WES studies (n = 10) and the need for replication studies and rigorous statistical and biological analyses when associating FLAGS to rare disease. Finally, we showed how FLAGS are applied in disease-causing variant prioritization approach on exome data from a family affected by an unknown rare genetic disorder. CONCLUSIONS: We showed that some genes are frequently affected by rare, likely functional variants in general population, and are frequently observed in WES studies analyzing diverse rare phenotypes. We found that the rate at which genes accumulate rare mutations is beneficial information for prioritizing candidates. We provided a ranking system based on the mutation accumulation rates for prioritizing exome-captured human genes, and propose that clinical reports associating any disease/phenotype to FLAGS be evaluated with extra caution. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-014-0064-y) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-03 /pmc/articles/PMC4267152/ /pubmed/25466818 http://dx.doi.org/10.1186/s12920-014-0064-y Text en © Shyr et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Shyr, Casper
Tarailo-Graovac, Maja
Gottlieb, Michael
Lee, Jessica JY
van Karnebeek, Clara
Wasserman, Wyeth W
FLAGS, frequently mutated genes in public exomes
title FLAGS, frequently mutated genes in public exomes
title_full FLAGS, frequently mutated genes in public exomes
title_fullStr FLAGS, frequently mutated genes in public exomes
title_full_unstemmed FLAGS, frequently mutated genes in public exomes
title_short FLAGS, frequently mutated genes in public exomes
title_sort flags, frequently mutated genes in public exomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4267152/
https://www.ncbi.nlm.nih.gov/pubmed/25466818
http://dx.doi.org/10.1186/s12920-014-0064-y
work_keys_str_mv AT shyrcasper flagsfrequentlymutatedgenesinpublicexomes
AT tarailograovacmaja flagsfrequentlymutatedgenesinpublicexomes
AT gottliebmichael flagsfrequentlymutatedgenesinpublicexomes
AT leejessicajy flagsfrequentlymutatedgenesinpublicexomes
AT vankarnebeekclara flagsfrequentlymutatedgenesinpublicexomes
AT wassermanwyethw flagsfrequentlymutatedgenesinpublicexomes