Cargando…
FLAGS, frequently mutated genes in public exomes
BACKGROUND: Dramatic improvements in DNA-sequencing technologies and computational analyses have led to wide use of whole exome sequencing (WES) to identify the genetic basis of Mendelian disorders. More than 180 novel rare-disease-causing genes with Mendelian inheritance patterns have been discover...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4267152/ https://www.ncbi.nlm.nih.gov/pubmed/25466818 http://dx.doi.org/10.1186/s12920-014-0064-y |
_version_ | 1782349109992095744 |
---|---|
author | Shyr, Casper Tarailo-Graovac, Maja Gottlieb, Michael Lee, Jessica JY van Karnebeek, Clara Wasserman, Wyeth W |
author_facet | Shyr, Casper Tarailo-Graovac, Maja Gottlieb, Michael Lee, Jessica JY van Karnebeek, Clara Wasserman, Wyeth W |
author_sort | Shyr, Casper |
collection | PubMed |
description | BACKGROUND: Dramatic improvements in DNA-sequencing technologies and computational analyses have led to wide use of whole exome sequencing (WES) to identify the genetic basis of Mendelian disorders. More than 180 novel rare-disease-causing genes with Mendelian inheritance patterns have been discovered through sequencing the exomes of just a few unrelated individuals or family members. As rare/novel genetic variants continue to be uncovered, there is a major challenge in distinguishing true pathogenic variants from rare benign mutations. METHODS: We used publicly available exome cohorts, together with the dbSNP database, to derive a list of genes (n = 100) that most frequently exhibit rare (<1%) non-synonymous/splice-site variants in general populations. We termed these genes FLAGS for FrequentLy mutAted GeneS and analyzed their properties. RESULTS: Analysis of FLAGS revealed that these genes have significantly longer protein coding sequences, a greater number of paralogs and display less evolutionarily selective pressure than expected. FLAGS are more frequently reported in PubMed clinical literature and more frequently associated with diseased phenotypes compared to the set of human protein-coding genes. We demonstrated an overlap between FLAGS and the rare-disease causing genes recently discovered through WES studies (n = 10) and the need for replication studies and rigorous statistical and biological analyses when associating FLAGS to rare disease. Finally, we showed how FLAGS are applied in disease-causing variant prioritization approach on exome data from a family affected by an unknown rare genetic disorder. CONCLUSIONS: We showed that some genes are frequently affected by rare, likely functional variants in general population, and are frequently observed in WES studies analyzing diverse rare phenotypes. We found that the rate at which genes accumulate rare mutations is beneficial information for prioritizing candidates. We provided a ranking system based on the mutation accumulation rates for prioritizing exome-captured human genes, and propose that clinical reports associating any disease/phenotype to FLAGS be evaluated with extra caution. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-014-0064-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4267152 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42671522014-12-17 FLAGS, frequently mutated genes in public exomes Shyr, Casper Tarailo-Graovac, Maja Gottlieb, Michael Lee, Jessica JY van Karnebeek, Clara Wasserman, Wyeth W BMC Med Genomics Research Article BACKGROUND: Dramatic improvements in DNA-sequencing technologies and computational analyses have led to wide use of whole exome sequencing (WES) to identify the genetic basis of Mendelian disorders. More than 180 novel rare-disease-causing genes with Mendelian inheritance patterns have been discovered through sequencing the exomes of just a few unrelated individuals or family members. As rare/novel genetic variants continue to be uncovered, there is a major challenge in distinguishing true pathogenic variants from rare benign mutations. METHODS: We used publicly available exome cohorts, together with the dbSNP database, to derive a list of genes (n = 100) that most frequently exhibit rare (<1%) non-synonymous/splice-site variants in general populations. We termed these genes FLAGS for FrequentLy mutAted GeneS and analyzed their properties. RESULTS: Analysis of FLAGS revealed that these genes have significantly longer protein coding sequences, a greater number of paralogs and display less evolutionarily selective pressure than expected. FLAGS are more frequently reported in PubMed clinical literature and more frequently associated with diseased phenotypes compared to the set of human protein-coding genes. We demonstrated an overlap between FLAGS and the rare-disease causing genes recently discovered through WES studies (n = 10) and the need for replication studies and rigorous statistical and biological analyses when associating FLAGS to rare disease. Finally, we showed how FLAGS are applied in disease-causing variant prioritization approach on exome data from a family affected by an unknown rare genetic disorder. CONCLUSIONS: We showed that some genes are frequently affected by rare, likely functional variants in general population, and are frequently observed in WES studies analyzing diverse rare phenotypes. We found that the rate at which genes accumulate rare mutations is beneficial information for prioritizing candidates. We provided a ranking system based on the mutation accumulation rates for prioritizing exome-captured human genes, and propose that clinical reports associating any disease/phenotype to FLAGS be evaluated with extra caution. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-014-0064-y) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-03 /pmc/articles/PMC4267152/ /pubmed/25466818 http://dx.doi.org/10.1186/s12920-014-0064-y Text en © Shyr et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Shyr, Casper Tarailo-Graovac, Maja Gottlieb, Michael Lee, Jessica JY van Karnebeek, Clara Wasserman, Wyeth W FLAGS, frequently mutated genes in public exomes |
title | FLAGS, frequently mutated genes in public exomes |
title_full | FLAGS, frequently mutated genes in public exomes |
title_fullStr | FLAGS, frequently mutated genes in public exomes |
title_full_unstemmed | FLAGS, frequently mutated genes in public exomes |
title_short | FLAGS, frequently mutated genes in public exomes |
title_sort | flags, frequently mutated genes in public exomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4267152/ https://www.ncbi.nlm.nih.gov/pubmed/25466818 http://dx.doi.org/10.1186/s12920-014-0064-y |
work_keys_str_mv | AT shyrcasper flagsfrequentlymutatedgenesinpublicexomes AT tarailograovacmaja flagsfrequentlymutatedgenesinpublicexomes AT gottliebmichael flagsfrequentlymutatedgenesinpublicexomes AT leejessicajy flagsfrequentlymutatedgenesinpublicexomes AT vankarnebeekclara flagsfrequentlymutatedgenesinpublicexomes AT wassermanwyethw flagsfrequentlymutatedgenesinpublicexomes |