Cargando…
A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis
Whole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most dam...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6436687/ https://www.ncbi.nlm.nih.gov/pubmed/30917156 http://dx.doi.org/10.1371/journal.pone.0213350 |
_version_ | 1783406842059161600 |
---|---|
author | Rentoft, Matilda Svensson, Daniel Sjödin, Andreas Olason, Pall I. Sjöström, Olle Nylander, Carin Osterman, Pia Sjögren, Rickard Netotea, Sergiu Wibom, Carl Cederquist, Kristina Chabes, Andrei Trygg, Johan Melin, Beatrice S. Johansson, Erik |
author_facet | Rentoft, Matilda Svensson, Daniel Sjödin, Andreas Olason, Pall I. Sjöström, Olle Nylander, Carin Osterman, Pia Sjögren, Rickard Netotea, Sergiu Wibom, Carl Cederquist, Kristina Chabes, Andrei Trygg, Johan Melin, Beatrice S. Johansson, Erik |
author_sort | Rentoft, Matilda |
collection | PubMed |
description | Whole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most damaging variants, e.g. those found in coding regions, and overlooking the remaining genetic variation. Such a biased approach explains in part why the genetic causes of many families with dominantly inherited diseases, in spite of being included in whole-genome sequencing studies, are left unsolved today. Here we explore the use of a geographically matched control population to minimize the number of candidate disease-causing variants without excluding variants based on assumptions on genomic position or functional predictions. To exemplify the benefit of the geographically matched control population we apply a typical disease variant filtering strategy in a family with an autosomal dominant form of colorectal cancer. With the use of the geographically matched control population we end up with 26 candidate variants genome wide. This is in contrast to the tens of thousands of candidates left when only making use of available public variant datasets. The effect of the local control population is dual, it (1) reduces the total number of candidate variants shared between affected individuals, and more importantly (2) increases the rate by which the number of candidate variants are reduced as additional affected family members are included in the filtering strategy. We demonstrate that the application of a geographically matched control population effectively limits the number of candidate disease-causing variants and may provide the means by which variants suitable for functional studies are identified genome wide. |
format | Online Article Text |
id | pubmed-6436687 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-64366872019-04-12 A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis Rentoft, Matilda Svensson, Daniel Sjödin, Andreas Olason, Pall I. Sjöström, Olle Nylander, Carin Osterman, Pia Sjögren, Rickard Netotea, Sergiu Wibom, Carl Cederquist, Kristina Chabes, Andrei Trygg, Johan Melin, Beatrice S. Johansson, Erik PLoS One Research Article Whole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most damaging variants, e.g. those found in coding regions, and overlooking the remaining genetic variation. Such a biased approach explains in part why the genetic causes of many families with dominantly inherited diseases, in spite of being included in whole-genome sequencing studies, are left unsolved today. Here we explore the use of a geographically matched control population to minimize the number of candidate disease-causing variants without excluding variants based on assumptions on genomic position or functional predictions. To exemplify the benefit of the geographically matched control population we apply a typical disease variant filtering strategy in a family with an autosomal dominant form of colorectal cancer. With the use of the geographically matched control population we end up with 26 candidate variants genome wide. This is in contrast to the tens of thousands of candidates left when only making use of available public variant datasets. The effect of the local control population is dual, it (1) reduces the total number of candidate variants shared between affected individuals, and more importantly (2) increases the rate by which the number of candidate variants are reduced as additional affected family members are included in the filtering strategy. We demonstrate that the application of a geographically matched control population effectively limits the number of candidate disease-causing variants and may provide the means by which variants suitable for functional studies are identified genome wide. Public Library of Science 2019-03-27 /pmc/articles/PMC6436687/ /pubmed/30917156 http://dx.doi.org/10.1371/journal.pone.0213350 Text en © 2019 Rentoft et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Rentoft, Matilda Svensson, Daniel Sjödin, Andreas Olason, Pall I. Sjöström, Olle Nylander, Carin Osterman, Pia Sjögren, Rickard Netotea, Sergiu Wibom, Carl Cederquist, Kristina Chabes, Andrei Trygg, Johan Melin, Beatrice S. Johansson, Erik A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis |
title | A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis |
title_full | A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis |
title_fullStr | A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis |
title_full_unstemmed | A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis |
title_short | A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis |
title_sort | geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6436687/ https://www.ncbi.nlm.nih.gov/pubmed/30917156 http://dx.doi.org/10.1371/journal.pone.0213350 |
work_keys_str_mv | AT rentoftmatilda ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT svenssondaniel ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT sjodinandreas ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT olasonpalli ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT sjostromolle ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT nylandercarin ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT ostermanpia ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT sjogrenrickard ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT netoteasergiu ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT wibomcarl ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT cederquistkristina ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT chabesandrei ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT tryggjohan ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT melinbeatrices ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT johanssonerik ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT rentoftmatilda geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT svenssondaniel geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT sjodinandreas geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT olasonpalli geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT sjostromolle geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT nylandercarin geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT ostermanpia geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT sjogrenrickard geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT netoteasergiu geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT wibomcarl geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT cederquistkristina geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT chabesandrei geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT tryggjohan geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT melinbeatrices geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis AT johanssonerik geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis |