Cargando…

A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis

Whole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most dam...

Descripción completa

Detalles Bibliográficos
Autores principales: Rentoft, Matilda, Svensson, Daniel, Sjödin, Andreas, Olason, Pall I., Sjöström, Olle, Nylander, Carin, Osterman, Pia, Sjögren, Rickard, Netotea, Sergiu, Wibom, Carl, Cederquist, Kristina, Chabes, Andrei, Trygg, Johan, Melin, Beatrice S., Johansson, Erik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6436687/
https://www.ncbi.nlm.nih.gov/pubmed/30917156
http://dx.doi.org/10.1371/journal.pone.0213350
_version_ 1783406842059161600
author Rentoft, Matilda
Svensson, Daniel
Sjödin, Andreas
Olason, Pall I.
Sjöström, Olle
Nylander, Carin
Osterman, Pia
Sjögren, Rickard
Netotea, Sergiu
Wibom, Carl
Cederquist, Kristina
Chabes, Andrei
Trygg, Johan
Melin, Beatrice S.
Johansson, Erik
author_facet Rentoft, Matilda
Svensson, Daniel
Sjödin, Andreas
Olason, Pall I.
Sjöström, Olle
Nylander, Carin
Osterman, Pia
Sjögren, Rickard
Netotea, Sergiu
Wibom, Carl
Cederquist, Kristina
Chabes, Andrei
Trygg, Johan
Melin, Beatrice S.
Johansson, Erik
author_sort Rentoft, Matilda
collection PubMed
description Whole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most damaging variants, e.g. those found in coding regions, and overlooking the remaining genetic variation. Such a biased approach explains in part why the genetic causes of many families with dominantly inherited diseases, in spite of being included in whole-genome sequencing studies, are left unsolved today. Here we explore the use of a geographically matched control population to minimize the number of candidate disease-causing variants without excluding variants based on assumptions on genomic position or functional predictions. To exemplify the benefit of the geographically matched control population we apply a typical disease variant filtering strategy in a family with an autosomal dominant form of colorectal cancer. With the use of the geographically matched control population we end up with 26 candidate variants genome wide. This is in contrast to the tens of thousands of candidates left when only making use of available public variant datasets. The effect of the local control population is dual, it (1) reduces the total number of candidate variants shared between affected individuals, and more importantly (2) increases the rate by which the number of candidate variants are reduced as additional affected family members are included in the filtering strategy. We demonstrate that the application of a geographically matched control population effectively limits the number of candidate disease-causing variants and may provide the means by which variants suitable for functional studies are identified genome wide.
format Online
Article
Text
id pubmed-6436687
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64366872019-04-12 A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis Rentoft, Matilda Svensson, Daniel Sjödin, Andreas Olason, Pall I. Sjöström, Olle Nylander, Carin Osterman, Pia Sjögren, Rickard Netotea, Sergiu Wibom, Carl Cederquist, Kristina Chabes, Andrei Trygg, Johan Melin, Beatrice S. Johansson, Erik PLoS One Research Article Whole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most damaging variants, e.g. those found in coding regions, and overlooking the remaining genetic variation. Such a biased approach explains in part why the genetic causes of many families with dominantly inherited diseases, in spite of being included in whole-genome sequencing studies, are left unsolved today. Here we explore the use of a geographically matched control population to minimize the number of candidate disease-causing variants without excluding variants based on assumptions on genomic position or functional predictions. To exemplify the benefit of the geographically matched control population we apply a typical disease variant filtering strategy in a family with an autosomal dominant form of colorectal cancer. With the use of the geographically matched control population we end up with 26 candidate variants genome wide. This is in contrast to the tens of thousands of candidates left when only making use of available public variant datasets. The effect of the local control population is dual, it (1) reduces the total number of candidate variants shared between affected individuals, and more importantly (2) increases the rate by which the number of candidate variants are reduced as additional affected family members are included in the filtering strategy. We demonstrate that the application of a geographically matched control population effectively limits the number of candidate disease-causing variants and may provide the means by which variants suitable for functional studies are identified genome wide. Public Library of Science 2019-03-27 /pmc/articles/PMC6436687/ /pubmed/30917156 http://dx.doi.org/10.1371/journal.pone.0213350 Text en © 2019 Rentoft et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Rentoft, Matilda
Svensson, Daniel
Sjödin, Andreas
Olason, Pall I.
Sjöström, Olle
Nylander, Carin
Osterman, Pia
Sjögren, Rickard
Netotea, Sergiu
Wibom, Carl
Cederquist, Kristina
Chabes, Andrei
Trygg, Johan
Melin, Beatrice S.
Johansson, Erik
A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis
title A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis
title_full A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis
title_fullStr A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis
title_full_unstemmed A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis
title_short A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis
title_sort geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6436687/
https://www.ncbi.nlm.nih.gov/pubmed/30917156
http://dx.doi.org/10.1371/journal.pone.0213350
work_keys_str_mv AT rentoftmatilda ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT svenssondaniel ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT sjodinandreas ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT olasonpalli ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT sjostromolle ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT nylandercarin ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT ostermanpia ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT sjogrenrickard ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT netoteasergiu ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT wibomcarl ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT cederquistkristina ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT chabesandrei ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT tryggjohan ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT melinbeatrices ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT johanssonerik ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT rentoftmatilda geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT svenssondaniel geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT sjodinandreas geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT olasonpalli geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT sjostromolle geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT nylandercarin geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT ostermanpia geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT sjogrenrickard geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT netoteasergiu geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT wibomcarl geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT cederquistkristina geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT chabesandrei geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT tryggjohan geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT melinbeatrices geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT johanssonerik geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis