Cargando…
Boosting GWAS using biological networks: A study on susceptibility to familial breast cancer
Genome-wide association studies (GWAS) explore the genetic causes of complex diseases. However, classical approaches ignore the biological context of the genetic variants and genes under study. To address this shortcoming, one can use biological networks, which model functional relationships, to sea...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8009366/ https://www.ncbi.nlm.nih.gov/pubmed/33735170 http://dx.doi.org/10.1371/journal.pcbi.1008819 |
_version_ | 1783672859535605760 |
---|---|
author | Climente-González, Héctor Lonjou, Christine Lesueur, Fabienne Stoppa-Lyonnet, Dominique Andrieu, Nadine Azencott, Chloé-Agathe |
author_facet | Climente-González, Héctor Lonjou, Christine Lesueur, Fabienne Stoppa-Lyonnet, Dominique Andrieu, Nadine Azencott, Chloé-Agathe |
author_sort | Climente-González, Héctor |
collection | PubMed |
description | Genome-wide association studies (GWAS) explore the genetic causes of complex diseases. However, classical approaches ignore the biological context of the genetic variants and genes under study. To address this shortcoming, one can use biological networks, which model functional relationships, to search for functionally related susceptibility loci. Many such network methods exist, each arising from different mathematical frameworks, pre-processing steps, and assumptions about the network properties of the susceptibility mechanism. Unsurprisingly, this results in disparate solutions. To explore how to exploit these heterogeneous approaches, we selected six network methods and applied them to GENESIS, a nationwide French study on familial breast cancer. First, we verified that network methods recovered more interpretable results than a standard GWAS. We addressed the heterogeneity of their solutions by studying their overlap, computing what we called the consensus. The key gene in this consensus solution was COPS5, a gene related to multiple cancer hallmarks. Another issue we observed was that network methods were unstable, selecting very different genes on different subsamples of GENESIS. Therefore, we proposed a stable consensus solution formed by the 68 genes most consistently selected across multiple subsamples. This solution was also enriched in genes known to be associated with breast cancer susceptibility (BLM, CASP8, CASP10, DNAJC1, FGFR2, MRPS30, and SLC4A7, P-value = 3 × 10(−4)). The most connected gene was CUL3, a regulator of several genes linked to cancer progression. Lastly, we evaluated the biases of each method and the impact of their parameters on the outcome. In general, network methods preferred highly connected genes, even after random rewirings that stripped the connections of any biological meaning. In conclusion, we present the advantages of network-guided GWAS, characterize their shortcomings, and provide strategies to address them. To compute the consensus networks, implementations of all six methods are available at https://github.com/hclimente/gwas-tools. |
format | Online Article Text |
id | pubmed-8009366 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-80093662021-04-07 Boosting GWAS using biological networks: A study on susceptibility to familial breast cancer Climente-González, Héctor Lonjou, Christine Lesueur, Fabienne Stoppa-Lyonnet, Dominique Andrieu, Nadine Azencott, Chloé-Agathe PLoS Comput Biol Research Article Genome-wide association studies (GWAS) explore the genetic causes of complex diseases. However, classical approaches ignore the biological context of the genetic variants and genes under study. To address this shortcoming, one can use biological networks, which model functional relationships, to search for functionally related susceptibility loci. Many such network methods exist, each arising from different mathematical frameworks, pre-processing steps, and assumptions about the network properties of the susceptibility mechanism. Unsurprisingly, this results in disparate solutions. To explore how to exploit these heterogeneous approaches, we selected six network methods and applied them to GENESIS, a nationwide French study on familial breast cancer. First, we verified that network methods recovered more interpretable results than a standard GWAS. We addressed the heterogeneity of their solutions by studying their overlap, computing what we called the consensus. The key gene in this consensus solution was COPS5, a gene related to multiple cancer hallmarks. Another issue we observed was that network methods were unstable, selecting very different genes on different subsamples of GENESIS. Therefore, we proposed a stable consensus solution formed by the 68 genes most consistently selected across multiple subsamples. This solution was also enriched in genes known to be associated with breast cancer susceptibility (BLM, CASP8, CASP10, DNAJC1, FGFR2, MRPS30, and SLC4A7, P-value = 3 × 10(−4)). The most connected gene was CUL3, a regulator of several genes linked to cancer progression. Lastly, we evaluated the biases of each method and the impact of their parameters on the outcome. In general, network methods preferred highly connected genes, even after random rewirings that stripped the connections of any biological meaning. In conclusion, we present the advantages of network-guided GWAS, characterize their shortcomings, and provide strategies to address them. To compute the consensus networks, implementations of all six methods are available at https://github.com/hclimente/gwas-tools. Public Library of Science 2021-03-18 /pmc/articles/PMC8009366/ /pubmed/33735170 http://dx.doi.org/10.1371/journal.pcbi.1008819 Text en © 2021 Climente-González et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Climente-González, Héctor Lonjou, Christine Lesueur, Fabienne Stoppa-Lyonnet, Dominique Andrieu, Nadine Azencott, Chloé-Agathe Boosting GWAS using biological networks: A study on susceptibility to familial breast cancer |
title | Boosting GWAS using biological networks: A study on susceptibility to familial breast cancer |
title_full | Boosting GWAS using biological networks: A study on susceptibility to familial breast cancer |
title_fullStr | Boosting GWAS using biological networks: A study on susceptibility to familial breast cancer |
title_full_unstemmed | Boosting GWAS using biological networks: A study on susceptibility to familial breast cancer |
title_short | Boosting GWAS using biological networks: A study on susceptibility to familial breast cancer |
title_sort | boosting gwas using biological networks: a study on susceptibility to familial breast cancer |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8009366/ https://www.ncbi.nlm.nih.gov/pubmed/33735170 http://dx.doi.org/10.1371/journal.pcbi.1008819 |
work_keys_str_mv | AT climentegonzalezhector boostinggwasusingbiologicalnetworksastudyonsusceptibilitytofamilialbreastcancer AT lonjouchristine boostinggwasusingbiologicalnetworksastudyonsusceptibilitytofamilialbreastcancer AT lesueurfabienne boostinggwasusingbiologicalnetworksastudyonsusceptibilitytofamilialbreastcancer AT boostinggwasusingbiologicalnetworksastudyonsusceptibilitytofamilialbreastcancer AT stoppalyonnetdominique boostinggwasusingbiologicalnetworksastudyonsusceptibilitytofamilialbreastcancer AT andrieunadine boostinggwasusingbiologicalnetworksastudyonsusceptibilitytofamilialbreastcancer AT azencottchloeagathe boostinggwasusingbiologicalnetworksastudyonsusceptibilitytofamilialbreastcancer |