Cargando…

Clustering and graph mining techniques for classification of complex structural variations in cancer genomes

For many years, a major question in cancer genomics has been the identification of those variations that can have a functional role in cancer, and distinguish from the majority of genomic changes that have no functional consequences. This is particularly challenging when considering complex chromoso...

Descripción completa

Detalles Bibliográficos
Autores principales: Gomez-Sanchez, Gonzalo, Delgado-Serrano, Luisa, Carrera, David, Torrents, David, Berral, Josep Ll.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8885672/
https://www.ncbi.nlm.nih.gov/pubmed/35228601
http://dx.doi.org/10.1038/s41598-022-07211-6
_version_ 1784660488648392704
author Gomez-Sanchez, Gonzalo
Delgado-Serrano, Luisa
Carrera, David
Torrents, David
Berral, Josep Ll.
author_facet Gomez-Sanchez, Gonzalo
Delgado-Serrano, Luisa
Carrera, David
Torrents, David
Berral, Josep Ll.
author_sort Gomez-Sanchez, Gonzalo
collection PubMed
description For many years, a major question in cancer genomics has been the identification of those variations that can have a functional role in cancer, and distinguish from the majority of genomic changes that have no functional consequences. This is particularly challenging when considering complex chromosomal rearrangements, often composed of multiple DNA breaks, resulting in difficulties in classifying and interpreting them functionally. Despite recent efforts towards classifying structural variants (SVs), more robust statistical frames are needed to better classify these variants and isolate those that derive from specific molecular mechanisms. We present a new statistical approach to analyze SVs patterns from 2392 tumor samples from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium and identify significant recurrence, which can inform relevant mechanisms involved in the biology of tumors. The method is based on recursive KDE clustering of 152,926 SVs, randomization methods, graph mining techniques and statistical measures. The proposed methodology was able not only to identify complex patterns across different cancer types but also to prove them as not random occurrences. Furthermore, a new class of pattern that was not previously described has been identified.
format Online
Article
Text
id pubmed-8885672
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-88856722022-03-01 Clustering and graph mining techniques for classification of complex structural variations in cancer genomes Gomez-Sanchez, Gonzalo Delgado-Serrano, Luisa Carrera, David Torrents, David Berral, Josep Ll. Sci Rep Article For many years, a major question in cancer genomics has been the identification of those variations that can have a functional role in cancer, and distinguish from the majority of genomic changes that have no functional consequences. This is particularly challenging when considering complex chromosomal rearrangements, often composed of multiple DNA breaks, resulting in difficulties in classifying and interpreting them functionally. Despite recent efforts towards classifying structural variants (SVs), more robust statistical frames are needed to better classify these variants and isolate those that derive from specific molecular mechanisms. We present a new statistical approach to analyze SVs patterns from 2392 tumor samples from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium and identify significant recurrence, which can inform relevant mechanisms involved in the biology of tumors. The method is based on recursive KDE clustering of 152,926 SVs, randomization methods, graph mining techniques and statistical measures. The proposed methodology was able not only to identify complex patterns across different cancer types but also to prove them as not random occurrences. Furthermore, a new class of pattern that was not previously described has been identified. Nature Publishing Group UK 2022-02-28 /pmc/articles/PMC8885672/ /pubmed/35228601 http://dx.doi.org/10.1038/s41598-022-07211-6 Text en © The Author(s) 2022, corrected publication 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Gomez-Sanchez, Gonzalo
Delgado-Serrano, Luisa
Carrera, David
Torrents, David
Berral, Josep Ll.
Clustering and graph mining techniques for classification of complex structural variations in cancer genomes
title Clustering and graph mining techniques for classification of complex structural variations in cancer genomes
title_full Clustering and graph mining techniques for classification of complex structural variations in cancer genomes
title_fullStr Clustering and graph mining techniques for classification of complex structural variations in cancer genomes
title_full_unstemmed Clustering and graph mining techniques for classification of complex structural variations in cancer genomes
title_short Clustering and graph mining techniques for classification of complex structural variations in cancer genomes
title_sort clustering and graph mining techniques for classification of complex structural variations in cancer genomes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8885672/
https://www.ncbi.nlm.nih.gov/pubmed/35228601
http://dx.doi.org/10.1038/s41598-022-07211-6
work_keys_str_mv AT gomezsanchezgonzalo clusteringandgraphminingtechniquesforclassificationofcomplexstructuralvariationsincancergenomes
AT delgadoserranoluisa clusteringandgraphminingtechniquesforclassificationofcomplexstructuralvariationsincancergenomes
AT carreradavid clusteringandgraphminingtechniquesforclassificationofcomplexstructuralvariationsincancergenomes
AT torrentsdavid clusteringandgraphminingtechniquesforclassificationofcomplexstructuralvariationsincancergenomes
AT berraljosepll clusteringandgraphminingtechniquesforclassificationofcomplexstructuralvariationsincancergenomes