Cargando…

Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology

DNA copy number aberrations (CNAs) are of biological and medical interest because they help identify regulatory mechanisms underlying tumor initiation and evolution. Identification of tumor-driving CNAs (driver CNAs) however remains a challenging task, because they are frequently hidden by CNAs that...

Descripción completa

Detalles Bibliográficos
Autores principales: Arsuaga, Javier, Borrman, Tyler, Cavalcante, Raymond, Gonzalez, Georgina, Park, Catherine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4996377/
https://www.ncbi.nlm.nih.gov/pubmed/27600228
http://dx.doi.org/10.3390/microarrays4030339
_version_ 1782449591207067648
author Arsuaga, Javier
Borrman, Tyler
Cavalcante, Raymond
Gonzalez, Georgina
Park, Catherine
author_facet Arsuaga, Javier
Borrman, Tyler
Cavalcante, Raymond
Gonzalez, Georgina
Park, Catherine
author_sort Arsuaga, Javier
collection PubMed
description DNA copy number aberrations (CNAs) are of biological and medical interest because they help identify regulatory mechanisms underlying tumor initiation and evolution. Identification of tumor-driving CNAs (driver CNAs) however remains a challenging task, because they are frequently hidden by CNAs that are the product of random events that take place during tumor evolution. Experimental detection of CNAs is commonly accomplished through array comparative genomic hybridization (aCGH) assays followed by supervised and/or unsupervised statistical methods that combine the segmented profiles of all patients to identify driver CNAs. Here, we extend a previously-presented supervised algorithm for the identification of CNAs that is based on a topological representation of the data. Our method associates a two-dimensional (2D) point cloud with each aCGH profile and generates a sequence of simplicial complexes, mathematical objects that generalize the concept of a graph. This representation of the data permits segmenting the data at different resolutions and identifying CNAs by interrogating the topological properties of these simplicial complexes. We tested our approach on a published dataset with the goal of identifying specific breast cancer CNAs associated with specific molecular subtypes. Identification of CNAs associated with each subtype was performed by analyzing each subtype separately from the others and by taking the rest of the subtypes as the control. Our results found a new amplification in 11q at the location of the progesterone receptor in the Luminal A subtype. Aberrations in the Luminal B subtype were found only upon removal of the basal-like subtype from the control set. Under those conditions, all regions found in the original publication, except for 17q, were confirmed; all aberrations, except those in chromosome arms 8q and 12q were confirmed in the basal-like subtype. These two chromosome arms, however, were detected only upon removal of three patients with exceedingly large copy number values. More importantly, we detected 10 and 21 additional regions in the Luminal B and basal-like subtypes, respectively. Most of the additional regions were either validated on an independent dataset and/or using GISTIC. Furthermore, we found three new CNAs in the basal-like subtype: a combination of gains and losses in 1p, a gain in 2p and a loss in 14q. Based on these results, we suggest that topological approaches that incorporate multiresolution analyses and that interrogate topological properties of the data can help in the identification of copy number changes in cancer.
format Online
Article
Text
id pubmed-4996377
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-49963772016-09-06 Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology Arsuaga, Javier Borrman, Tyler Cavalcante, Raymond Gonzalez, Georgina Park, Catherine Microarrays (Basel) Article DNA copy number aberrations (CNAs) are of biological and medical interest because they help identify regulatory mechanisms underlying tumor initiation and evolution. Identification of tumor-driving CNAs (driver CNAs) however remains a challenging task, because they are frequently hidden by CNAs that are the product of random events that take place during tumor evolution. Experimental detection of CNAs is commonly accomplished through array comparative genomic hybridization (aCGH) assays followed by supervised and/or unsupervised statistical methods that combine the segmented profiles of all patients to identify driver CNAs. Here, we extend a previously-presented supervised algorithm for the identification of CNAs that is based on a topological representation of the data. Our method associates a two-dimensional (2D) point cloud with each aCGH profile and generates a sequence of simplicial complexes, mathematical objects that generalize the concept of a graph. This representation of the data permits segmenting the data at different resolutions and identifying CNAs by interrogating the topological properties of these simplicial complexes. We tested our approach on a published dataset with the goal of identifying specific breast cancer CNAs associated with specific molecular subtypes. Identification of CNAs associated with each subtype was performed by analyzing each subtype separately from the others and by taking the rest of the subtypes as the control. Our results found a new amplification in 11q at the location of the progesterone receptor in the Luminal A subtype. Aberrations in the Luminal B subtype were found only upon removal of the basal-like subtype from the control set. Under those conditions, all regions found in the original publication, except for 17q, were confirmed; all aberrations, except those in chromosome arms 8q and 12q were confirmed in the basal-like subtype. These two chromosome arms, however, were detected only upon removal of three patients with exceedingly large copy number values. More importantly, we detected 10 and 21 additional regions in the Luminal B and basal-like subtypes, respectively. Most of the additional regions were either validated on an independent dataset and/or using GISTIC. Furthermore, we found three new CNAs in the basal-like subtype: a combination of gains and losses in 1p, a gain in 2p and a loss in 14q. Based on these results, we suggest that topological approaches that incorporate multiresolution analyses and that interrogate topological properties of the data can help in the identification of copy number changes in cancer. MDPI 2015-08-12 /pmc/articles/PMC4996377/ /pubmed/27600228 http://dx.doi.org/10.3390/microarrays4030339 Text en © 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Arsuaga, Javier
Borrman, Tyler
Cavalcante, Raymond
Gonzalez, Georgina
Park, Catherine
Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology
title Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology
title_full Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology
title_fullStr Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology
title_full_unstemmed Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology
title_short Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology
title_sort identification of copy number aberrations in breast cancer subtypes using persistence topology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4996377/
https://www.ncbi.nlm.nih.gov/pubmed/27600228
http://dx.doi.org/10.3390/microarrays4030339
work_keys_str_mv AT arsuagajavier identificationofcopynumberaberrationsinbreastcancersubtypesusingpersistencetopology
AT borrmantyler identificationofcopynumberaberrationsinbreastcancersubtypesusingpersistencetopology
AT cavalcanteraymond identificationofcopynumberaberrationsinbreastcancersubtypesusingpersistencetopology
AT gonzalezgeorgina identificationofcopynumberaberrationsinbreastcancersubtypesusingpersistencetopology
AT parkcatherine identificationofcopynumberaberrationsinbreastcancersubtypesusingpersistencetopology