Cargando…

Comparing copy-number profiles under multi-copy amplifications and deletions

BACKGROUND: During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to compute the ev...

Descripción completa

Detalles Bibliográficos
Autores principales: Cordonnier, Garance, Lafond, Manuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7160889/
https://www.ncbi.nlm.nih.gov/pubmed/32299350
http://dx.doi.org/10.1186/s12864-020-6611-3
_version_ 1783522841899040768
author Cordonnier, Garance
Lafond, Manuel
author_facet Cordonnier, Garance
Lafond, Manuel
author_sort Cordonnier, Garance
collection PubMed
description BACKGROUND: During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to compute the evolutionary distance between two cells for which only CNPs are known. This asks for the minimum number of segmental amplifications and deletions to turn one CNP into another. This was recently formalized into a model where each event is assumed to alter a copy-number by 1 or −1, even though these events can affect large portions of a chromosome. RESULTS: We propose a general cost framework where an event can modify the copy-number of a gene by larger amounts. We show that any cost scheme that allows segmental deletions of arbitrary length makes computing the distance strongly NP-hard. We then devise a factor 2 approximation algorithm for the problem when copy-numbers are non-zero and provide an implementation called cnp2cnp. We evaluate our approach experimentally by reconstructing simulated cancer phylogenies from the pairwise distances inferred by cnp2cnp and compare it against two other alternatives, namely the MEDICC distance and the Euclidean distance. CONCLUSIONS: The experimental results show that our distance yields more accurate phylogenies on average than these alternatives if the given CNPs are error-free, but that the MEDICC distance is slightly more robust against error in the data. In all cases, our experiments show that either our approach or the MEDICC approach should preferred over the Euclidean distance.
format Online
Article
Text
id pubmed-7160889
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71608892020-04-21 Comparing copy-number profiles under multi-copy amplifications and deletions Cordonnier, Garance Lafond, Manuel BMC Genomics Research BACKGROUND: During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to compute the evolutionary distance between two cells for which only CNPs are known. This asks for the minimum number of segmental amplifications and deletions to turn one CNP into another. This was recently formalized into a model where each event is assumed to alter a copy-number by 1 or −1, even though these events can affect large portions of a chromosome. RESULTS: We propose a general cost framework where an event can modify the copy-number of a gene by larger amounts. We show that any cost scheme that allows segmental deletions of arbitrary length makes computing the distance strongly NP-hard. We then devise a factor 2 approximation algorithm for the problem when copy-numbers are non-zero and provide an implementation called cnp2cnp. We evaluate our approach experimentally by reconstructing simulated cancer phylogenies from the pairwise distances inferred by cnp2cnp and compare it against two other alternatives, namely the MEDICC distance and the Euclidean distance. CONCLUSIONS: The experimental results show that our distance yields more accurate phylogenies on average than these alternatives if the given CNPs are error-free, but that the MEDICC distance is slightly more robust against error in the data. In all cases, our experiments show that either our approach or the MEDICC approach should preferred over the Euclidean distance. BioMed Central 2020-04-16 /pmc/articles/PMC7160889/ /pubmed/32299350 http://dx.doi.org/10.1186/s12864-020-6611-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Cordonnier, Garance
Lafond, Manuel
Comparing copy-number profiles under multi-copy amplifications and deletions
title Comparing copy-number profiles under multi-copy amplifications and deletions
title_full Comparing copy-number profiles under multi-copy amplifications and deletions
title_fullStr Comparing copy-number profiles under multi-copy amplifications and deletions
title_full_unstemmed Comparing copy-number profiles under multi-copy amplifications and deletions
title_short Comparing copy-number profiles under multi-copy amplifications and deletions
title_sort comparing copy-number profiles under multi-copy amplifications and deletions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7160889/
https://www.ncbi.nlm.nih.gov/pubmed/32299350
http://dx.doi.org/10.1186/s12864-020-6611-3
work_keys_str_mv AT cordonniergarance comparingcopynumberprofilesundermulticopyamplificationsanddeletions
AT lafondmanuel comparingcopynumberprofilesundermulticopyamplificationsanddeletions