Cargando…
Improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers
BACKGROUND: Recently, copy number variations (CNV) impacting genes involved in oncogenic pathways have attracted an increasing attention to manage disease susceptibility. CNV is one of the most important somatic aberrations in the genome of tumor cells. Oncogene activation and tumor suppressor gene...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7971104/ https://www.ncbi.nlm.nih.gov/pubmed/33711922 http://dx.doi.org/10.1186/s12859-021-04060-4 |
_version_ | 1783666553693143040 |
---|---|
author | Viailly, Pierre-Julien Sater, Vincent Viennot, Mathieu Bohers, Elodie Vergne, Nicolas Berard, Caroline Dauchel, Hélène Lecroq, Thierry Celebi, Alison Ruminy, Philippe Marchand, Vinciane Lanic, Marie-Delphine Dubois, Sydney Penther, Dominique Tilly, Hervé Mareschal, Sylvain Jardin, Fabrice |
author_facet | Viailly, Pierre-Julien Sater, Vincent Viennot, Mathieu Bohers, Elodie Vergne, Nicolas Berard, Caroline Dauchel, Hélène Lecroq, Thierry Celebi, Alison Ruminy, Philippe Marchand, Vinciane Lanic, Marie-Delphine Dubois, Sydney Penther, Dominique Tilly, Hervé Mareschal, Sylvain Jardin, Fabrice |
author_sort | Viailly, Pierre-Julien |
collection | PubMed |
description | BACKGROUND: Recently, copy number variations (CNV) impacting genes involved in oncogenic pathways have attracted an increasing attention to manage disease susceptibility. CNV is one of the most important somatic aberrations in the genome of tumor cells. Oncogene activation and tumor suppressor gene inactivation are often attributed to copy number gain/amplification or deletion, respectively, in many cancer types and stages. Recent advances in next generation sequencing protocols allow for the addition of unique molecular identifiers (UMI) to each read. Each targeted DNA fragment is labeled with a unique random nucleotide sequence added to sequencing primers. UMI are especially useful for CNV detection by making each DNA molecule in a population of reads distinct. RESULTS: Here, we present molecular Copy Number Alteration (mCNA), a new methodology allowing the detection of copy number changes using UMI. The algorithm is composed of four main steps: the construction of UMI count matrices, the use of control samples to construct a pseudo-reference, the computation of log-ratios, the segmentation and finally the statistical inference of abnormal segmented breaks. We demonstrate the success of mCNA on a dataset of patients suffering from Diffuse Large B-cell Lymphoma and we highlight that mCNA results have a strong correlation with comparative genomic hybridization. CONCLUSION: We provide mCNA, a new approach for CNV detection, freely available at https://gitlab.com/pierrejulien.viailly/mcna/ under MIT license. mCNA can significantly improve detection accuracy of CNV changes by using UMI. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04060-4. |
format | Online Article Text |
id | pubmed-7971104 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-79711042021-03-19 Improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers Viailly, Pierre-Julien Sater, Vincent Viennot, Mathieu Bohers, Elodie Vergne, Nicolas Berard, Caroline Dauchel, Hélène Lecroq, Thierry Celebi, Alison Ruminy, Philippe Marchand, Vinciane Lanic, Marie-Delphine Dubois, Sydney Penther, Dominique Tilly, Hervé Mareschal, Sylvain Jardin, Fabrice BMC Bioinformatics Methodology Article BACKGROUND: Recently, copy number variations (CNV) impacting genes involved in oncogenic pathways have attracted an increasing attention to manage disease susceptibility. CNV is one of the most important somatic aberrations in the genome of tumor cells. Oncogene activation and tumor suppressor gene inactivation are often attributed to copy number gain/amplification or deletion, respectively, in many cancer types and stages. Recent advances in next generation sequencing protocols allow for the addition of unique molecular identifiers (UMI) to each read. Each targeted DNA fragment is labeled with a unique random nucleotide sequence added to sequencing primers. UMI are especially useful for CNV detection by making each DNA molecule in a population of reads distinct. RESULTS: Here, we present molecular Copy Number Alteration (mCNA), a new methodology allowing the detection of copy number changes using UMI. The algorithm is composed of four main steps: the construction of UMI count matrices, the use of control samples to construct a pseudo-reference, the computation of log-ratios, the segmentation and finally the statistical inference of abnormal segmented breaks. We demonstrate the success of mCNA on a dataset of patients suffering from Diffuse Large B-cell Lymphoma and we highlight that mCNA results have a strong correlation with comparative genomic hybridization. CONCLUSION: We provide mCNA, a new approach for CNV detection, freely available at https://gitlab.com/pierrejulien.viailly/mcna/ under MIT license. mCNA can significantly improve detection accuracy of CNV changes by using UMI. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04060-4. BioMed Central 2021-03-12 /pmc/articles/PMC7971104/ /pubmed/33711922 http://dx.doi.org/10.1186/s12859-021-04060-4 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Viailly, Pierre-Julien Sater, Vincent Viennot, Mathieu Bohers, Elodie Vergne, Nicolas Berard, Caroline Dauchel, Hélène Lecroq, Thierry Celebi, Alison Ruminy, Philippe Marchand, Vinciane Lanic, Marie-Delphine Dubois, Sydney Penther, Dominique Tilly, Hervé Mareschal, Sylvain Jardin, Fabrice Improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers |
title | Improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers |
title_full | Improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers |
title_fullStr | Improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers |
title_full_unstemmed | Improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers |
title_short | Improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers |
title_sort | improving high-resolution copy number variation analysis from next generation sequencing using unique molecular identifiers |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7971104/ https://www.ncbi.nlm.nih.gov/pubmed/33711922 http://dx.doi.org/10.1186/s12859-021-04060-4 |
work_keys_str_mv | AT viaillypierrejulien improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT satervincent improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT viennotmathieu improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT boherselodie improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT vergnenicolas improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT berardcaroline improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT dauchelhelene improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT lecroqthierry improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT celebialison improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT ruminyphilippe improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT marchandvinciane improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT lanicmariedelphine improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT duboissydney improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT pentherdominique improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT tillyherve improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT mareschalsylvain improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers AT jardinfabrice improvinghighresolutioncopynumbervariationanalysisfromnextgenerationsequencingusinguniquemolecularidentifiers |