Cargando…

SEG - A Software Program for Finding Somatic Copy Number Alterations in Whole Genome Sequencing Data of Cancer

As next-generation sequencing technology advances and the cost decreases, whole genome sequencing (WGS) has become the preferred platform for the identification of somatic copy number alteration (CNA) events in cancer genomes. To more effectively decipher these massive sequencing data, we developed...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Mucheng, Liu, Deli, Tang, Jie, Feng, Yuan, Wang, Tianfang, Dobbin, Kevin K., Schliekelman, Paul, Zhao, Shaying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6154469/
https://www.ncbi.nlm.nih.gov/pubmed/30258547
http://dx.doi.org/10.1016/j.csbj.2018.09.001
_version_ 1783357696354811904
author Zhang, Mucheng
Liu, Deli
Tang, Jie
Feng, Yuan
Wang, Tianfang
Dobbin, Kevin K.
Schliekelman, Paul
Zhao, Shaying
author_facet Zhang, Mucheng
Liu, Deli
Tang, Jie
Feng, Yuan
Wang, Tianfang
Dobbin, Kevin K.
Schliekelman, Paul
Zhao, Shaying
author_sort Zhang, Mucheng
collection PubMed
description As next-generation sequencing technology advances and the cost decreases, whole genome sequencing (WGS) has become the preferred platform for the identification of somatic copy number alteration (CNA) events in cancer genomes. To more effectively decipher these massive sequencing data, we developed a software program named SEG, shortened from the word “segment”. SEG utilizes mapped read or fragment density for CNA discovery. To reduce CNA artifacts arisen from sequencing and mapping biases, SEG first normalizes the data by taking the log(2)-ratio of each tumor density against its matching normal density. SEG then uses dynamic programming to find change-points among a contiguous log(2)-ratio data series along a chromosome, dividing the chromosome into different segments. SEG finally identifies those segments having CNA. Our analyses with both simulated and real sequencing data indicate that SEG finds more small CNAs than other published software tools.
format Online
Article
Text
id pubmed-6154469
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-61544692018-09-26 SEG - A Software Program for Finding Somatic Copy Number Alterations in Whole Genome Sequencing Data of Cancer Zhang, Mucheng Liu, Deli Tang, Jie Feng, Yuan Wang, Tianfang Dobbin, Kevin K. Schliekelman, Paul Zhao, Shaying Comput Struct Biotechnol J Research Article As next-generation sequencing technology advances and the cost decreases, whole genome sequencing (WGS) has become the preferred platform for the identification of somatic copy number alteration (CNA) events in cancer genomes. To more effectively decipher these massive sequencing data, we developed a software program named SEG, shortened from the word “segment”. SEG utilizes mapped read or fragment density for CNA discovery. To reduce CNA artifacts arisen from sequencing and mapping biases, SEG first normalizes the data by taking the log(2)-ratio of each tumor density against its matching normal density. SEG then uses dynamic programming to find change-points among a contiguous log(2)-ratio data series along a chromosome, dividing the chromosome into different segments. SEG finally identifies those segments having CNA. Our analyses with both simulated and real sequencing data indicate that SEG finds more small CNAs than other published software tools. Research Network of Computational and Structural Biotechnology 2018-09-07 /pmc/articles/PMC6154469/ /pubmed/30258547 http://dx.doi.org/10.1016/j.csbj.2018.09.001 Text en © 2018 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Zhang, Mucheng
Liu, Deli
Tang, Jie
Feng, Yuan
Wang, Tianfang
Dobbin, Kevin K.
Schliekelman, Paul
Zhao, Shaying
SEG - A Software Program for Finding Somatic Copy Number Alterations in Whole Genome Sequencing Data of Cancer
title SEG - A Software Program for Finding Somatic Copy Number Alterations in Whole Genome Sequencing Data of Cancer
title_full SEG - A Software Program for Finding Somatic Copy Number Alterations in Whole Genome Sequencing Data of Cancer
title_fullStr SEG - A Software Program for Finding Somatic Copy Number Alterations in Whole Genome Sequencing Data of Cancer
title_full_unstemmed SEG - A Software Program for Finding Somatic Copy Number Alterations in Whole Genome Sequencing Data of Cancer
title_short SEG - A Software Program for Finding Somatic Copy Number Alterations in Whole Genome Sequencing Data of Cancer
title_sort seg - a software program for finding somatic copy number alterations in whole genome sequencing data of cancer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6154469/
https://www.ncbi.nlm.nih.gov/pubmed/30258547
http://dx.doi.org/10.1016/j.csbj.2018.09.001
work_keys_str_mv AT zhangmucheng segasoftwareprogramforfindingsomaticcopynumberalterationsinwholegenomesequencingdataofcancer
AT liudeli segasoftwareprogramforfindingsomaticcopynumberalterationsinwholegenomesequencingdataofcancer
AT tangjie segasoftwareprogramforfindingsomaticcopynumberalterationsinwholegenomesequencingdataofcancer
AT fengyuan segasoftwareprogramforfindingsomaticcopynumberalterationsinwholegenomesequencingdataofcancer
AT wangtianfang segasoftwareprogramforfindingsomaticcopynumberalterationsinwholegenomesequencingdataofcancer
AT dobbinkevink segasoftwareprogramforfindingsomaticcopynumberalterationsinwholegenomesequencingdataofcancer
AT schliekelmanpaul segasoftwareprogramforfindingsomaticcopynumberalterationsinwholegenomesequencingdataofcancer
AT zhaoshaying segasoftwareprogramforfindingsomaticcopynumberalterationsinwholegenomesequencingdataofcancer