Cargando…

Methods for high-throughput MethylCap-Seq data analysis

BACKGROUND: Advances in whole genome profiling have revolutionized the cancer research field, but at the same time have raised new bioinformatics challenges. For next generation sequencing (NGS), these include data storage, computational costs, sequence processing and alignment, delineating appropri...

Descripción completa

Detalles Bibliográficos
Autores principales: Rodriguez, Benjamin AT, Frankhouser, David, Murphy, Mark, Trimarchi, Michael, Tam, Hok-Hei, Curfman, John, Huang, Rita, Chan, Michael WY, Lai, Hung-Cheng, Parikh, Deval, Ball, Bryan, Schwind, Sebastian, Blum, William, Marcucci, Guido, Yan, Pearlly, Bundschuh, Ralf
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481483/
https://www.ncbi.nlm.nih.gov/pubmed/23134780
http://dx.doi.org/10.1186/1471-2164-13-S6-S14
_version_ 1782247749137203200
author Rodriguez, Benjamin AT
Frankhouser, David
Murphy, Mark
Trimarchi, Michael
Tam, Hok-Hei
Curfman, John
Huang, Rita
Chan, Michael WY
Lai, Hung-Cheng
Parikh, Deval
Ball, Bryan
Schwind, Sebastian
Blum, William
Marcucci, Guido
Yan, Pearlly
Bundschuh, Ralf
author_facet Rodriguez, Benjamin AT
Frankhouser, David
Murphy, Mark
Trimarchi, Michael
Tam, Hok-Hei
Curfman, John
Huang, Rita
Chan, Michael WY
Lai, Hung-Cheng
Parikh, Deval
Ball, Bryan
Schwind, Sebastian
Blum, William
Marcucci, Guido
Yan, Pearlly
Bundschuh, Ralf
author_sort Rodriguez, Benjamin AT
collection PubMed
description BACKGROUND: Advances in whole genome profiling have revolutionized the cancer research field, but at the same time have raised new bioinformatics challenges. For next generation sequencing (NGS), these include data storage, computational costs, sequence processing and alignment, delineating appropriate statistical measures, and data visualization. Currently there is a lack of workflows for efficient analysis of large, MethylCap-seq datasets containing multiple sample groups. METHODS: The NGS application MethylCap-seq involves the in vitro capture of methylated DNA and subsequent analysis of enriched fragments by massively parallel sequencing. The workflow we describe performs MethylCap-seq experimental Quality Control (QC), sequence file processing and alignment, differential methylation analysis of multiple biological groups, hierarchical clustering, assessment of genome-wide methylation patterns, and preparation of files for data visualization. RESULTS: Here, we present a scalable, flexible workflow for MethylCap-seq QC, secondary data analysis, tertiary analysis of multiple experimental groups, and data visualization. We demonstrate the experimental QC procedure with results from a large ovarian cancer study dataset and propose parameters which can identify problematic experiments. Promoter methylation profiling and hierarchical clustering analyses are demonstrated for four groups of acute myeloid leukemia (AML) patients. We propose a Global Methylation Indicator (GMI) function to assess genome-wide changes in methylation patterns between experimental groups. We also show how the workflow facilitates data visualization in a web browser with the application Anno-J. CONCLUSIONS: This workflow and its suite of features will assist biologists in conducting methylation profiling projects and facilitate meaningful biological interpretation.
format Online
Article
Text
id pubmed-3481483
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34814832012-11-02 Methods for high-throughput MethylCap-Seq data analysis Rodriguez, Benjamin AT Frankhouser, David Murphy, Mark Trimarchi, Michael Tam, Hok-Hei Curfman, John Huang, Rita Chan, Michael WY Lai, Hung-Cheng Parikh, Deval Ball, Bryan Schwind, Sebastian Blum, William Marcucci, Guido Yan, Pearlly Bundschuh, Ralf BMC Genomics Research BACKGROUND: Advances in whole genome profiling have revolutionized the cancer research field, but at the same time have raised new bioinformatics challenges. For next generation sequencing (NGS), these include data storage, computational costs, sequence processing and alignment, delineating appropriate statistical measures, and data visualization. Currently there is a lack of workflows for efficient analysis of large, MethylCap-seq datasets containing multiple sample groups. METHODS: The NGS application MethylCap-seq involves the in vitro capture of methylated DNA and subsequent analysis of enriched fragments by massively parallel sequencing. The workflow we describe performs MethylCap-seq experimental Quality Control (QC), sequence file processing and alignment, differential methylation analysis of multiple biological groups, hierarchical clustering, assessment of genome-wide methylation patterns, and preparation of files for data visualization. RESULTS: Here, we present a scalable, flexible workflow for MethylCap-seq QC, secondary data analysis, tertiary analysis of multiple experimental groups, and data visualization. We demonstrate the experimental QC procedure with results from a large ovarian cancer study dataset and propose parameters which can identify problematic experiments. Promoter methylation profiling and hierarchical clustering analyses are demonstrated for four groups of acute myeloid leukemia (AML) patients. We propose a Global Methylation Indicator (GMI) function to assess genome-wide changes in methylation patterns between experimental groups. We also show how the workflow facilitates data visualization in a web browser with the application Anno-J. CONCLUSIONS: This workflow and its suite of features will assist biologists in conducting methylation profiling projects and facilitate meaningful biological interpretation. BioMed Central 2012-10-26 /pmc/articles/PMC3481483/ /pubmed/23134780 http://dx.doi.org/10.1186/1471-2164-13-S6-S14 Text en Copyright ©2012 Rodriguez et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Rodriguez, Benjamin AT
Frankhouser, David
Murphy, Mark
Trimarchi, Michael
Tam, Hok-Hei
Curfman, John
Huang, Rita
Chan, Michael WY
Lai, Hung-Cheng
Parikh, Deval
Ball, Bryan
Schwind, Sebastian
Blum, William
Marcucci, Guido
Yan, Pearlly
Bundschuh, Ralf
Methods for high-throughput MethylCap-Seq data analysis
title Methods for high-throughput MethylCap-Seq data analysis
title_full Methods for high-throughput MethylCap-Seq data analysis
title_fullStr Methods for high-throughput MethylCap-Seq data analysis
title_full_unstemmed Methods for high-throughput MethylCap-Seq data analysis
title_short Methods for high-throughput MethylCap-Seq data analysis
title_sort methods for high-throughput methylcap-seq data analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481483/
https://www.ncbi.nlm.nih.gov/pubmed/23134780
http://dx.doi.org/10.1186/1471-2164-13-S6-S14
work_keys_str_mv AT rodriguezbenjaminat methodsforhighthroughputmethylcapseqdataanalysis
AT frankhouserdavid methodsforhighthroughputmethylcapseqdataanalysis
AT murphymark methodsforhighthroughputmethylcapseqdataanalysis
AT trimarchimichael methodsforhighthroughputmethylcapseqdataanalysis
AT tamhokhei methodsforhighthroughputmethylcapseqdataanalysis
AT curfmanjohn methodsforhighthroughputmethylcapseqdataanalysis
AT huangrita methodsforhighthroughputmethylcapseqdataanalysis
AT chanmichaelwy methodsforhighthroughputmethylcapseqdataanalysis
AT laihungcheng methodsforhighthroughputmethylcapseqdataanalysis
AT parikhdeval methodsforhighthroughputmethylcapseqdataanalysis
AT ballbryan methodsforhighthroughputmethylcapseqdataanalysis
AT schwindsebastian methodsforhighthroughputmethylcapseqdataanalysis
AT blumwilliam methodsforhighthroughputmethylcapseqdataanalysis
AT marcucciguido methodsforhighthroughputmethylcapseqdataanalysis
AT yanpearlly methodsforhighthroughputmethylcapseqdataanalysis
AT bundschuhralf methodsforhighthroughputmethylcapseqdataanalysis