Cargando…

ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data

BACKGROUND: Copy number alterations (CNAs) in genomic DNA have been associated with complex human diseases, including cancer. One of the most common techniques to detect CNAs is array-based comparative genomic hybridization (aCGH). The availability of aCGH platforms and the need for identification o...

Descripción completa

Detalles Bibliográficos
Autores principales: Díaz-Uriarte, Ramón, Rueda, Oscar M.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1940324/
https://www.ncbi.nlm.nih.gov/pubmed/17710137
http://dx.doi.org/10.1371/journal.pone.0000737
_version_ 1782134461963436032
author Díaz-Uriarte, Ramón
Rueda, Oscar M.
author_facet Díaz-Uriarte, Ramón
Rueda, Oscar M.
author_sort Díaz-Uriarte, Ramón
collection PubMed
description BACKGROUND: Copy number alterations (CNAs) in genomic DNA have been associated with complex human diseases, including cancer. One of the most common techniques to detect CNAs is array-based comparative genomic hybridization (aCGH). The availability of aCGH platforms and the need for identification of CNAs has resulted in a wealth of methodological studies. METHODOLOGY/PRINCIPAL FINDINGS: ADaCGH is an R package and a web-based application for the analysis of aCGH data. It implements eight methods for detection of CNAs, gains and losses of genomic DNA, including all of the best performing ones from two recent reviews (CBS, GLAD, CGHseg, HMM). For improved speed, we use parallel computing (via MPI). Additional information (GO terms, PubMed citations, KEGG and Reactome pathways) is available for individual genes, and for sets of genes with altered copy numbers. CONCLUSIONS/SIGNIFICANCE: ADaCGH represents a qualitative increase in the standards of these types of applications: a) all of the best performing algorithms are included, not just one or two; b) we do not limit ourselves to providing a thin layer of CGI on top of existing BioConductor packages, but instead carefully use parallelization, examining different schemes, and are able to achieve significant decreases in user waiting time (factors up to 45×); c) we have added functionality not currently available in some methods, to adapt to recent recommendations (e.g., merging of segmentation results in wavelet-based and CGHseg algorithms); d) we incorporate redundancy, fault-tolerance and checkpointing, which are unique among web-based, parallelized applications; e) all of the code is available under open source licenses, allowing to build upon, copy, and adapt our code for other software projects.
format Text
id pubmed-1940324
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-19403242007-08-16 ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data Díaz-Uriarte, Ramón Rueda, Oscar M. PLoS One Research Article BACKGROUND: Copy number alterations (CNAs) in genomic DNA have been associated with complex human diseases, including cancer. One of the most common techniques to detect CNAs is array-based comparative genomic hybridization (aCGH). The availability of aCGH platforms and the need for identification of CNAs has resulted in a wealth of methodological studies. METHODOLOGY/PRINCIPAL FINDINGS: ADaCGH is an R package and a web-based application for the analysis of aCGH data. It implements eight methods for detection of CNAs, gains and losses of genomic DNA, including all of the best performing ones from two recent reviews (CBS, GLAD, CGHseg, HMM). For improved speed, we use parallel computing (via MPI). Additional information (GO terms, PubMed citations, KEGG and Reactome pathways) is available for individual genes, and for sets of genes with altered copy numbers. CONCLUSIONS/SIGNIFICANCE: ADaCGH represents a qualitative increase in the standards of these types of applications: a) all of the best performing algorithms are included, not just one or two; b) we do not limit ourselves to providing a thin layer of CGI on top of existing BioConductor packages, but instead carefully use parallelization, examining different schemes, and are able to achieve significant decreases in user waiting time (factors up to 45×); c) we have added functionality not currently available in some methods, to adapt to recent recommendations (e.g., merging of segmentation results in wavelet-based and CGHseg algorithms); d) we incorporate redundancy, fault-tolerance and checkpointing, which are unique among web-based, parallelized applications; e) all of the code is available under open source licenses, allowing to build upon, copy, and adapt our code for other software projects. Public Library of Science 2007-08-15 /pmc/articles/PMC1940324/ /pubmed/17710137 http://dx.doi.org/10.1371/journal.pone.0000737 Text en Diaz-Uriarte, Rueda. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Díaz-Uriarte, Ramón
Rueda, Oscar M.
ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data
title ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data
title_full ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data
title_fullStr ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data
title_full_unstemmed ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data
title_short ADaCGH: A Parallelized Web-Based Application and R Package for the Analysis of aCGH Data
title_sort adacgh: a parallelized web-based application and r package for the analysis of acgh data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1940324/
https://www.ncbi.nlm.nih.gov/pubmed/17710137
http://dx.doi.org/10.1371/journal.pone.0000737
work_keys_str_mv AT diazuriarteramon adacghaparallelizedwebbasedapplicationandrpackagefortheanalysisofacghdata
AT ruedaoscarm adacghaparallelizedwebbasedapplicationandrpackagefortheanalysisofacghdata