Cargando…

Interpreting genomic data via entropic dissection

Since the emergence of high-throughput genome sequencing platforms and more recently the next-generation platforms, the genome databases are growing at an astronomical rate. Tremendous efforts have been invested in recent years in understanding intriguing complexities beneath the vast ocean of genom...

Descripción completa

Detalles Bibliográficos
Autores principales: Azad, Rajeev K., Li, Jing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592408/
https://www.ncbi.nlm.nih.gov/pubmed/23036836
http://dx.doi.org/10.1093/nar/gks917
_version_ 1782262109043687424
author Azad, Rajeev K.
Li, Jing
author_facet Azad, Rajeev K.
Li, Jing
author_sort Azad, Rajeev K.
collection PubMed
description Since the emergence of high-throughput genome sequencing platforms and more recently the next-generation platforms, the genome databases are growing at an astronomical rate. Tremendous efforts have been invested in recent years in understanding intriguing complexities beneath the vast ocean of genomic data. This is apparent in the spurt of computational methods for interpreting these data in the past few years. Genomic data interpretation is notoriously difficult, partly owing to the inherent heterogeneities appearing at different scales. Methods developed to interpret these data often suffer from their inability to adequately measure the underlying heterogeneities and thus lead to confounding results. Here, we present an information entropy-based approach that unravels the distinctive patterns underlying genomic data efficiently and thus is applicable in addressing a variety of biological problems. We show the robustness and consistency of the proposed methodology in addressing three different biological problems of significance—identification of alien DNAs in bacterial genomes, detection of structural variants in cancer cell lines and alignment-free genome comparison.
format Online
Article
Text
id pubmed-3592408
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35924082013-03-08 Interpreting genomic data via entropic dissection Azad, Rajeev K. Li, Jing Nucleic Acids Res Methods Online Since the emergence of high-throughput genome sequencing platforms and more recently the next-generation platforms, the genome databases are growing at an astronomical rate. Tremendous efforts have been invested in recent years in understanding intriguing complexities beneath the vast ocean of genomic data. This is apparent in the spurt of computational methods for interpreting these data in the past few years. Genomic data interpretation is notoriously difficult, partly owing to the inherent heterogeneities appearing at different scales. Methods developed to interpret these data often suffer from their inability to adequately measure the underlying heterogeneities and thus lead to confounding results. Here, we present an information entropy-based approach that unravels the distinctive patterns underlying genomic data efficiently and thus is applicable in addressing a variety of biological problems. We show the robustness and consistency of the proposed methodology in addressing three different biological problems of significance—identification of alien DNAs in bacterial genomes, detection of structural variants in cancer cell lines and alignment-free genome comparison. Oxford University Press 2013-01 2012-10-03 /pmc/articles/PMC3592408/ /pubmed/23036836 http://dx.doi.org/10.1093/nar/gks917 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Methods Online
Azad, Rajeev K.
Li, Jing
Interpreting genomic data via entropic dissection
title Interpreting genomic data via entropic dissection
title_full Interpreting genomic data via entropic dissection
title_fullStr Interpreting genomic data via entropic dissection
title_full_unstemmed Interpreting genomic data via entropic dissection
title_short Interpreting genomic data via entropic dissection
title_sort interpreting genomic data via entropic dissection
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592408/
https://www.ncbi.nlm.nih.gov/pubmed/23036836
http://dx.doi.org/10.1093/nar/gks917
work_keys_str_mv AT azadrajeevk interpretinggenomicdataviaentropicdissection
AT lijing interpretinggenomicdataviaentropicdissection