Cargando…

Network construction and structure detection with metagenomic count data

BACKGROUND: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhenqiu, Lin, Shili, Piantadosi, Steven
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676895/
https://www.ncbi.nlm.nih.gov/pubmed/26692900
http://dx.doi.org/10.1186/s13040-015-0072-2
_version_ 1782405252229627904
author Liu, Zhenqiu
Lin, Shili
Piantadosi, Steven
author_facet Liu, Zhenqiu
Lin, Shili
Piantadosi, Steven
author_sort Liu, Zhenqiu
collection PubMed
description BACKGROUND: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform these vast amounts of data into useful knowledge is a great challenge to bioinformaticians. Microbial biodiversity research has focused primarily on taxa composition and abundance and less on the co-occurrences among different taxa. However, taxa co-occurrences and their relationships to environmental and clinical conditions are important because network structure may help to understand how microbial taxa function together. RESULTS: We propose a systematic robust approach for bacteria network construction and structure detection using metagenomic count data. Pairwise similarity/distance measures between taxa are proposed by adapting distance measures for samples in ecology. We also extend the sparse inverse covariance approach to a sparse inverse of a similarity matrix from count data for network construction. Our approach is efficient for large metagenomic count data with thousands of bacterial taxa. We evaluate our method with real and simulated data. Our method identifies true and biologically significant network structures efficiently. CONCLUSIONS: Network analysis is crucial for detecting subnetwork structures with metagenomic count data. We developed a software tool in MATLAB for network construction and biologically significant module detection. Software MetaNet can be downloaded from http://biostatistics.csmc.edu/MetaNet/.
format Online
Article
Text
id pubmed-4676895
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46768952015-12-13 Network construction and structure detection with metagenomic count data Liu, Zhenqiu Lin, Shili Piantadosi, Steven BioData Min Methodology BACKGROUND: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform these vast amounts of data into useful knowledge is a great challenge to bioinformaticians. Microbial biodiversity research has focused primarily on taxa composition and abundance and less on the co-occurrences among different taxa. However, taxa co-occurrences and their relationships to environmental and clinical conditions are important because network structure may help to understand how microbial taxa function together. RESULTS: We propose a systematic robust approach for bacteria network construction and structure detection using metagenomic count data. Pairwise similarity/distance measures between taxa are proposed by adapting distance measures for samples in ecology. We also extend the sparse inverse covariance approach to a sparse inverse of a similarity matrix from count data for network construction. Our approach is efficient for large metagenomic count data with thousands of bacterial taxa. We evaluate our method with real and simulated data. Our method identifies true and biologically significant network structures efficiently. CONCLUSIONS: Network analysis is crucial for detecting subnetwork structures with metagenomic count data. We developed a software tool in MATLAB for network construction and biologically significant module detection. Software MetaNet can be downloaded from http://biostatistics.csmc.edu/MetaNet/. BioMed Central 2015-12-12 /pmc/articles/PMC4676895/ /pubmed/26692900 http://dx.doi.org/10.1186/s13040-015-0072-2 Text en © Liu et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Liu, Zhenqiu
Lin, Shili
Piantadosi, Steven
Network construction and structure detection with metagenomic count data
title Network construction and structure detection with metagenomic count data
title_full Network construction and structure detection with metagenomic count data
title_fullStr Network construction and structure detection with metagenomic count data
title_full_unstemmed Network construction and structure detection with metagenomic count data
title_short Network construction and structure detection with metagenomic count data
title_sort network construction and structure detection with metagenomic count data
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676895/
https://www.ncbi.nlm.nih.gov/pubmed/26692900
http://dx.doi.org/10.1186/s13040-015-0072-2
work_keys_str_mv AT liuzhenqiu networkconstructionandstructuredetectionwithmetagenomiccountdata
AT linshili networkconstructionandstructuredetectionwithmetagenomiccountdata
AT piantadosisteven networkconstructionandstructuredetectionwithmetagenomiccountdata