Cargando…
Network construction and structure detection with metagenomic count data
BACKGROUND: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676895/ https://www.ncbi.nlm.nih.gov/pubmed/26692900 http://dx.doi.org/10.1186/s13040-015-0072-2 |
_version_ | 1782405252229627904 |
---|---|
author | Liu, Zhenqiu Lin, Shili Piantadosi, Steven |
author_facet | Liu, Zhenqiu Lin, Shili Piantadosi, Steven |
author_sort | Liu, Zhenqiu |
collection | PubMed |
description | BACKGROUND: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform these vast amounts of data into useful knowledge is a great challenge to bioinformaticians. Microbial biodiversity research has focused primarily on taxa composition and abundance and less on the co-occurrences among different taxa. However, taxa co-occurrences and their relationships to environmental and clinical conditions are important because network structure may help to understand how microbial taxa function together. RESULTS: We propose a systematic robust approach for bacteria network construction and structure detection using metagenomic count data. Pairwise similarity/distance measures between taxa are proposed by adapting distance measures for samples in ecology. We also extend the sparse inverse covariance approach to a sparse inverse of a similarity matrix from count data for network construction. Our approach is efficient for large metagenomic count data with thousands of bacterial taxa. We evaluate our method with real and simulated data. Our method identifies true and biologically significant network structures efficiently. CONCLUSIONS: Network analysis is crucial for detecting subnetwork structures with metagenomic count data. We developed a software tool in MATLAB for network construction and biologically significant module detection. Software MetaNet can be downloaded from http://biostatistics.csmc.edu/MetaNet/. |
format | Online Article Text |
id | pubmed-4676895 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46768952015-12-13 Network construction and structure detection with metagenomic count data Liu, Zhenqiu Lin, Shili Piantadosi, Steven BioData Min Methodology BACKGROUND: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform these vast amounts of data into useful knowledge is a great challenge to bioinformaticians. Microbial biodiversity research has focused primarily on taxa composition and abundance and less on the co-occurrences among different taxa. However, taxa co-occurrences and their relationships to environmental and clinical conditions are important because network structure may help to understand how microbial taxa function together. RESULTS: We propose a systematic robust approach for bacteria network construction and structure detection using metagenomic count data. Pairwise similarity/distance measures between taxa are proposed by adapting distance measures for samples in ecology. We also extend the sparse inverse covariance approach to a sparse inverse of a similarity matrix from count data for network construction. Our approach is efficient for large metagenomic count data with thousands of bacterial taxa. We evaluate our method with real and simulated data. Our method identifies true and biologically significant network structures efficiently. CONCLUSIONS: Network analysis is crucial for detecting subnetwork structures with metagenomic count data. We developed a software tool in MATLAB for network construction and biologically significant module detection. Software MetaNet can be downloaded from http://biostatistics.csmc.edu/MetaNet/. BioMed Central 2015-12-12 /pmc/articles/PMC4676895/ /pubmed/26692900 http://dx.doi.org/10.1186/s13040-015-0072-2 Text en © Liu et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Liu, Zhenqiu Lin, Shili Piantadosi, Steven Network construction and structure detection with metagenomic count data |
title | Network construction and structure detection with metagenomic count data |
title_full | Network construction and structure detection with metagenomic count data |
title_fullStr | Network construction and structure detection with metagenomic count data |
title_full_unstemmed | Network construction and structure detection with metagenomic count data |
title_short | Network construction and structure detection with metagenomic count data |
title_sort | network construction and structure detection with metagenomic count data |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676895/ https://www.ncbi.nlm.nih.gov/pubmed/26692900 http://dx.doi.org/10.1186/s13040-015-0072-2 |
work_keys_str_mv | AT liuzhenqiu networkconstructionandstructuredetectionwithmetagenomiccountdata AT linshili networkconstructionandstructuredetectionwithmetagenomiccountdata AT piantadosisteven networkconstructionandstructuredetectionwithmetagenomiccountdata |