Cargando…
Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise
There have been numerous genetic and epigenetic datasets generated for the study of complex disease including neurodegenerative disease. However, analysis of such data often suffers from detecting the outliers of the samples, which subsequently affects the extraction of the true biological signals i...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7747741/ https://www.ncbi.nlm.nih.gov/pubmed/33335112 http://dx.doi.org/10.1038/s41598-020-78463-3 |
_version_ | 1783624994871312384 |
---|---|
author | Mallik, Saurav Zhao, Zhongming |
author_facet | Mallik, Saurav Zhao, Zhongming |
author_sort | Mallik, Saurav |
collection | PubMed |
description | There have been numerous genetic and epigenetic datasets generated for the study of complex disease including neurodegenerative disease. However, analysis of such data often suffers from detecting the outliers of the samples, which subsequently affects the extraction of the true biological signals involved in the disease. To address this critical issue, we developed a novel framework for identifying methylation signatures using consecutive adaptation of a well-known outlier detection algorithm, density based clustering of applications with reducing noise (DBSCAN) followed by hierarchical clustering. We applied the framework to two representative neurodegenerative diseases, Alzheimer’s disease (AD) and Down syndrome (DS), using DNA methylation datasets from public sources (Gene Expression Omnibus, GEO accession ID: GSE74486). We first applied DBSCAN algorithm to eliminate outliers, and then used Limma statistical method to determine differentially methylated genes. Next, hierarchical clustering technique was applied to detect gene modules. Our analysis identified a methylation signature comprising 21 genes for AD and a methylation signature comprising 89 genes for DS, respectively. Our evaluation indicated that these two signatures could lead to high classification accuracy values (92% and 70%) for these two diseases. In summary, this framework will be useful to better detect outlier-free genetic and epigenetic signatures in various complex diseases and their developmental stages. |
format | Online Article Text |
id | pubmed-7747741 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-77477412020-12-22 Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise Mallik, Saurav Zhao, Zhongming Sci Rep Article There have been numerous genetic and epigenetic datasets generated for the study of complex disease including neurodegenerative disease. However, analysis of such data often suffers from detecting the outliers of the samples, which subsequently affects the extraction of the true biological signals involved in the disease. To address this critical issue, we developed a novel framework for identifying methylation signatures using consecutive adaptation of a well-known outlier detection algorithm, density based clustering of applications with reducing noise (DBSCAN) followed by hierarchical clustering. We applied the framework to two representative neurodegenerative diseases, Alzheimer’s disease (AD) and Down syndrome (DS), using DNA methylation datasets from public sources (Gene Expression Omnibus, GEO accession ID: GSE74486). We first applied DBSCAN algorithm to eliminate outliers, and then used Limma statistical method to determine differentially methylated genes. Next, hierarchical clustering technique was applied to detect gene modules. Our analysis identified a methylation signature comprising 21 genes for AD and a methylation signature comprising 89 genes for DS, respectively. Our evaluation indicated that these two signatures could lead to high classification accuracy values (92% and 70%) for these two diseases. In summary, this framework will be useful to better detect outlier-free genetic and epigenetic signatures in various complex diseases and their developmental stages. Nature Publishing Group UK 2020-12-17 /pmc/articles/PMC7747741/ /pubmed/33335112 http://dx.doi.org/10.1038/s41598-020-78463-3 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Mallik, Saurav Zhao, Zhongming Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise |
title | Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise |
title_full | Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise |
title_fullStr | Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise |
title_full_unstemmed | Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise |
title_short | Detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise |
title_sort | detecting methylation signatures in neurodegenerative disease by density-based clustering of applications with reducing noise |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7747741/ https://www.ncbi.nlm.nih.gov/pubmed/33335112 http://dx.doi.org/10.1038/s41598-020-78463-3 |
work_keys_str_mv | AT malliksaurav detectingmethylationsignaturesinneurodegenerativediseasebydensitybasedclusteringofapplicationswithreducingnoise AT zhaozhongming detectingmethylationsignaturesinneurodegenerativediseasebydensitybasedclusteringofapplicationswithreducingnoise |