Cargando…

Identification of a small optimal subset of CpG sites as bio-markers from high-throughput DNA methylation profiles

BACKGROUND: DNA methylation patterns have been shown to significantly correlate with different tissue types and disease states. High-throughput methylation arrays enable large-scale DNA methylation analysis to identify informative DNA methylation biomarkers. The identification of disease-specific me...

Descripción completa

Detalles Bibliográficos
Autores principales: Meng, Hailong, Murrelle, Edward L, Li, Guoya
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613930/
https://www.ncbi.nlm.nih.gov/pubmed/18954440
http://dx.doi.org/10.1186/1471-2105-9-457
_version_ 1782163214494072832
author Meng, Hailong
Murrelle, Edward L
Li, Guoya
author_facet Meng, Hailong
Murrelle, Edward L
Li, Guoya
author_sort Meng, Hailong
collection PubMed
description BACKGROUND: DNA methylation patterns have been shown to significantly correlate with different tissue types and disease states. High-throughput methylation arrays enable large-scale DNA methylation analysis to identify informative DNA methylation biomarkers. The identification of disease-specific methylation signatures is of fundamental and practical interest for risk assessment, diagnosis, and prognosis of diseases. RESULTS: Using published high-throughput DNA methylation data, a two-stage feature selection method was developed to select a small optimal subset of DNA methylation features to precisely classify two sample groups. With this approach, a small number of CpG sites were highly sensitive and specific in distinguishing lung cancer tissue samples from normal lung tissue samples. CONCLUSION: This study shows that it is feasible to identify DNA methylation biomarkers from high-throughput DNA methylation profiles and that a small number of signature CpG sites can suffice to classify two groups of samples. The computational method we developed in the study is efficient to identify signature CpG sites from disease samples with complex methylation patterns.
format Text
id pubmed-2613930
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26139302009-01-12 Identification of a small optimal subset of CpG sites as bio-markers from high-throughput DNA methylation profiles Meng, Hailong Murrelle, Edward L Li, Guoya BMC Bioinformatics Research Article BACKGROUND: DNA methylation patterns have been shown to significantly correlate with different tissue types and disease states. High-throughput methylation arrays enable large-scale DNA methylation analysis to identify informative DNA methylation biomarkers. The identification of disease-specific methylation signatures is of fundamental and practical interest for risk assessment, diagnosis, and prognosis of diseases. RESULTS: Using published high-throughput DNA methylation data, a two-stage feature selection method was developed to select a small optimal subset of DNA methylation features to precisely classify two sample groups. With this approach, a small number of CpG sites were highly sensitive and specific in distinguishing lung cancer tissue samples from normal lung tissue samples. CONCLUSION: This study shows that it is feasible to identify DNA methylation biomarkers from high-throughput DNA methylation profiles and that a small number of signature CpG sites can suffice to classify two groups of samples. The computational method we developed in the study is efficient to identify signature CpG sites from disease samples with complex methylation patterns. BioMed Central 2008-10-27 /pmc/articles/PMC2613930/ /pubmed/18954440 http://dx.doi.org/10.1186/1471-2105-9-457 Text en Copyright © 2008 Meng et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Meng, Hailong
Murrelle, Edward L
Li, Guoya
Identification of a small optimal subset of CpG sites as bio-markers from high-throughput DNA methylation profiles
title Identification of a small optimal subset of CpG sites as bio-markers from high-throughput DNA methylation profiles
title_full Identification of a small optimal subset of CpG sites as bio-markers from high-throughput DNA methylation profiles
title_fullStr Identification of a small optimal subset of CpG sites as bio-markers from high-throughput DNA methylation profiles
title_full_unstemmed Identification of a small optimal subset of CpG sites as bio-markers from high-throughput DNA methylation profiles
title_short Identification of a small optimal subset of CpG sites as bio-markers from high-throughput DNA methylation profiles
title_sort identification of a small optimal subset of cpg sites as bio-markers from high-throughput dna methylation profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613930/
https://www.ncbi.nlm.nih.gov/pubmed/18954440
http://dx.doi.org/10.1186/1471-2105-9-457
work_keys_str_mv AT menghailong identificationofasmalloptimalsubsetofcpgsitesasbiomarkersfromhighthroughputdnamethylationprofiles
AT murrelleedwardl identificationofasmalloptimalsubsetofcpgsitesasbiomarkersfromhighthroughputdnamethylationprofiles
AT liguoya identificationofasmalloptimalsubsetofcpgsitesasbiomarkersfromhighthroughputdnamethylationprofiles