Cargando…

A Statistical Framework to Predict Functional Non-Coding Regions in the Human Genome Through Integrated Analysis of Annotation Data

Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These effor...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Qiongshi, Hu, Yiming, Sun, Jiehuan, Cheng, Yuwei, Cheung, Kei-Hoi, Zhao, Hongyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4444969/
https://www.ncbi.nlm.nih.gov/pubmed/26015273
http://dx.doi.org/10.1038/srep10576
_version_ 1782373211175911424
author Lu, Qiongshi
Hu, Yiming
Sun, Jiehuan
Cheng, Yuwei
Cheung, Kei-Hoi
Zhao, Hongyu
author_facet Lu, Qiongshi
Hu, Yiming
Sun, Jiehuan
Cheng, Yuwei
Cheung, Kei-Hoi
Zhao, Hongyu
author_sort Lu, Qiongshi
collection PubMed
description Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu
format Online
Article
Text
id pubmed-4444969
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-44449692015-06-01 A Statistical Framework to Predict Functional Non-Coding Regions in the Human Genome Through Integrated Analysis of Annotation Data Lu, Qiongshi Hu, Yiming Sun, Jiehuan Cheng, Yuwei Cheung, Kei-Hoi Zhao, Hongyu Sci Rep Article Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu Nature Publishing Group 2015-06-30 /pmc/articles/PMC4444969/ /pubmed/26015273 http://dx.doi.org/10.1038/srep10576 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Lu, Qiongshi
Hu, Yiming
Sun, Jiehuan
Cheng, Yuwei
Cheung, Kei-Hoi
Zhao, Hongyu
A Statistical Framework to Predict Functional Non-Coding Regions in the Human Genome Through Integrated Analysis of Annotation Data
title A Statistical Framework to Predict Functional Non-Coding Regions in the Human Genome Through Integrated Analysis of Annotation Data
title_full A Statistical Framework to Predict Functional Non-Coding Regions in the Human Genome Through Integrated Analysis of Annotation Data
title_fullStr A Statistical Framework to Predict Functional Non-Coding Regions in the Human Genome Through Integrated Analysis of Annotation Data
title_full_unstemmed A Statistical Framework to Predict Functional Non-Coding Regions in the Human Genome Through Integrated Analysis of Annotation Data
title_short A Statistical Framework to Predict Functional Non-Coding Regions in the Human Genome Through Integrated Analysis of Annotation Data
title_sort statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4444969/
https://www.ncbi.nlm.nih.gov/pubmed/26015273
http://dx.doi.org/10.1038/srep10576
work_keys_str_mv AT luqiongshi astatisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT huyiming astatisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT sunjiehuan astatisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT chengyuwei astatisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT cheungkeihoi astatisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT zhaohongyu astatisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT luqiongshi statisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT huyiming statisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT sunjiehuan statisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT chengyuwei statisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT cheungkeihoi statisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata
AT zhaohongyu statisticalframeworktopredictfunctionalnoncodingregionsinthehumangenomethroughintegratedanalysisofannotationdata