Cargando…

SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining

Human infertility affects 10–15% of couples, half of which is attributed to the male partner. Abnormal spermatogenesis is a major cause of male infertility. Characterizing the genes involved in spermatogenesis is fundamental to understand the mechanisms underlying this biological process and in deve...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yuanwei, Zhong, Liangwen, Xu, Bo, Yang, Yifan, Ban, Rongjun, Zhu, Jun, Cooke, Howard J., Hao, QiaoMei, Shi, Qinghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3531227/
https://www.ncbi.nlm.nih.gov/pubmed/23193286
http://dx.doi.org/10.1093/nar/gks1186
_version_ 1782254139233796096
author Zhang, Yuanwei
Zhong, Liangwen
Xu, Bo
Yang, Yifan
Ban, Rongjun
Zhu, Jun
Cooke, Howard J.
Hao, QiaoMei
Shi, Qinghua
author_facet Zhang, Yuanwei
Zhong, Liangwen
Xu, Bo
Yang, Yifan
Ban, Rongjun
Zhu, Jun
Cooke, Howard J.
Hao, QiaoMei
Shi, Qinghua
author_sort Zhang, Yuanwei
collection PubMed
description Human infertility affects 10–15% of couples, half of which is attributed to the male partner. Abnormal spermatogenesis is a major cause of male infertility. Characterizing the genes involved in spermatogenesis is fundamental to understand the mechanisms underlying this biological process and in developing treatments for male infertility. Although many genes have been implicated in spermatogenesis, no dedicated bioinformatic resource for spermatogenesis is available. We have developed such a database, SpermatogenesisOnline 1.0 (http://mcg.ustc.edu.cn/sdap1/spermgenes/), using manual curation from 30 233 articles published before 1 May 2012. It provides detailed information for 1666 genes reported to participate in spermatogenesis in 37 organisms. Based on the analysis of these genes, we developed an algorithm, Greed AUC Stepwise (GAS) model, which predicted 762 genes to participate in spermatogenesis (GAS probability >0.5) based on genome-wide transcriptional data in Mus musculus testis from the ArrayExpress database. These predicted and experimentally verified genes were annotated, with several identical spermatogenesis-related GO terms being enriched for both classes. Furthermore, protein–protein interaction analysis indicates direct interactions of predicted genes with the experimentally verified ones, which supports the reliability of GAS. The strategy (manual curation and data mining) used to develop SpermatogenesisOnline 1.0 can be easily extended to other biological processes.
format Online
Article
Text
id pubmed-3531227
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35312272013-01-03 SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining Zhang, Yuanwei Zhong, Liangwen Xu, Bo Yang, Yifan Ban, Rongjun Zhu, Jun Cooke, Howard J. Hao, QiaoMei Shi, Qinghua Nucleic Acids Res Articles Human infertility affects 10–15% of couples, half of which is attributed to the male partner. Abnormal spermatogenesis is a major cause of male infertility. Characterizing the genes involved in spermatogenesis is fundamental to understand the mechanisms underlying this biological process and in developing treatments for male infertility. Although many genes have been implicated in spermatogenesis, no dedicated bioinformatic resource for spermatogenesis is available. We have developed such a database, SpermatogenesisOnline 1.0 (http://mcg.ustc.edu.cn/sdap1/spermgenes/), using manual curation from 30 233 articles published before 1 May 2012. It provides detailed information for 1666 genes reported to participate in spermatogenesis in 37 organisms. Based on the analysis of these genes, we developed an algorithm, Greed AUC Stepwise (GAS) model, which predicted 762 genes to participate in spermatogenesis (GAS probability >0.5) based on genome-wide transcriptional data in Mus musculus testis from the ArrayExpress database. These predicted and experimentally verified genes were annotated, with several identical spermatogenesis-related GO terms being enriched for both classes. Furthermore, protein–protein interaction analysis indicates direct interactions of predicted genes with the experimentally verified ones, which supports the reliability of GAS. The strategy (manual curation and data mining) used to develop SpermatogenesisOnline 1.0 can be easily extended to other biological processes. Oxford University Press 2013-01 2012-11-27 /pmc/articles/PMC3531227/ /pubmed/23193286 http://dx.doi.org/10.1093/nar/gks1186 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Articles
Zhang, Yuanwei
Zhong, Liangwen
Xu, Bo
Yang, Yifan
Ban, Rongjun
Zhu, Jun
Cooke, Howard J.
Hao, QiaoMei
Shi, Qinghua
SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining
title SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining
title_full SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining
title_fullStr SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining
title_full_unstemmed SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining
title_short SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining
title_sort spermatogenesisonline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3531227/
https://www.ncbi.nlm.nih.gov/pubmed/23193286
http://dx.doi.org/10.1093/nar/gks1186
work_keys_str_mv AT zhangyuanwei spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining
AT zhongliangwen spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining
AT xubo spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining
AT yangyifan spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining
AT banrongjun spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining
AT zhujun spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining
AT cookehowardj spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining
AT haoqiaomei spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining
AT shiqinghua spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining