Cargando…
SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining
Human infertility affects 10–15% of couples, half of which is attributed to the male partner. Abnormal spermatogenesis is a major cause of male infertility. Characterizing the genes involved in spermatogenesis is fundamental to understand the mechanisms underlying this biological process and in deve...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3531227/ https://www.ncbi.nlm.nih.gov/pubmed/23193286 http://dx.doi.org/10.1093/nar/gks1186 |
_version_ | 1782254139233796096 |
---|---|
author | Zhang, Yuanwei Zhong, Liangwen Xu, Bo Yang, Yifan Ban, Rongjun Zhu, Jun Cooke, Howard J. Hao, QiaoMei Shi, Qinghua |
author_facet | Zhang, Yuanwei Zhong, Liangwen Xu, Bo Yang, Yifan Ban, Rongjun Zhu, Jun Cooke, Howard J. Hao, QiaoMei Shi, Qinghua |
author_sort | Zhang, Yuanwei |
collection | PubMed |
description | Human infertility affects 10–15% of couples, half of which is attributed to the male partner. Abnormal spermatogenesis is a major cause of male infertility. Characterizing the genes involved in spermatogenesis is fundamental to understand the mechanisms underlying this biological process and in developing treatments for male infertility. Although many genes have been implicated in spermatogenesis, no dedicated bioinformatic resource for spermatogenesis is available. We have developed such a database, SpermatogenesisOnline 1.0 (http://mcg.ustc.edu.cn/sdap1/spermgenes/), using manual curation from 30 233 articles published before 1 May 2012. It provides detailed information for 1666 genes reported to participate in spermatogenesis in 37 organisms. Based on the analysis of these genes, we developed an algorithm, Greed AUC Stepwise (GAS) model, which predicted 762 genes to participate in spermatogenesis (GAS probability >0.5) based on genome-wide transcriptional data in Mus musculus testis from the ArrayExpress database. These predicted and experimentally verified genes were annotated, with several identical spermatogenesis-related GO terms being enriched for both classes. Furthermore, protein–protein interaction analysis indicates direct interactions of predicted genes with the experimentally verified ones, which supports the reliability of GAS. The strategy (manual curation and data mining) used to develop SpermatogenesisOnline 1.0 can be easily extended to other biological processes. |
format | Online Article Text |
id | pubmed-3531227 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-35312272013-01-03 SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining Zhang, Yuanwei Zhong, Liangwen Xu, Bo Yang, Yifan Ban, Rongjun Zhu, Jun Cooke, Howard J. Hao, QiaoMei Shi, Qinghua Nucleic Acids Res Articles Human infertility affects 10–15% of couples, half of which is attributed to the male partner. Abnormal spermatogenesis is a major cause of male infertility. Characterizing the genes involved in spermatogenesis is fundamental to understand the mechanisms underlying this biological process and in developing treatments for male infertility. Although many genes have been implicated in spermatogenesis, no dedicated bioinformatic resource for spermatogenesis is available. We have developed such a database, SpermatogenesisOnline 1.0 (http://mcg.ustc.edu.cn/sdap1/spermgenes/), using manual curation from 30 233 articles published before 1 May 2012. It provides detailed information for 1666 genes reported to participate in spermatogenesis in 37 organisms. Based on the analysis of these genes, we developed an algorithm, Greed AUC Stepwise (GAS) model, which predicted 762 genes to participate in spermatogenesis (GAS probability >0.5) based on genome-wide transcriptional data in Mus musculus testis from the ArrayExpress database. These predicted and experimentally verified genes were annotated, with several identical spermatogenesis-related GO terms being enriched for both classes. Furthermore, protein–protein interaction analysis indicates direct interactions of predicted genes with the experimentally verified ones, which supports the reliability of GAS. The strategy (manual curation and data mining) used to develop SpermatogenesisOnline 1.0 can be easily extended to other biological processes. Oxford University Press 2013-01 2012-11-27 /pmc/articles/PMC3531227/ /pubmed/23193286 http://dx.doi.org/10.1093/nar/gks1186 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com. |
spellingShingle | Articles Zhang, Yuanwei Zhong, Liangwen Xu, Bo Yang, Yifan Ban, Rongjun Zhu, Jun Cooke, Howard J. Hao, QiaoMei Shi, Qinghua SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining |
title | SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining |
title_full | SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining |
title_fullStr | SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining |
title_full_unstemmed | SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining |
title_short | SpermatogenesisOnline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining |
title_sort | spermatogenesisonline 1.0: a resource for spermatogenesis based on manual literature curation and genome-wide data mining |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3531227/ https://www.ncbi.nlm.nih.gov/pubmed/23193286 http://dx.doi.org/10.1093/nar/gks1186 |
work_keys_str_mv | AT zhangyuanwei spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining AT zhongliangwen spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining AT xubo spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining AT yangyifan spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining AT banrongjun spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining AT zhujun spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining AT cookehowardj spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining AT haoqiaomei spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining AT shiqinghua spermatogenesisonline10aresourceforspermatogenesisbasedonmanualliteraturecurationandgenomewidedatamining |