Cargando…

Structural and functional-annotation of an equine whole genome oligoarray

BACKGROUND: The horse genome is sequenced, allowing equine researchers to use high-throughput functional genomics platforms such as microarrays; next-generation sequencing for gene expression and proteomics. However, for researchers to derive value from these functional genomics datasets, they must...

Descripción completa

Detalles Bibliográficos
Autores principales: Bright, Lauren A, Burgess, Shane C, Chowdhary, Bhanu, Swiderski, Cyprianna E, McCarthy, Fiona M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226197/
https://www.ncbi.nlm.nih.gov/pubmed/19811692
http://dx.doi.org/10.1186/1471-2105-10-S11-S8
_version_ 1782217580515164160
author Bright, Lauren A
Burgess, Shane C
Chowdhary, Bhanu
Swiderski, Cyprianna E
McCarthy, Fiona M
author_facet Bright, Lauren A
Burgess, Shane C
Chowdhary, Bhanu
Swiderski, Cyprianna E
McCarthy, Fiona M
author_sort Bright, Lauren A
collection PubMed
description BACKGROUND: The horse genome is sequenced, allowing equine researchers to use high-throughput functional genomics platforms such as microarrays; next-generation sequencing for gene expression and proteomics. However, for researchers to derive value from these functional genomics datasets, they must be able to model this data in biologically relevant ways; to do so requires that the equine genome be more fully annotated. There are two interrelated types of genomic annotation: structural and functional. Structural annotation is delineating and demarcating the genomic elements (such as genes, promoters, and regulatory elements). Functional annotation is assigning function to structural elements. The Gene Ontology (GO) is the de facto standard for functional annotation, and is routinely used as a basis for modelling and hypothesis testing, large functional genomics datasets. RESULTS: An Equine Whole Genome Oligonucleotide (EWGO) array with 21,351 elements was developed at Texas A&M University. This 70-mer oligoarray was designed using the approximately 7× assembled and annotated sequence of the equine genome to be one of the most comprehensive arrays available for expressed equine sequences. To assist researchers in determining the biological meaning of data derived from this array, we have structurally annotated it by mapping the elements to multiple database accessions, including UniProtKB, Entrez Gene, NRPD (Non-Redundant Protein Database) and UniGene. We next provided GO functional annotations for the gene transcripts represented on this array. Overall, we GO annotated 14,531 gene products (68.1% of the gene products represented on the EWGO array) with 57,912 annotations. GAQ (GO Annotation Quality) scores were calculated for this array both before and after we added GO annotation. The additional annotations improved the meanGAQ score 16-fold. This data is publicly available at AgBase http://www.agbase.msstate.edu/. CONCLUSION: Providing additional information about the public databases which link to the gene products represented on the array allows users more flexibility when using gene expression modelling and hypothesis-testing computational tools. Moreover, since different databases provide different types of information, users have access to multiple data sources. In addition, our GO annotation underpins functional modelling for most gene expression analysis tools and enables equine researchers to model large lists of differentially expressed transcripts in biologically relevant ways.
format Online
Article
Text
id pubmed-3226197
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32261972011-11-30 Structural and functional-annotation of an equine whole genome oligoarray Bright, Lauren A Burgess, Shane C Chowdhary, Bhanu Swiderski, Cyprianna E McCarthy, Fiona M BMC Bioinformatics Proceedings BACKGROUND: The horse genome is sequenced, allowing equine researchers to use high-throughput functional genomics platforms such as microarrays; next-generation sequencing for gene expression and proteomics. However, for researchers to derive value from these functional genomics datasets, they must be able to model this data in biologically relevant ways; to do so requires that the equine genome be more fully annotated. There are two interrelated types of genomic annotation: structural and functional. Structural annotation is delineating and demarcating the genomic elements (such as genes, promoters, and regulatory elements). Functional annotation is assigning function to structural elements. The Gene Ontology (GO) is the de facto standard for functional annotation, and is routinely used as a basis for modelling and hypothesis testing, large functional genomics datasets. RESULTS: An Equine Whole Genome Oligonucleotide (EWGO) array with 21,351 elements was developed at Texas A&M University. This 70-mer oligoarray was designed using the approximately 7× assembled and annotated sequence of the equine genome to be one of the most comprehensive arrays available for expressed equine sequences. To assist researchers in determining the biological meaning of data derived from this array, we have structurally annotated it by mapping the elements to multiple database accessions, including UniProtKB, Entrez Gene, NRPD (Non-Redundant Protein Database) and UniGene. We next provided GO functional annotations for the gene transcripts represented on this array. Overall, we GO annotated 14,531 gene products (68.1% of the gene products represented on the EWGO array) with 57,912 annotations. GAQ (GO Annotation Quality) scores were calculated for this array both before and after we added GO annotation. The additional annotations improved the meanGAQ score 16-fold. This data is publicly available at AgBase http://www.agbase.msstate.edu/. CONCLUSION: Providing additional information about the public databases which link to the gene products represented on the array allows users more flexibility when using gene expression modelling and hypothesis-testing computational tools. Moreover, since different databases provide different types of information, users have access to multiple data sources. In addition, our GO annotation underpins functional modelling for most gene expression analysis tools and enables equine researchers to model large lists of differentially expressed transcripts in biologically relevant ways. BioMed Central 2009-10-08 /pmc/articles/PMC3226197/ /pubmed/19811692 http://dx.doi.org/10.1186/1471-2105-10-S11-S8 Text en Copyright ©2009 Bright et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Bright, Lauren A
Burgess, Shane C
Chowdhary, Bhanu
Swiderski, Cyprianna E
McCarthy, Fiona M
Structural and functional-annotation of an equine whole genome oligoarray
title Structural and functional-annotation of an equine whole genome oligoarray
title_full Structural and functional-annotation of an equine whole genome oligoarray
title_fullStr Structural and functional-annotation of an equine whole genome oligoarray
title_full_unstemmed Structural and functional-annotation of an equine whole genome oligoarray
title_short Structural and functional-annotation of an equine whole genome oligoarray
title_sort structural and functional-annotation of an equine whole genome oligoarray
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226197/
https://www.ncbi.nlm.nih.gov/pubmed/19811692
http://dx.doi.org/10.1186/1471-2105-10-S11-S8
work_keys_str_mv AT brightlaurena structuralandfunctionalannotationofanequinewholegenomeoligoarray
AT burgessshanec structuralandfunctionalannotationofanequinewholegenomeoligoarray
AT chowdharybhanu structuralandfunctionalannotationofanequinewholegenomeoligoarray
AT swiderskicypriannae structuralandfunctionalannotationofanequinewholegenomeoligoarray
AT mccarthyfionam structuralandfunctionalannotationofanequinewholegenomeoligoarray