High-density rhesus macaque oligonucleotide microarray design using early-stage rhesus genome sequence information and human genome annotations
BACKGROUND: Until recently, few genomic reagents specific for non-human primate research have been available. To address this need, we have constructed a macaque-specific high-density oligonucleotide microarray by using highly fragmented low-pass sequence contigs from the rhesus genome project toget...
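Main authors: | Wallace, James C; Korth, Marcus J; Paeper, Bryan; Proll, Sean C; Thomas, Matthew J; Magness, Charles L; Iadonato, Shawn P; Nelson, Charles; Katze, Michael G |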
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central, 2007 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1790710/ https://www.ncbi.nlm.nih.gov/pubmed/17244361 http://dx.doi.org/10.1186/1471-2164-8-28 |
_version_ | 1782132125020979200 |
---|---|
author | Wallace, James C; Korth, Marcus J; Paeper, Bryan; Proll, Sean C; Thomas, Matthew J; Magness, Charles L; Iadonato, Shawn P; Nelson, Charles; Katze, Michael G |
author_sort | Wallace, James C |
collection | PubMed |
description | BACKGROUND: Until recently, few genomic reagents specific for non-human primate research have been available. To address this need, we have constructed a macaque-specific high-density oligonucleotide microarray by using highly fragmented low-pass sequence contigs from the rhesus genome project together with the detailed sequence and exon structure of the human genome. Using this method, we designed oligonucleotide probes to over 17,000 distinct rhesus/human gene orthologs and increased by four-fold the number of available genes relative to our first-generation expressed sequence tag (EST)-derived array. RESULTS: We constructed a database containing 248,000 exon sequences from 23,000 human RefSeq genes and compared each human exon with its best matching sequence in the January 2005 version of the rhesus genome project list of 486,000 DNA contigs. Best matching rhesus exon sequences for each of the 23,000 human genes were then concatenated in the proper order and orientation to produce a rhesus "virtual transcriptome." Microarray probes were designed, one per gene, to the region closest to the 3' untranslated region (UTR) of each rhesus virtual transcript. Each probe was compared to a composite rhesus/human transcript database to test for cross-hybridization potential yielding a final probe set representing 18,296 rhesus/human gene orthologs, including transcript variants, and over 17,000 distinct genes. We hybridized mRNA from rhesus brain and spleen to both the EST- and genome-derived microarrays. Besides four-fold greater gene coverage, the genome-derived array also showed greater mean signal intensities for genes present on both arrays. Genome-derived probes showed 99.4% identity when compared to 4,767 rhesus GenBank sequence tag site (STS) sequences indicating that early stage low-pass versions of complex genomes are of sufficient quality to yield valuable functional genomic information when combined with finished genome information from a closely related species. CONCLUSION: The number of different genes represented on microarrays for unfinished genomes can be greatly increased by matching known gene transcript annotations from a closely related species with sequence data from the unfinished genome. Signal intensity on both EST- and genome-derived arrays was highly correlated with probe distance from the 3' UTR, information often missing from ESTs yet present in early-stage genome projects. |
format | Text |
id | pubmed-1790710 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1790710 2007-02-02 BMC Genomics Research Article BioMed Central 2007-01-23 /pmc/articles/PMC1790710/ /pubmed/17244361 http://dx.doi.org/10.1186/1471-2164-8-28 Text en Copyright © 2007 Wallace et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | High-density rhesus macaque oligonucleotide microarray design using early-stage rhesus genome sequence information and human genome annotations |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1790710/ https://www.ncbi.nlm.nih.gov/pubmed/17244361 http://dx.doi.org/10.1186/1471-2164-8-28 |
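
The abstract describes concatenating the best-matching rhesus exon sequences "in the proper order and orientation" to form a virtual transcript, then designing one probe per gene to the region closest to the 3' UTR. The following is only a minimal sketch of that idea, not the authors' pipeline. The `ExonHit` record, the function names, and the default probe length of 60 bases are all hypothetical, and taking the last bases of the transcript is a simplification of real probe selection.

```python
from dataclasses import dataclass

# Translation table for complementing DNA bases (N stays N).
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


@dataclass(frozen=True)
class ExonHit:
    """Hypothetical record for one human exon's best rhesus match."""
    exon_index: int  # exon position in the human transcript (0 = most 5')
    sequence: str    # best-matching rhesus contig sequence for that exon
    strand: str      # "+" if the hit lies on the transcript strand, else "-"


def build_virtual_transcript(hits):
    """Concatenate exon hits in transcript order, fixing orientation."""
    ordered = sorted(hits, key=lambda h: h.exon_index)
    return "".join(
        h.sequence if h.strand == "+" else reverse_complement(h.sequence)
        for h in ordered
    )


def three_prime_probe(transcript: str, probe_length: int = 60) -> str:
    """Take the 3'-most bases as a candidate probe (assumed length)."""
    return transcript[-probe_length:]


# Toy input: two exons supplied out of order, one on the minus strand.
hits = [ExonHit(1, "GGCC", "+"), ExonHit(0, "AAAC", "-")]
transcript = build_virtual_transcript(hits)
probe = three_prime_probe(transcript, probe_length=5)
```

A real design would also screen each candidate probe against a transcript database for cross-hybridization, as the abstract notes; that step is omitted here.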