Cargando…

Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

BACKGROUND: Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from pub...

Descripción completa

Detalles Bibliográficos
Autores principales: Gorodkin, Jan, Cirera, Susanna, Hedegaard, Jakob, Gilchrist, Michael J, Panitz, Frank, Jørgensen, Claus, Scheibye-Knudsen, Karsten, Arvin, Troels, Lumholdt, Steen, Sawera, Milena, Green, Trine, Nielsen, Bente J, Havgaard, Jakob H, Rosenkilde, Carina, Wang, Jun, Li, Heng, Li, Ruiqiang, Liu, Bin, Hu, Songnian, Dong, Wei, Li, Wei, Yu, Jun, Wang, Jian, Stærfeldt, Hans-Henrik, Wernersson, Rasmus, Madsen, Lone B, Thomsen, Bo, Hornshøj, Henrik, Bujie, Zhan, Wang, Xuegang, Wang, Xuefei, Bolund, Lars, Brunak, Søren, Yang, Huanming, Bendixen, Christian, Fredholm, Merete
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1895994/
https://www.ncbi.nlm.nih.gov/pubmed/17407547
http://dx.doi.org/10.1186/gb-2007-8-4-r45
_version_ 1782133907303432192
author Gorodkin, Jan
Cirera, Susanna
Hedegaard, Jakob
Gilchrist, Michael J
Panitz, Frank
Jørgensen, Claus
Scheibye-Knudsen, Karsten
Arvin, Troels
Lumholdt, Steen
Sawera, Milena
Green, Trine
Nielsen, Bente J
Havgaard, Jakob H
Rosenkilde, Carina
Wang, Jun
Li, Heng
Li, Ruiqiang
Liu, Bin
Hu, Songnian
Dong, Wei
Li, Wei
Yu, Jun
Wang, Jian
Stærfeldt, Hans-Henrik
Wernersson, Rasmus
Madsen, Lone B
Thomsen, Bo
Hornshøj, Henrik
Bujie, Zhan
Wang, Xuegang
Wang, Xuefei
Bolund, Lars
Brunak, Søren
Yang, Huanming
Bendixen, Christian
Fredholm, Merete
author_facet Gorodkin, Jan
Cirera, Susanna
Hedegaard, Jakob
Gilchrist, Michael J
Panitz, Frank
Jørgensen, Claus
Scheibye-Knudsen, Karsten
Arvin, Troels
Lumholdt, Steen
Sawera, Milena
Green, Trine
Nielsen, Bente J
Havgaard, Jakob H
Rosenkilde, Carina
Wang, Jun
Li, Heng
Li, Ruiqiang
Liu, Bin
Hu, Songnian
Dong, Wei
Li, Wei
Yu, Jun
Wang, Jian
Stærfeldt, Hans-Henrik
Wernersson, Rasmus
Madsen, Lone B
Thomsen, Bo
Hornshøj, Henrik
Bujie, Zhan
Wang, Xuegang
Wang, Xuefei
Bolund, Lars
Brunak, Søren
Yang, Huanming
Bendixen, Christian
Fredholm, Merete
author_sort Gorodkin, Jan
collection PubMed
description BACKGROUND: Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. RESULTS: Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. CONCLUSION: This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies.
format Text
id pubmed-1895994
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18959942007-06-22 Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags Gorodkin, Jan Cirera, Susanna Hedegaard, Jakob Gilchrist, Michael J Panitz, Frank Jørgensen, Claus Scheibye-Knudsen, Karsten Arvin, Troels Lumholdt, Steen Sawera, Milena Green, Trine Nielsen, Bente J Havgaard, Jakob H Rosenkilde, Carina Wang, Jun Li, Heng Li, Ruiqiang Liu, Bin Hu, Songnian Dong, Wei Li, Wei Yu, Jun Wang, Jian Stærfeldt, Hans-Henrik Wernersson, Rasmus Madsen, Lone B Thomsen, Bo Hornshøj, Henrik Bujie, Zhan Wang, Xuegang Wang, Xuefei Bolund, Lars Brunak, Søren Yang, Huanming Bendixen, Christian Fredholm, Merete Genome Biol Research BACKGROUND: Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. RESULTS: Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. CONCLUSION: This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies. BioMed Central 2007 2007-04-02 /pmc/articles/PMC1895994/ /pubmed/17407547 http://dx.doi.org/10.1186/gb-2007-8-4-r45 Text en Copyright © 2007 Gorodkin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Gorodkin, Jan
Cirera, Susanna
Hedegaard, Jakob
Gilchrist, Michael J
Panitz, Frank
Jørgensen, Claus
Scheibye-Knudsen, Karsten
Arvin, Troels
Lumholdt, Steen
Sawera, Milena
Green, Trine
Nielsen, Bente J
Havgaard, Jakob H
Rosenkilde, Carina
Wang, Jun
Li, Heng
Li, Ruiqiang
Liu, Bin
Hu, Songnian
Dong, Wei
Li, Wei
Yu, Jun
Wang, Jian
Stærfeldt, Hans-Henrik
Wernersson, Rasmus
Madsen, Lone B
Thomsen, Bo
Hornshøj, Henrik
Bujie, Zhan
Wang, Xuegang
Wang, Xuefei
Bolund, Lars
Brunak, Søren
Yang, Huanming
Bendixen, Christian
Fredholm, Merete
Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags
title Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags
title_full Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags
title_fullStr Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags
title_full_unstemmed Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags
title_short Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags
title_sort porcine transcriptome analysis based on 97 non-normalized cdna libraries and assembly of 1,021,891 expressed sequence tags
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1895994/
https://www.ncbi.nlm.nih.gov/pubmed/17407547
http://dx.doi.org/10.1186/gb-2007-8-4-r45
work_keys_str_mv AT gorodkinjan porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT cirerasusanna porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT hedegaardjakob porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT gilchristmichaelj porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT panitzfrank porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT jørgensenclaus porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT scheibyeknudsenkarsten porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT arvintroels porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT lumholdtsteen porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT saweramilena porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT greentrine porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT nielsenbentej porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT havgaardjakobh porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT rosenkildecarina porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT wangjun porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT liheng porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT liruiqiang porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT liubin porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT husongnian porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT dongwei porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT liwei porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT yujun porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT wangjian porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT stærfeldthanshenrik porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT wernerssonrasmus porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT madsenloneb porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT thomsenbo porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT hornshøjhenrik porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT bujiezhan porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT wangxuegang porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT wangxuefei porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT bolundlars porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT brunaksøren porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT yanghuanming porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT bendixenchristian porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags
AT fredholmmerete porcinetranscriptomeanalysisbasedon97nonnormalizedcdnalibrariesandassemblyof1021891expressedsequencetags