Cargando…

De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences

BACKGROUND: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities...

Descripción completa

Detalles Bibliográficos
Autores principales: Maudhoo, Mnirnal D, Madison, Jacob D, Norgren, Robert B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4403674/
https://www.ncbi.nlm.nih.gov/pubmed/25897398
http://dx.doi.org/10.1186/s13742-015-0061-x
_version_ 1782367361578303488
author Maudhoo, Mnirnal D
Madison, Jacob D
Norgren, Robert B
author_facet Maudhoo, Mnirnal D
Madison, Jacob D
Norgren, Robert B
author_sort Maudhoo, Mnirnal D
collection PubMed
description BACKGROUND: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, “Clint”, to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species. FINDINGS: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database. CONCLUSIONS: We have provided a high quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represented a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13742-015-0061-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4403674
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44036742015-04-21 De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences Maudhoo, Mnirnal D Madison, Jacob D Norgren, Robert B Gigascience Data Note BACKGROUND: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, “Clint”, to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species. FINDINGS: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database. CONCLUSIONS: We have provided a high quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represented a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13742-015-0061-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-04-18 /pmc/articles/PMC4403674/ /pubmed/25897398 http://dx.doi.org/10.1186/s13742-015-0061-x Text en © Maudhoo et al.; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Data Note
Maudhoo, Mnirnal D
Madison, Jacob D
Norgren, Robert B
De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences
title De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences
title_full De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences
title_fullStr De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences
title_full_unstemmed De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences
title_short De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences
title_sort de novo assembly of the chimpanzee transcriptome from nextgen mrna sequences
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4403674/
https://www.ncbi.nlm.nih.gov/pubmed/25897398
http://dx.doi.org/10.1186/s13742-015-0061-x
work_keys_str_mv AT maudhoomnirnald denovoassemblyofthechimpanzeetranscriptomefromnextgenmrnasequences
AT madisonjacobd denovoassemblyofthechimpanzeetranscriptomefromnextgenmrnasequences
AT norgrenrobertb denovoassemblyofthechimpanzeetranscriptomefromnextgenmrnasequences