Cargando…
De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences
BACKGROUND: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4403674/ https://www.ncbi.nlm.nih.gov/pubmed/25897398 http://dx.doi.org/10.1186/s13742-015-0061-x |
_version_ | 1782367361578303488 |
---|---|
author | Maudhoo, Mnirnal D Madison, Jacob D Norgren, Robert B |
author_facet | Maudhoo, Mnirnal D Madison, Jacob D Norgren, Robert B |
author_sort | Maudhoo, Mnirnal D |
collection | PubMed |
description | BACKGROUND: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, “Clint”, to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species. FINDINGS: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database. CONCLUSIONS: We have provided a high quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represented a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13742-015-0061-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4403674 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44036742015-04-21 De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences Maudhoo, Mnirnal D Madison, Jacob D Norgren, Robert B Gigascience Data Note BACKGROUND: Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan troglodytes, “Clint”, to better annotate the chimpanzee genome and provide empirical validation for proposed gene models of this important species. FINDINGS: RNA was extracted from primary cells cultured from four tissues: skin, adipose stroma, vascular smooth muscle and skeletal muscle. These four RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly (TSA) database. CONCLUSIONS: We have provided a high quality annotation of 44,275 transcripts with full-length coding sequence (CDS). This set represented a total of 10,110 unique genes, thus providing empirical support for their existence. This dataset can be used to improve the annotation of the Pan troglodytes genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13742-015-0061-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-04-18 /pmc/articles/PMC4403674/ /pubmed/25897398 http://dx.doi.org/10.1186/s13742-015-0061-x Text en © Maudhoo et al.; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Data Note Maudhoo, Mnirnal D Madison, Jacob D Norgren, Robert B De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences |
title | De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences |
title_full | De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences |
title_fullStr | De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences |
title_full_unstemmed | De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences |
title_short | De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences |
title_sort | de novo assembly of the chimpanzee transcriptome from nextgen mrna sequences |
topic | Data Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4403674/ https://www.ncbi.nlm.nih.gov/pubmed/25897398 http://dx.doi.org/10.1186/s13742-015-0061-x |
work_keys_str_mv | AT maudhoomnirnald denovoassemblyofthechimpanzeetranscriptomefromnextgenmrnasequences AT madisonjacobd denovoassemblyofthechimpanzeetranscriptomefromnextgenmrnasequences AT norgrenrobertb denovoassemblyofthechimpanzeetranscriptomefromnextgenmrnasequences |