
PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics

BACKGROUND: The peanut (Arachis hypogaea) is an important crop cultivated worldwide for oil production and food sources. Its complex genetic architecture (e.g., the large and tetraploid genome possibly due to unique cross of wild diploid relatives and subsequent chromosome duplication: 2n = 4x = 40,...


Bibliographic Details
Main authors:	Duan, Xiaohong, Schmidt, Emily, Li, Pei, Lenox, Douglas, Liu, Lin, Shu, Changlong, Zhang, Jie, Liang, Chun
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2012
Subjects:	Database
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3444431/ https://www.ncbi.nlm.nih.gov/pubmed/22712730 http://dx.doi.org/10.1186/1471-2229-12-94

_version_	1782243685162811392
author	Duan, Xiaohong; Schmidt, Emily; Li, Pei; Lenox, Douglas; Liu, Lin; Shu, Changlong; Zhang, Jie; Liang, Chun
author_sort	Duan, Xiaohong
collection	PubMed
description	BACKGROUND: The peanut (Arachis hypogaea) is an important crop cultivated worldwide for oil production and food sources. Its complex genetic architecture (e.g., the large and tetraploid genome possibly due to unique cross of wild diploid relatives and subsequent chromosome duplication: 2n = 4x = 40, AABB, 2800 Mb) presents a major challenge for its genome sequencing and makes it a less-studied crop. Without a doubt, transcriptome sequencing is the most effective way to harness the genome structure and gene expression dynamics of this non-model species that has a limited genomic resource. DESCRIPTION: With the development of next generation sequencing technologies such as 454 pyro-sequencing and Illumina sequencing by synthesis, the transcriptomics data of peanut is rapidly accumulated in both the public databases and private sectors. Integrating 187,636 Sanger reads (103,685,419 bases), 1,165,168 Roche 454 reads (333,862,593 bases) and 57,135,995 Illumina reads (4,073,740,115 bases), we generated the first release of our peanut transcriptome assembly that contains 32,619 contigs. We provided EC, KEGG and GO functional annotations to these contigs and detected SSRs, SNPs and other genetic polymorphisms for each contig. Based on both open-source and our in-house tools, PeanutDB presents many seamlessly integrated web interfaces that allow users to search, filter, navigate and visualize easily the whole transcript assembly, its annotations and detected polymorphisms and simple sequence repeats. For each contig, sequence alignment is presented in both bird’s-eye view and nucleotide level resolution, with colorfully highlighted regions of mismatches, indels and repeats that facilitate close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors. CONCLUSION: As a public genomic database that integrates peanut transcriptome data from different sources, PeanutDB (http://bioinfolab.muohio.edu/txid3818v1) provides the Peanut research community with an easy-to-use web portal that will definitely facilitate genomics research and molecular breeding in this less-studied crop.
format	Online Article Text
id	pubmed-3444431
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-3444431 2012-09-18 PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics Duan, Xiaohong; Schmidt, Emily; Li, Pei; Lenox, Douglas; Liu, Lin; Shu, Changlong; Zhang, Jie; Liang, Chun BMC Plant Biol Database BioMed Central 2012-06-19 /pmc/articles/PMC3444431/ /pubmed/22712730 http://dx.doi.org/10.1186/1471-2229-12-94 Text en Copyright ©2012 Duan et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title	PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics
topic	Database
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3444431/ https://www.ncbi.nlm.nih.gov/pubmed/22712730 http://dx.doi.org/10.1186/1471-2229-12-94
