Cargando…

High-coverage sequencing and annotated assemblies of the budgerigar genome

BACKGROUND: Parrots belong to a group of behaviorally advanced vertebrates and have an advanced ability of vocal learning relative to other vocal-learning birds. They can imitate human speech, synchronize their body movements to a rhythmic beat, and understand complex concepts of referential meaning...

Descripción completa

Detalles Bibliográficos
Autores principales: Ganapathy, Ganeshkumar, Howard, Jason T, Ward, James M, Li, Jianwen, Li, Bo, Li, Yingrui, Xiong, Yingqi, Zhang, Yong, Zhou, Shiguo, Schwartz, David C, Schatz, Michael, Aboukhalil, Robert, Fedrigo, Olivier, Bukovnik, Lisa, Wang, Ty, Wray, Greg, Rasolonjatovo, Isabelle, Winer, Roger, Knight, James R, Koren, Sergey, Warren, Wesley C, Zhang, Guojie, Phillippy, Adam M, Jarvis, Erich D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4109783/
https://www.ncbi.nlm.nih.gov/pubmed/25061512
http://dx.doi.org/10.1186/2047-217X-3-11
_version_ 1782327910853509120
author Ganapathy, Ganeshkumar
Howard, Jason T
Ward, James M
Li, Jianwen
Li, Bo
Li, Yingrui
Xiong, Yingqi
Zhang, Yong
Zhou, Shiguo
Schwartz, David C
Schatz, Michael
Aboukhalil, Robert
Fedrigo, Olivier
Bukovnik, Lisa
Wang, Ty
Wray, Greg
Rasolonjatovo, Isabelle
Winer, Roger
Knight, James R
Koren, Sergey
Warren, Wesley C
Zhang, Guojie
Phillippy, Adam M
Jarvis, Erich D
author_facet Ganapathy, Ganeshkumar
Howard, Jason T
Ward, James M
Li, Jianwen
Li, Bo
Li, Yingrui
Xiong, Yingqi
Zhang, Yong
Zhou, Shiguo
Schwartz, David C
Schatz, Michael
Aboukhalil, Robert
Fedrigo, Olivier
Bukovnik, Lisa
Wang, Ty
Wray, Greg
Rasolonjatovo, Isabelle
Winer, Roger
Knight, James R
Koren, Sergey
Warren, Wesley C
Zhang, Guojie
Phillippy, Adam M
Jarvis, Erich D
author_sort Ganapathy, Ganeshkumar
collection PubMed
description BACKGROUND: Parrots belong to a group of behaviorally advanced vertebrates and have an advanced ability of vocal learning relative to other vocal-learning birds. They can imitate human speech, synchronize their body movements to a rhythmic beat, and understand complex concepts of referential meaning to sounds. However, little is known about the genetics of these traits. Elucidating the genetic bases would require whole genome sequencing and a robust assembly of a parrot genome. FINDINGS: We present a genomic resource for the budgerigar, an Australian Parakeet (Melopsittacus undulatus) -- the most widely studied parrot species in neuroscience and behavior. We present genomic sequence data that includes over 300× raw read coverage from multiple sequencing technologies and chromosome optical maps from a single male animal. The reads and optical maps were used to create three hybrid assemblies representing some of the largest genomic scaffolds to date for a bird; two of which were annotated based on similarities to reference sets of non-redundant human, zebra finch and chicken proteins, and budgerigar transcriptome sequence assemblies. The sequence reads for this project were in part generated and used for both the Assemblathon 2 competition and the first de novo assembly of a giga-scale vertebrate genome utilizing PacBio single-molecule sequencing. CONCLUSIONS: Across several quality metrics, these budgerigar assemblies are comparable to or better than the chicken and zebra finch genome assemblies built from traditional Sanger sequencing reads, and are sufficient to analyze regions that are difficult to sequence and assemble, including those not yet assembled in prior bird genomes, and promoter regions of genes differentially regulated in vocal learning brain regions. This work provides valuable data and material for genome technology development and for investigating the genomics of complex behavioral traits.
format Online
Article
Text
id pubmed-4109783
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41097832014-07-25 High-coverage sequencing and annotated assemblies of the budgerigar genome Ganapathy, Ganeshkumar Howard, Jason T Ward, James M Li, Jianwen Li, Bo Li, Yingrui Xiong, Yingqi Zhang, Yong Zhou, Shiguo Schwartz, David C Schatz, Michael Aboukhalil, Robert Fedrigo, Olivier Bukovnik, Lisa Wang, Ty Wray, Greg Rasolonjatovo, Isabelle Winer, Roger Knight, James R Koren, Sergey Warren, Wesley C Zhang, Guojie Phillippy, Adam M Jarvis, Erich D Gigascience Data Note BACKGROUND: Parrots belong to a group of behaviorally advanced vertebrates and have an advanced ability of vocal learning relative to other vocal-learning birds. They can imitate human speech, synchronize their body movements to a rhythmic beat, and understand complex concepts of referential meaning to sounds. However, little is known about the genetics of these traits. Elucidating the genetic bases would require whole genome sequencing and a robust assembly of a parrot genome. FINDINGS: We present a genomic resource for the budgerigar, an Australian Parakeet (Melopsittacus undulatus) -- the most widely studied parrot species in neuroscience and behavior. We present genomic sequence data that includes over 300× raw read coverage from multiple sequencing technologies and chromosome optical maps from a single male animal. The reads and optical maps were used to create three hybrid assemblies representing some of the largest genomic scaffolds to date for a bird; two of which were annotated based on similarities to reference sets of non-redundant human, zebra finch and chicken proteins, and budgerigar transcriptome sequence assemblies. The sequence reads for this project were in part generated and used for both the Assemblathon 2 competition and the first de novo assembly of a giga-scale vertebrate genome utilizing PacBio single-molecule sequencing. CONCLUSIONS: Across several quality metrics, these budgerigar assemblies are comparable to or better than the chicken and zebra finch genome assemblies built from traditional Sanger sequencing reads, and are sufficient to analyze regions that are difficult to sequence and assemble, including those not yet assembled in prior bird genomes, and promoter regions of genes differentially regulated in vocal learning brain regions. This work provides valuable data and material for genome technology development and for investigating the genomics of complex behavioral traits. BioMed Central 2014-07-08 /pmc/articles/PMC4109783/ /pubmed/25061512 http://dx.doi.org/10.1186/2047-217X-3-11 Text en Copyright © 2014 Ganapathy et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Data Note
Ganapathy, Ganeshkumar
Howard, Jason T
Ward, James M
Li, Jianwen
Li, Bo
Li, Yingrui
Xiong, Yingqi
Zhang, Yong
Zhou, Shiguo
Schwartz, David C
Schatz, Michael
Aboukhalil, Robert
Fedrigo, Olivier
Bukovnik, Lisa
Wang, Ty
Wray, Greg
Rasolonjatovo, Isabelle
Winer, Roger
Knight, James R
Koren, Sergey
Warren, Wesley C
Zhang, Guojie
Phillippy, Adam M
Jarvis, Erich D
High-coverage sequencing and annotated assemblies of the budgerigar genome
title High-coverage sequencing and annotated assemblies of the budgerigar genome
title_full High-coverage sequencing and annotated assemblies of the budgerigar genome
title_fullStr High-coverage sequencing and annotated assemblies of the budgerigar genome
title_full_unstemmed High-coverage sequencing and annotated assemblies of the budgerigar genome
title_short High-coverage sequencing and annotated assemblies of the budgerigar genome
title_sort high-coverage sequencing and annotated assemblies of the budgerigar genome
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4109783/
https://www.ncbi.nlm.nih.gov/pubmed/25061512
http://dx.doi.org/10.1186/2047-217X-3-11
work_keys_str_mv AT ganapathyganeshkumar highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT howardjasont highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT wardjamesm highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT lijianwen highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT libo highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT liyingrui highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT xiongyingqi highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT zhangyong highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT zhoushiguo highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT schwartzdavidc highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT schatzmichael highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT aboukhalilrobert highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT fedrigoolivier highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT bukovniklisa highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT wangty highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT wraygreg highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT rasolonjatovoisabelle highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT winerroger highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT knightjamesr highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT korensergey highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT warrenwesleyc highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT zhangguojie highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT phillippyadamm highcoveragesequencingandannotatedassembliesofthebudgerigargenome
AT jarviserichd highcoveragesequencingandannotatedassembliesofthebudgerigargenome