Cargando…

Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly

BACKGROUND: Aedes aegypti is the principal vector of yellow fever and dengue viruses throughout the tropical world. To provide a set of manually curated and annotated sequences from the Ae. aegypti genome, 14 mapped bacterial artificial chromosome (BAC) clones encompassing 1.57 Mb were sequenced, as...

Descripción completa

Detalles Bibliográficos
Autores principales: Lobo, Neil F, Campbell, Kathy S, Thaner, Daniel, deBruyn, Becky, Koo, Hean, Gelbart, William M, Loftus, Brendan J, Severson, David W, Collins, Frank H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1929151/
https://www.ncbi.nlm.nih.gov/pubmed/17519023
http://dx.doi.org/10.1186/gb-2007-8-5-r88
_version_ 1782134271167692800
author Lobo, Neil F
Campbell, Kathy S
Thaner, Daniel
deBruyn, Becky
Koo, Hean
Gelbart, William M
Loftus, Brendan J
Severson, David W
Collins, Frank H
author_facet Lobo, Neil F
Campbell, Kathy S
Thaner, Daniel
deBruyn, Becky
Koo, Hean
Gelbart, William M
Loftus, Brendan J
Severson, David W
Collins, Frank H
author_sort Lobo, Neil F
collection PubMed
description BACKGROUND: Aedes aegypti is the principal vector of yellow fever and dengue viruses throughout the tropical world. To provide a set of manually curated and annotated sequences from the Ae. aegypti genome, 14 mapped bacterial artificial chromosome (BAC) clones encompassing 1.57 Mb were sequenced, assembled and manually annotated using a combination of computational gene-finding, expressed sequence tag (EST) matches and comparative protein homology. PCR and sequencing were used to experimentally confirm expression and sequence of a subset of these transcripts. RESULTS: Of the 51 manual annotations, 50 and 43 demonstrated a high level of similarity to Anopheles gambiae and Drosophila melanogaster genes, respectively. Ten of the 12 BAC sequences with more than one annotated gene exhibited synteny with the A. gambiae genome. Putative transcripts from eight BAC clones were found in multiple copies (two copies in most cases) in the Aedes genome assembly, which point to the probable presence of haplotype polymorphisms and/or misassemblies. CONCLUSION: This study not only provides a benchmark set of manually annotated transcripts for this genome that can be used to assess the quality of the auto-annotation pipeline and the assembly, but it also looks at the effect of a high repeat content on the genome assembly and annotation pipeline.
format Text
id pubmed-1929151
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19291512007-07-21 Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly Lobo, Neil F Campbell, Kathy S Thaner, Daniel deBruyn, Becky Koo, Hean Gelbart, William M Loftus, Brendan J Severson, David W Collins, Frank H Genome Biol Research BACKGROUND: Aedes aegypti is the principal vector of yellow fever and dengue viruses throughout the tropical world. To provide a set of manually curated and annotated sequences from the Ae. aegypti genome, 14 mapped bacterial artificial chromosome (BAC) clones encompassing 1.57 Mb were sequenced, assembled and manually annotated using a combination of computational gene-finding, expressed sequence tag (EST) matches and comparative protein homology. PCR and sequencing were used to experimentally confirm expression and sequence of a subset of these transcripts. RESULTS: Of the 51 manual annotations, 50 and 43 demonstrated a high level of similarity to Anopheles gambiae and Drosophila melanogaster genes, respectively. Ten of the 12 BAC sequences with more than one annotated gene exhibited synteny with the A. gambiae genome. Putative transcripts from eight BAC clones were found in multiple copies (two copies in most cases) in the Aedes genome assembly, which point to the probable presence of haplotype polymorphisms and/or misassemblies. CONCLUSION: This study not only provides a benchmark set of manually annotated transcripts for this genome that can be used to assess the quality of the auto-annotation pipeline and the assembly, but it also looks at the effect of a high repeat content on the genome assembly and annotation pipeline. BioMed Central 2007 2007-05-22 /pmc/articles/PMC1929151/ /pubmed/17519023 http://dx.doi.org/10.1186/gb-2007-8-5-r88 Text en Copyright © 2007 Lobo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Lobo, Neil F
Campbell, Kathy S
Thaner, Daniel
deBruyn, Becky
Koo, Hean
Gelbart, William M
Loftus, Brendan J
Severson, David W
Collins, Frank H
Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly
title Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly
title_full Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly
title_fullStr Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly
title_full_unstemmed Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly
title_short Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly
title_sort analysis of 14 bac sequences from the aedes aegypti genome: a benchmark for genome annotation and assembly
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1929151/
https://www.ncbi.nlm.nih.gov/pubmed/17519023
http://dx.doi.org/10.1186/gb-2007-8-5-r88
work_keys_str_mv AT loboneilf analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly
AT campbellkathys analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly
AT thanerdaniel analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly
AT debruynbecky analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly
AT koohean analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly
AT gelbartwilliamm analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly
AT loftusbrendanj analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly
AT seversondavidw analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly
AT collinsfrankh analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly