Cargando…
Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly
BACKGROUND: Aedes aegypti is the principal vector of yellow fever and dengue viruses throughout the tropical world. To provide a set of manually curated and annotated sequences from the Ae. aegypti genome, 14 mapped bacterial artificial chromosome (BAC) clones encompassing 1.57 Mb were sequenced, as...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1929151/ https://www.ncbi.nlm.nih.gov/pubmed/17519023 http://dx.doi.org/10.1186/gb-2007-8-5-r88 |
_version_ | 1782134271167692800 |
---|---|
author | Lobo, Neil F Campbell, Kathy S Thaner, Daniel deBruyn, Becky Koo, Hean Gelbart, William M Loftus, Brendan J Severson, David W Collins, Frank H |
author_facet | Lobo, Neil F Campbell, Kathy S Thaner, Daniel deBruyn, Becky Koo, Hean Gelbart, William M Loftus, Brendan J Severson, David W Collins, Frank H |
author_sort | Lobo, Neil F |
collection | PubMed |
description | BACKGROUND: Aedes aegypti is the principal vector of yellow fever and dengue viruses throughout the tropical world. To provide a set of manually curated and annotated sequences from the Ae. aegypti genome, 14 mapped bacterial artificial chromosome (BAC) clones encompassing 1.57 Mb were sequenced, assembled and manually annotated using a combination of computational gene-finding, expressed sequence tag (EST) matches and comparative protein homology. PCR and sequencing were used to experimentally confirm expression and sequence of a subset of these transcripts. RESULTS: Of the 51 manual annotations, 50 and 43 demonstrated a high level of similarity to Anopheles gambiae and Drosophila melanogaster genes, respectively. Ten of the 12 BAC sequences with more than one annotated gene exhibited synteny with the A. gambiae genome. Putative transcripts from eight BAC clones were found in multiple copies (two copies in most cases) in the Aedes genome assembly, which point to the probable presence of haplotype polymorphisms and/or misassemblies. CONCLUSION: This study not only provides a benchmark set of manually annotated transcripts for this genome that can be used to assess the quality of the auto-annotation pipeline and the assembly, but it also looks at the effect of a high repeat content on the genome assembly and annotation pipeline. |
format | Text |
id | pubmed-1929151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19291512007-07-21 Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly Lobo, Neil F Campbell, Kathy S Thaner, Daniel deBruyn, Becky Koo, Hean Gelbart, William M Loftus, Brendan J Severson, David W Collins, Frank H Genome Biol Research BACKGROUND: Aedes aegypti is the principal vector of yellow fever and dengue viruses throughout the tropical world. To provide a set of manually curated and annotated sequences from the Ae. aegypti genome, 14 mapped bacterial artificial chromosome (BAC) clones encompassing 1.57 Mb were sequenced, assembled and manually annotated using a combination of computational gene-finding, expressed sequence tag (EST) matches and comparative protein homology. PCR and sequencing were used to experimentally confirm expression and sequence of a subset of these transcripts. RESULTS: Of the 51 manual annotations, 50 and 43 demonstrated a high level of similarity to Anopheles gambiae and Drosophila melanogaster genes, respectively. Ten of the 12 BAC sequences with more than one annotated gene exhibited synteny with the A. gambiae genome. Putative transcripts from eight BAC clones were found in multiple copies (two copies in most cases) in the Aedes genome assembly, which point to the probable presence of haplotype polymorphisms and/or misassemblies. CONCLUSION: This study not only provides a benchmark set of manually annotated transcripts for this genome that can be used to assess the quality of the auto-annotation pipeline and the assembly, but it also looks at the effect of a high repeat content on the genome assembly and annotation pipeline. BioMed Central 2007 2007-05-22 /pmc/articles/PMC1929151/ /pubmed/17519023 http://dx.doi.org/10.1186/gb-2007-8-5-r88 Text en Copyright © 2007 Lobo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Lobo, Neil F Campbell, Kathy S Thaner, Daniel deBruyn, Becky Koo, Hean Gelbart, William M Loftus, Brendan J Severson, David W Collins, Frank H Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly |
title | Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly |
title_full | Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly |
title_fullStr | Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly |
title_full_unstemmed | Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly |
title_short | Analysis of 14 BAC sequences from the Aedes aegypti genome: a benchmark for genome annotation and assembly |
title_sort | analysis of 14 bac sequences from the aedes aegypti genome: a benchmark for genome annotation and assembly |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1929151/ https://www.ncbi.nlm.nih.gov/pubmed/17519023 http://dx.doi.org/10.1186/gb-2007-8-5-r88 |
work_keys_str_mv | AT loboneilf analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly AT campbellkathys analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly AT thanerdaniel analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly AT debruynbecky analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly AT koohean analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly AT gelbartwilliamm analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly AT loftusbrendanj analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly AT seversondavidw analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly AT collinsfrankh analysisof14bacsequencesfromtheaedesaegyptigenomeabenchmarkforgenomeannotationandassembly |