Cargando…

Progression of the canonical reference malaria parasite genome from 2002–2019

Here we describe the ways in which the sequence and annotation of the Plasmodium falciparum reference genome has changed since its publication in 2002. As the malaria species responsible for the most deaths worldwide, the richness of annotation and accuracy of the sequence are important resources fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Böhme, Ulrike, Otto, Thomas D., Sanders, Mandy, Newbold, Chris I., Berriman, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6484455/
https://www.ncbi.nlm.nih.gov/pubmed/31080894
http://dx.doi.org/10.12688/wellcomeopenres.15194.2
_version_ 1783414118612467712
author Böhme, Ulrike
Otto, Thomas D.
Sanders, Mandy
Newbold, Chris I.
Berriman, Matthew
author_facet Böhme, Ulrike
Otto, Thomas D.
Sanders, Mandy
Newbold, Chris I.
Berriman, Matthew
author_sort Böhme, Ulrike
collection PubMed
description Here we describe the ways in which the sequence and annotation of the Plasmodium falciparum reference genome has changed since its publication in 2002. As the malaria species responsible for the most deaths worldwide, the richness of annotation and accuracy of the sequence are important resources for the P. falciparum research community as well as the basis for interpreting the genomes of subsequently sequenced species. At the time of publication in 2002 over 60% of predicted genes had unknown functions. As of March 2019, this number has been significantly decreased to 33%. The reduction is due to the inclusion of genes that were subsequently characterised experimentally and genes with significant similarity to others with known functions. In addition, the structural annotation of genes has been significantly refined; 27% of gene structures have been changed since 2002, comprising changes in exon-intron boundaries, addition or deletion of exons and the addition or deletion of genes. The sequence has also undergone significant improvements. In addition to the correction of a large number of single-base and insertion or deletion errors, a major miss-assembly between the subtelomeres of chromosome 7 and 8 has been corrected. As the number of sequenced isolates continues to grow rapidly, a single reference genome will not be an adequate basis for interpreting intra-species sequence diversity. We therefore describe in this publication a population reference genome of P. falciparum, called Pfref1. This reference will enable the community to map to regions that are not present in the current assembly. P. falciparum 3D7 will continue to be maintained, with ongoing curation ensuring continual improvements in annotation quality.
format Online
Article
Text
id pubmed-6484455
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-64844552019-05-09 Progression of the canonical reference malaria parasite genome from 2002–2019 Böhme, Ulrike Otto, Thomas D. Sanders, Mandy Newbold, Chris I. Berriman, Matthew Wellcome Open Res Data Note Here we describe the ways in which the sequence and annotation of the Plasmodium falciparum reference genome has changed since its publication in 2002. As the malaria species responsible for the most deaths worldwide, the richness of annotation and accuracy of the sequence are important resources for the P. falciparum research community as well as the basis for interpreting the genomes of subsequently sequenced species. At the time of publication in 2002 over 60% of predicted genes had unknown functions. As of March 2019, this number has been significantly decreased to 33%. The reduction is due to the inclusion of genes that were subsequently characterised experimentally and genes with significant similarity to others with known functions. In addition, the structural annotation of genes has been significantly refined; 27% of gene structures have been changed since 2002, comprising changes in exon-intron boundaries, addition or deletion of exons and the addition or deletion of genes. The sequence has also undergone significant improvements. In addition to the correction of a large number of single-base and insertion or deletion errors, a major miss-assembly between the subtelomeres of chromosome 7 and 8 has been corrected. As the number of sequenced isolates continues to grow rapidly, a single reference genome will not be an adequate basis for interpreting intra-species sequence diversity. We therefore describe in this publication a population reference genome of P. falciparum, called Pfref1. This reference will enable the community to map to regions that are not present in the current assembly. P. falciparum 3D7 will continue to be maintained, with ongoing curation ensuring continual improvements in annotation quality. F1000 Research Limited 2019-05-28 /pmc/articles/PMC6484455/ /pubmed/31080894 http://dx.doi.org/10.12688/wellcomeopenres.15194.2 Text en Copyright: © 2019 Böhme U et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Böhme, Ulrike
Otto, Thomas D.
Sanders, Mandy
Newbold, Chris I.
Berriman, Matthew
Progression of the canonical reference malaria parasite genome from 2002–2019
title Progression of the canonical reference malaria parasite genome from 2002–2019
title_full Progression of the canonical reference malaria parasite genome from 2002–2019
title_fullStr Progression of the canonical reference malaria parasite genome from 2002–2019
title_full_unstemmed Progression of the canonical reference malaria parasite genome from 2002–2019
title_short Progression of the canonical reference malaria parasite genome from 2002–2019
title_sort progression of the canonical reference malaria parasite genome from 2002–2019
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6484455/
https://www.ncbi.nlm.nih.gov/pubmed/31080894
http://dx.doi.org/10.12688/wellcomeopenres.15194.2
work_keys_str_mv AT bohmeulrike progressionofthecanonicalreferencemalariaparasitegenomefrom20022019
AT ottothomasd progressionofthecanonicalreferencemalariaparasitegenomefrom20022019
AT sandersmandy progressionofthecanonicalreferencemalariaparasitegenomefrom20022019
AT newboldchrisi progressionofthecanonicalreferencemalariaparasitegenomefrom20022019
AT berrimanmatthew progressionofthecanonicalreferencemalariaparasitegenomefrom20022019