Cargando…

Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay

To date, most reference genomes represent a mosaic consensus sequence in which the homologous chromosomes are collapsed into one sequence. This approach produces sequence artefacts and impedes analyses of allele-specific mechanisms. Here, we report an allele-specific genome assembly of the diploid p...

Descripción completa

Detalles Bibliográficos
Autores principales: Cosentino, Raúl O, Brink, Benedikt G, Siegel, T Nicolai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445201/
https://www.ncbi.nlm.nih.gov/pubmed/34541528
http://dx.doi.org/10.1093/nargab/lqab082
_version_ 1784568600568266752
author Cosentino, Raúl O
Brink, Benedikt G
Siegel, T Nicolai
author_facet Cosentino, Raúl O
Brink, Benedikt G
Siegel, T Nicolai
author_sort Cosentino, Raúl O
collection PubMed
description To date, most reference genomes represent a mosaic consensus sequence in which the homologous chromosomes are collapsed into one sequence. This approach produces sequence artefacts and impedes analyses of allele-specific mechanisms. Here, we report an allele-specific genome assembly of the diploid parasite Trypanosoma brucei and reveal allelic variants affecting gene expression. Using long-read sequencing and chromosome conformation capture data, we could assign 99.5% of all heterozygote variants to a specific homologous chromosome and build a 66 Mb long allele-specific genome assembly. The phasing of haplotypes allowed us to resolve hundreds of artefacts present in the previous mosaic consensus assembly. In addition, it revealed allelic recombination events, visible as regions of low allelic heterozygosity, enabling the lineage tracing of T. brucei isolates. Interestingly, analyses of transcriptome and translatome data of genes with allele-specific premature termination codons point to the absence of a nonsense-mediated decay mechanism in trypanosomes. Taken together, this study delivers a reference quality allele-specific genome assembly of T. brucei and demonstrates the importance of such assemblies for the study of gene expression control. We expect the new genome assembly will increase the awareness of allele-specific phenomena and provide a platform to investigate them.
format Online
Article
Text
id pubmed-8445201
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84452012021-09-17 Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay Cosentino, Raúl O Brink, Benedikt G Siegel, T Nicolai NAR Genom Bioinform High Throughput Sequencing Methods To date, most reference genomes represent a mosaic consensus sequence in which the homologous chromosomes are collapsed into one sequence. This approach produces sequence artefacts and impedes analyses of allele-specific mechanisms. Here, we report an allele-specific genome assembly of the diploid parasite Trypanosoma brucei and reveal allelic variants affecting gene expression. Using long-read sequencing and chromosome conformation capture data, we could assign 99.5% of all heterozygote variants to a specific homologous chromosome and build a 66 Mb long allele-specific genome assembly. The phasing of haplotypes allowed us to resolve hundreds of artefacts present in the previous mosaic consensus assembly. In addition, it revealed allelic recombination events, visible as regions of low allelic heterozygosity, enabling the lineage tracing of T. brucei isolates. Interestingly, analyses of transcriptome and translatome data of genes with allele-specific premature termination codons point to the absence of a nonsense-mediated decay mechanism in trypanosomes. Taken together, this study delivers a reference quality allele-specific genome assembly of T. brucei and demonstrates the importance of such assemblies for the study of gene expression control. We expect the new genome assembly will increase the awareness of allele-specific phenomena and provide a platform to investigate them. Oxford University Press 2021-09-16 /pmc/articles/PMC8445201/ /pubmed/34541528 http://dx.doi.org/10.1093/nargab/lqab082 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle High Throughput Sequencing Methods
Cosentino, Raúl O
Brink, Benedikt G
Siegel, T Nicolai
Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay
title Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay
title_full Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay
title_fullStr Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay
title_full_unstemmed Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay
title_short Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay
title_sort allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mrna decay
topic High Throughput Sequencing Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8445201/
https://www.ncbi.nlm.nih.gov/pubmed/34541528
http://dx.doi.org/10.1093/nargab/lqab082
work_keys_str_mv AT cosentinoraulo allelespecificassemblyofaeukaryoticgenomecorrectsapparentframeshiftsandrevealsalackofnonsensemediatedmrnadecay
AT brinkbenediktg allelespecificassemblyofaeukaryoticgenomecorrectsapparentframeshiftsandrevealsalackofnonsensemediatedmrnadecay
AT siegeltnicolai allelespecificassemblyofaeukaryoticgenomecorrectsapparentframeshiftsandrevealsalackofnonsensemediatedmrnadecay