Cargando…

Probabilistic models of biological enzymatic polymerization

In this study, hierarchies of probabilistic models are evaluated for their ability to characterize the untemplated addition of adenine and uracil to the 3’ ends of mitochondrial mRNAs of the human pathogen Trypanosoma brucei, and for their generative abilities to reproduce populations of these untem...

Descripción completa

Detalles Bibliográficos
Autores principales: Hampton, Marshall, Galey, Miranda, Smoniewski, Clara, Zimmer, Sara L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7787436/
https://www.ncbi.nlm.nih.gov/pubmed/33406128
http://dx.doi.org/10.1371/journal.pone.0244858
_version_ 1783632823392927744
author Hampton, Marshall
Galey, Miranda
Smoniewski, Clara
Zimmer, Sara L.
author_facet Hampton, Marshall
Galey, Miranda
Smoniewski, Clara
Zimmer, Sara L.
author_sort Hampton, Marshall
collection PubMed
description In this study, hierarchies of probabilistic models are evaluated for their ability to characterize the untemplated addition of adenine and uracil to the 3’ ends of mitochondrial mRNAs of the human pathogen Trypanosoma brucei, and for their generative abilities to reproduce populations of these untemplated adenine/uridine “tails”. We determined the most ideal Hidden Markov Models (HMMs) for this biological system. While our HMMs were not able to generatively reproduce the length distribution of the tails, they fared better in reproducing nucleotide composition aspects of the tail populations. The HMMs robustly identified distinct states of nucleotide addition that correlate to experimentally verified tail nucleotide composition differences. However they also identified a surprising subclass of tails among the ND1 gene transcript populations that is unexpected given the current idea of sequential enzymatic action of untemplated tail addition in this system. Therefore, these models can not only be utilized to reflect biological states that we already know about, they can also identify hypotheses to be experimentally tested. Finally, our HMMs supplied a way to correct a portion of the sequencing errors present in our data. Importantly, these models constitute rare simple pedagogical examples of applied bioinformatic HMMs, due to their binary emissions.
format Online
Article
Text
id pubmed-7787436
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-77874362021-01-14 Probabilistic models of biological enzymatic polymerization Hampton, Marshall Galey, Miranda Smoniewski, Clara Zimmer, Sara L. PLoS One Research Article In this study, hierarchies of probabilistic models are evaluated for their ability to characterize the untemplated addition of adenine and uracil to the 3’ ends of mitochondrial mRNAs of the human pathogen Trypanosoma brucei, and for their generative abilities to reproduce populations of these untemplated adenine/uridine “tails”. We determined the most ideal Hidden Markov Models (HMMs) for this biological system. While our HMMs were not able to generatively reproduce the length distribution of the tails, they fared better in reproducing nucleotide composition aspects of the tail populations. The HMMs robustly identified distinct states of nucleotide addition that correlate to experimentally verified tail nucleotide composition differences. However they also identified a surprising subclass of tails among the ND1 gene transcript populations that is unexpected given the current idea of sequential enzymatic action of untemplated tail addition in this system. Therefore, these models can not only be utilized to reflect biological states that we already know about, they can also identify hypotheses to be experimentally tested. Finally, our HMMs supplied a way to correct a portion of the sequencing errors present in our data. Importantly, these models constitute rare simple pedagogical examples of applied bioinformatic HMMs, due to their binary emissions. Public Library of Science 2021-01-06 /pmc/articles/PMC7787436/ /pubmed/33406128 http://dx.doi.org/10.1371/journal.pone.0244858 Text en © 2021 Hampton et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hampton, Marshall
Galey, Miranda
Smoniewski, Clara
Zimmer, Sara L.
Probabilistic models of biological enzymatic polymerization
title Probabilistic models of biological enzymatic polymerization
title_full Probabilistic models of biological enzymatic polymerization
title_fullStr Probabilistic models of biological enzymatic polymerization
title_full_unstemmed Probabilistic models of biological enzymatic polymerization
title_short Probabilistic models of biological enzymatic polymerization
title_sort probabilistic models of biological enzymatic polymerization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7787436/
https://www.ncbi.nlm.nih.gov/pubmed/33406128
http://dx.doi.org/10.1371/journal.pone.0244858
work_keys_str_mv AT hamptonmarshall probabilisticmodelsofbiologicalenzymaticpolymerization
AT galeymiranda probabilisticmodelsofbiologicalenzymaticpolymerization
AT smoniewskiclara probabilisticmodelsofbiologicalenzymaticpolymerization
AT zimmersaral probabilisticmodelsofbiologicalenzymaticpolymerization