Cargando…
Probabilistic models of biological enzymatic polymerization
In this study, hierarchies of probabilistic models are evaluated for their ability to characterize the untemplated addition of adenine and uracil to the 3’ ends of mitochondrial mRNAs of the human pathogen Trypanosoma brucei, and for their generative abilities to reproduce populations of these untem...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7787436/ https://www.ncbi.nlm.nih.gov/pubmed/33406128 http://dx.doi.org/10.1371/journal.pone.0244858 |
_version_ | 1783632823392927744 |
---|---|
author | Hampton, Marshall Galey, Miranda Smoniewski, Clara Zimmer, Sara L. |
author_facet | Hampton, Marshall Galey, Miranda Smoniewski, Clara Zimmer, Sara L. |
author_sort | Hampton, Marshall |
collection | PubMed |
description | In this study, hierarchies of probabilistic models are evaluated for their ability to characterize the untemplated addition of adenine and uracil to the 3’ ends of mitochondrial mRNAs of the human pathogen Trypanosoma brucei, and for their generative abilities to reproduce populations of these untemplated adenine/uridine “tails”. We determined the most ideal Hidden Markov Models (HMMs) for this biological system. While our HMMs were not able to generatively reproduce the length distribution of the tails, they fared better in reproducing nucleotide composition aspects of the tail populations. The HMMs robustly identified distinct states of nucleotide addition that correlate to experimentally verified tail nucleotide composition differences. However they also identified a surprising subclass of tails among the ND1 gene transcript populations that is unexpected given the current idea of sequential enzymatic action of untemplated tail addition in this system. Therefore, these models can not only be utilized to reflect biological states that we already know about, they can also identify hypotheses to be experimentally tested. Finally, our HMMs supplied a way to correct a portion of the sequencing errors present in our data. Importantly, these models constitute rare simple pedagogical examples of applied bioinformatic HMMs, due to their binary emissions. |
format | Online Article Text |
id | pubmed-7787436 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-77874362021-01-14 Probabilistic models of biological enzymatic polymerization Hampton, Marshall Galey, Miranda Smoniewski, Clara Zimmer, Sara L. PLoS One Research Article In this study, hierarchies of probabilistic models are evaluated for their ability to characterize the untemplated addition of adenine and uracil to the 3’ ends of mitochondrial mRNAs of the human pathogen Trypanosoma brucei, and for their generative abilities to reproduce populations of these untemplated adenine/uridine “tails”. We determined the most ideal Hidden Markov Models (HMMs) for this biological system. While our HMMs were not able to generatively reproduce the length distribution of the tails, they fared better in reproducing nucleotide composition aspects of the tail populations. The HMMs robustly identified distinct states of nucleotide addition that correlate to experimentally verified tail nucleotide composition differences. However they also identified a surprising subclass of tails among the ND1 gene transcript populations that is unexpected given the current idea of sequential enzymatic action of untemplated tail addition in this system. Therefore, these models can not only be utilized to reflect biological states that we already know about, they can also identify hypotheses to be experimentally tested. Finally, our HMMs supplied a way to correct a portion of the sequencing errors present in our data. Importantly, these models constitute rare simple pedagogical examples of applied bioinformatic HMMs, due to their binary emissions. Public Library of Science 2021-01-06 /pmc/articles/PMC7787436/ /pubmed/33406128 http://dx.doi.org/10.1371/journal.pone.0244858 Text en © 2021 Hampton et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Hampton, Marshall Galey, Miranda Smoniewski, Clara Zimmer, Sara L. Probabilistic models of biological enzymatic polymerization |
title | Probabilistic models of biological enzymatic polymerization |
title_full | Probabilistic models of biological enzymatic polymerization |
title_fullStr | Probabilistic models of biological enzymatic polymerization |
title_full_unstemmed | Probabilistic models of biological enzymatic polymerization |
title_short | Probabilistic models of biological enzymatic polymerization |
title_sort | probabilistic models of biological enzymatic polymerization |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7787436/ https://www.ncbi.nlm.nih.gov/pubmed/33406128 http://dx.doi.org/10.1371/journal.pone.0244858 |
work_keys_str_mv | AT hamptonmarshall probabilisticmodelsofbiologicalenzymaticpolymerization AT galeymiranda probabilisticmodelsofbiologicalenzymaticpolymerization AT smoniewskiclara probabilisticmodelsofbiologicalenzymaticpolymerization AT zimmersaral probabilisticmodelsofbiologicalenzymaticpolymerization |