Cargando…
An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis
The emergence of antibiotic resistance in bacteria limits the availability of antibiotic choices for treatment and infection control, thereby representing a major threat to human health. The de novo mutation of bacterial genomes is an essential mechanism by which bacteria acquire antibiotic resistan...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Nature Singapore
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9888353/ https://www.ncbi.nlm.nih.gov/pubmed/36744155 http://dx.doi.org/10.1007/s42995-022-00144-z |
_version_ | 1784880515867738112 |
---|---|
author | Zhang, Yu Zhang, Cong Huo, Wenwen Wang, Xinlei Zhang, Michael Palmer, Kelli Chen, Min |
author_facet | Zhang, Yu Zhang, Cong Huo, Wenwen Wang, Xinlei Zhang, Michael Palmer, Kelli Chen, Min |
author_sort | Zhang, Yu |
collection | PubMed |
description | The emergence of antibiotic resistance in bacteria limits the availability of antibiotic choices for treatment and infection control, thereby representing a major threat to human health. The de novo mutation of bacterial genomes is an essential mechanism by which bacteria acquire antibiotic resistance. Previously, deletion mutations within bacterial immune systems, ranging from dozens to thousands of base pairs (bps) in length, have been associated with the spread of antibiotic resistance. Most current methods for evaluating genomic structural variations (SVs) have concentrated on detecting them, rather than estimating the proportions of populations that carry distinct SVs. A better understanding of the distribution of mutations and subpopulations dynamics in bacterial populations is needed to appreciate antibiotic resistance evolution and movement of resistance genes through populations. Here, we propose a statistical model to estimate the proportions of genomic deletions in a mixed population based on Expectation–Maximization (EM) algorithms and next-generation sequencing (NGS) data. The method integrates both insert size and split-read mapping information to iteratively update estimated distributions. The proposed method was evaluated with three simulations that demonstrated the production of accurate estimations. The proposed method was then applied to investigate the horizontal transfers of antibiotic resistance genes in concert with changes in the CRISPR-Cas system of E. faecalis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s42995-022-00144-z. |
format | Online Article Text |
id | pubmed-9888353 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer Nature Singapore |
record_format | MEDLINE/PubMed |
spelling | pubmed-98883532023-02-01 An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis Zhang, Yu Zhang, Cong Huo, Wenwen Wang, Xinlei Zhang, Michael Palmer, Kelli Chen, Min Mar Life Sci Technol Research Paper The emergence of antibiotic resistance in bacteria limits the availability of antibiotic choices for treatment and infection control, thereby representing a major threat to human health. The de novo mutation of bacterial genomes is an essential mechanism by which bacteria acquire antibiotic resistance. Previously, deletion mutations within bacterial immune systems, ranging from dozens to thousands of base pairs (bps) in length, have been associated with the spread of antibiotic resistance. Most current methods for evaluating genomic structural variations (SVs) have concentrated on detecting them, rather than estimating the proportions of populations that carry distinct SVs. A better understanding of the distribution of mutations and subpopulations dynamics in bacterial populations is needed to appreciate antibiotic resistance evolution and movement of resistance genes through populations. Here, we propose a statistical model to estimate the proportions of genomic deletions in a mixed population based on Expectation–Maximization (EM) algorithms and next-generation sequencing (NGS) data. The method integrates both insert size and split-read mapping information to iteratively update estimated distributions. The proposed method was evaluated with three simulations that demonstrated the production of accurate estimations. The proposed method was then applied to investigate the horizontal transfers of antibiotic resistance genes in concert with changes in the CRISPR-Cas system of E. faecalis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s42995-022-00144-z. Springer Nature Singapore 2023-01-31 /pmc/articles/PMC9888353/ /pubmed/36744155 http://dx.doi.org/10.1007/s42995-022-00144-z Text en © Ocean University of China 2023, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. |
spellingShingle | Research Paper Zhang, Yu Zhang, Cong Huo, Wenwen Wang, Xinlei Zhang, Michael Palmer, Kelli Chen, Min An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis |
title | An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis |
title_full | An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis |
title_fullStr | An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis |
title_full_unstemmed | An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis |
title_short | An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis |
title_sort | expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in enterococcus faecalis |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9888353/ https://www.ncbi.nlm.nih.gov/pubmed/36744155 http://dx.doi.org/10.1007/s42995-022-00144-z |
work_keys_str_mv | AT zhangyu anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT zhangcong anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT huowenwen anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT wangxinlei anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT zhangmichael anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT palmerkelli anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT chenmin anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT zhangyu expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT zhangcong expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT huowenwen expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT wangxinlei expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT zhangmichael expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT palmerkelli expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis AT chenmin expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis |