Cargando…

An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis

The emergence of antibiotic resistance in bacteria limits the availability of antibiotic choices for treatment and infection control, thereby representing a major threat to human health. The de novo mutation of bacterial genomes is an essential mechanism by which bacteria acquire antibiotic resistan...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yu, Zhang, Cong, Huo, Wenwen, Wang, Xinlei, Zhang, Michael, Palmer, Kelli, Chen, Min
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9888353/
https://www.ncbi.nlm.nih.gov/pubmed/36744155
http://dx.doi.org/10.1007/s42995-022-00144-z
_version_ 1784880515867738112
author Zhang, Yu
Zhang, Cong
Huo, Wenwen
Wang, Xinlei
Zhang, Michael
Palmer, Kelli
Chen, Min
author_facet Zhang, Yu
Zhang, Cong
Huo, Wenwen
Wang, Xinlei
Zhang, Michael
Palmer, Kelli
Chen, Min
author_sort Zhang, Yu
collection PubMed
description The emergence of antibiotic resistance in bacteria limits the availability of antibiotic choices for treatment and infection control, thereby representing a major threat to human health. The de novo mutation of bacterial genomes is an essential mechanism by which bacteria acquire antibiotic resistance. Previously, deletion mutations within bacterial immune systems, ranging from dozens to thousands of base pairs (bps) in length, have been associated with the spread of antibiotic resistance. Most current methods for evaluating genomic structural variations (SVs) have concentrated on detecting them, rather than estimating the proportions of populations that carry distinct SVs. A better understanding of the distribution of mutations and subpopulations dynamics in bacterial populations is needed to appreciate antibiotic resistance evolution and movement of resistance genes through populations. Here, we propose a statistical model to estimate the proportions of genomic deletions in a mixed population based on Expectation–Maximization (EM) algorithms and next-generation sequencing (NGS) data. The method integrates both insert size and split-read mapping information to iteratively update estimated distributions. The proposed method was evaluated with three simulations that demonstrated the production of accurate estimations. The proposed method was then applied to investigate the horizontal transfers of antibiotic resistance genes in concert with changes in the CRISPR-Cas system of E. faecalis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s42995-022-00144-z.
format Online
Article
Text
id pubmed-9888353
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-98883532023-02-01 An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis Zhang, Yu Zhang, Cong Huo, Wenwen Wang, Xinlei Zhang, Michael Palmer, Kelli Chen, Min Mar Life Sci Technol Research Paper The emergence of antibiotic resistance in bacteria limits the availability of antibiotic choices for treatment and infection control, thereby representing a major threat to human health. The de novo mutation of bacterial genomes is an essential mechanism by which bacteria acquire antibiotic resistance. Previously, deletion mutations within bacterial immune systems, ranging from dozens to thousands of base pairs (bps) in length, have been associated with the spread of antibiotic resistance. Most current methods for evaluating genomic structural variations (SVs) have concentrated on detecting them, rather than estimating the proportions of populations that carry distinct SVs. A better understanding of the distribution of mutations and subpopulations dynamics in bacterial populations is needed to appreciate antibiotic resistance evolution and movement of resistance genes through populations. Here, we propose a statistical model to estimate the proportions of genomic deletions in a mixed population based on Expectation–Maximization (EM) algorithms and next-generation sequencing (NGS) data. The method integrates both insert size and split-read mapping information to iteratively update estimated distributions. The proposed method was evaluated with three simulations that demonstrated the production of accurate estimations. The proposed method was then applied to investigate the horizontal transfers of antibiotic resistance genes in concert with changes in the CRISPR-Cas system of E. faecalis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s42995-022-00144-z. Springer Nature Singapore 2023-01-31 /pmc/articles/PMC9888353/ /pubmed/36744155 http://dx.doi.org/10.1007/s42995-022-00144-z Text en © Ocean University of China 2023, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
spellingShingle Research Paper
Zhang, Yu
Zhang, Cong
Huo, Wenwen
Wang, Xinlei
Zhang, Michael
Palmer, Kelli
Chen, Min
An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis
title An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis
title_full An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis
title_fullStr An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis
title_full_unstemmed An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis
title_short An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis
title_sort expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in enterococcus faecalis
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9888353/
https://www.ncbi.nlm.nih.gov/pubmed/36744155
http://dx.doi.org/10.1007/s42995-022-00144-z
work_keys_str_mv AT zhangyu anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT zhangcong anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT huowenwen anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT wangxinlei anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT zhangmichael anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT palmerkelli anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT chenmin anexpectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT zhangyu expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT zhangcong expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT huowenwen expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT wangxinlei expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT zhangmichael expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT palmerkelli expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis
AT chenmin expectationmaximizationalgorithmforestimatingproportionsofdeletionsamongbacterialpopulationswithapplicationtostudyantibioticresistancegenetransferinenterococcusfaecalis