Cargando…
SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks
Data availability is a consistent bottleneck for the development of bacterial species-specific promoter prediction software. In this work we leverage genome-wide promoter datasets generated with dRNA-seq in the Gram-negative bacteria Pseudomonas aeruginosa and Salmonella enterica for promoter predic...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9478156/ https://www.ncbi.nlm.nih.gov/pubmed/36147675 http://dx.doi.org/10.1016/j.csbj.2022.09.006 |
_version_ | 1784790505273425920 |
---|---|
author | Coppens, Lucas Wicke, Laura Lavigne, Rob |
author_facet | Coppens, Lucas Wicke, Laura Lavigne, Rob |
author_sort | Coppens, Lucas |
collection | PubMed |
description | Data availability is a consistent bottleneck for the development of bacterial species-specific promoter prediction software. In this work we leverage genome-wide promoter datasets generated with dRNA-seq in the Gram-negative bacteria Pseudomonas aeruginosa and Salmonella enterica for promoter prediction. Convolutional neural networks are presented as an optimal architecture for model training and are further modified and tailored for promoter prediction. The resulting predictors reach high binary accuracies (95% and 94.9%) on test sets and outperform each other when predicting promoters in their associated species. SAPPHIRE.CNN is available online and can also be downloaded to run locally. Our results indicate a dependency of binary promoter classification on an organism’s GC content and a decreased performance of our classifiers on genera they were not trained for, further supporting the need for dedicated, species-specific promoter classification tools. |
format | Online Article Text |
id | pubmed-9478156 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-94781562022-09-21 SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks Coppens, Lucas Wicke, Laura Lavigne, Rob Comput Struct Biotechnol J Short Communication Data availability is a consistent bottleneck for the development of bacterial species-specific promoter prediction software. In this work we leverage genome-wide promoter datasets generated with dRNA-seq in the Gram-negative bacteria Pseudomonas aeruginosa and Salmonella enterica for promoter prediction. Convolutional neural networks are presented as an optimal architecture for model training and are further modified and tailored for promoter prediction. The resulting predictors reach high binary accuracies (95% and 94.9%) on test sets and outperform each other when predicting promoters in their associated species. SAPPHIRE.CNN is available online and can also be downloaded to run locally. Our results indicate a dependency of binary promoter classification on an organism’s GC content and a decreased performance of our classifiers on genera they were not trained for, further supporting the need for dedicated, species-specific promoter classification tools. Research Network of Computational and Structural Biotechnology 2022-09-09 /pmc/articles/PMC9478156/ /pubmed/36147675 http://dx.doi.org/10.1016/j.csbj.2022.09.006 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Short Communication Coppens, Lucas Wicke, Laura Lavigne, Rob SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks |
title | SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks |
title_full | SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks |
title_fullStr | SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks |
title_full_unstemmed | SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks |
title_short | SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks |
title_sort | sapphire.cnn: implementation of drna-seq-driven, species-specific promoter prediction using convolutional neural networks |
topic | Short Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9478156/ https://www.ncbi.nlm.nih.gov/pubmed/36147675 http://dx.doi.org/10.1016/j.csbj.2022.09.006 |
work_keys_str_mv | AT coppenslucas sapphirecnnimplementationofdrnaseqdrivenspeciesspecificpromoterpredictionusingconvolutionalneuralnetworks AT wickelaura sapphirecnnimplementationofdrnaseqdrivenspeciesspecificpromoterpredictionusingconvolutionalneuralnetworks AT lavignerob sapphirecnnimplementationofdrnaseqdrivenspeciesspecificpromoterpredictionusingconvolutionalneuralnetworks |