Cargando…

SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks

Data availability is a consistent bottleneck for the development of bacterial species-specific promoter prediction software. In this work we leverage genome-wide promoter datasets generated with dRNA-seq in the Gram-negative bacteria Pseudomonas aeruginosa and Salmonella enterica for promoter predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Coppens, Lucas, Wicke, Laura, Lavigne, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9478156/
https://www.ncbi.nlm.nih.gov/pubmed/36147675
http://dx.doi.org/10.1016/j.csbj.2022.09.006
_version_ 1784790505273425920
author Coppens, Lucas
Wicke, Laura
Lavigne, Rob
author_facet Coppens, Lucas
Wicke, Laura
Lavigne, Rob
author_sort Coppens, Lucas
collection PubMed
description Data availability is a consistent bottleneck for the development of bacterial species-specific promoter prediction software. In this work we leverage genome-wide promoter datasets generated with dRNA-seq in the Gram-negative bacteria Pseudomonas aeruginosa and Salmonella enterica for promoter prediction. Convolutional neural networks are presented as an optimal architecture for model training and are further modified and tailored for promoter prediction. The resulting predictors reach high binary accuracies (95% and 94.9%) on test sets and outperform each other when predicting promoters in their associated species. SAPPHIRE.CNN is available online and can also be downloaded to run locally. Our results indicate a dependency of binary promoter classification on an organism’s GC content and a decreased performance of our classifiers on genera they were not trained for, further supporting the need for dedicated, species-specific promoter classification tools.
format Online
Article
Text
id pubmed-9478156
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-94781562022-09-21 SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks Coppens, Lucas Wicke, Laura Lavigne, Rob Comput Struct Biotechnol J Short Communication Data availability is a consistent bottleneck for the development of bacterial species-specific promoter prediction software. In this work we leverage genome-wide promoter datasets generated with dRNA-seq in the Gram-negative bacteria Pseudomonas aeruginosa and Salmonella enterica for promoter prediction. Convolutional neural networks are presented as an optimal architecture for model training and are further modified and tailored for promoter prediction. The resulting predictors reach high binary accuracies (95% and 94.9%) on test sets and outperform each other when predicting promoters in their associated species. SAPPHIRE.CNN is available online and can also be downloaded to run locally. Our results indicate a dependency of binary promoter classification on an organism’s GC content and a decreased performance of our classifiers on genera they were not trained for, further supporting the need for dedicated, species-specific promoter classification tools. Research Network of Computational and Structural Biotechnology 2022-09-09 /pmc/articles/PMC9478156/ /pubmed/36147675 http://dx.doi.org/10.1016/j.csbj.2022.09.006 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Short Communication
Coppens, Lucas
Wicke, Laura
Lavigne, Rob
SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks
title SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks
title_full SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks
title_fullStr SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks
title_full_unstemmed SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks
title_short SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks
title_sort sapphire.cnn: implementation of drna-seq-driven, species-specific promoter prediction using convolutional neural networks
topic Short Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9478156/
https://www.ncbi.nlm.nih.gov/pubmed/36147675
http://dx.doi.org/10.1016/j.csbj.2022.09.006
work_keys_str_mv AT coppenslucas sapphirecnnimplementationofdrnaseqdrivenspeciesspecificpromoterpredictionusingconvolutionalneuralnetworks
AT wickelaura sapphirecnnimplementationofdrnaseqdrivenspeciesspecificpromoterpredictionusingconvolutionalneuralnetworks
AT lavignerob sapphirecnnimplementationofdrnaseqdrivenspeciesspecificpromoterpredictionusingconvolutionalneuralnetworks