Cargando…
Identifying genomic islands with deep neural networks
BACKGROUND: Horizontal gene transfer is the main source of adaptability for bacteria, through which genes are obtained from different sources including bacteria, archaea, viruses, and eukaryotes. This process promotes the rapid spread of genetic information across lineages, typically in the form of...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8170982/ https://www.ncbi.nlm.nih.gov/pubmed/34078279 http://dx.doi.org/10.1186/s12864-021-07575-5 |
_version_ | 1783702347294179328 |
---|---|
author | Assaf, Rida Xia, Fangfang Stevens, Rick |
author_facet | Assaf, Rida Xia, Fangfang Stevens, Rick |
author_sort | Assaf, Rida |
collection | PubMed |
description | BACKGROUND: Horizontal gene transfer is the main source of adaptability for bacteria, through which genes are obtained from different sources including bacteria, archaea, viruses, and eukaryotes. This process promotes the rapid spread of genetic information across lineages, typically in the form of clusters of genes referred to as genomic islands (GIs). Different types of GIs exist, and are often classified by the content of their cargo genes or their means of integration and mobility. While various computational methods have been devised to detect different types of GIs, no single method is capable of detecting all types. RESULTS: We propose a method, which we call Shutter Island, that uses a deep learning model (Inception V3, widely used in computer vision) to detect genomic islands. The intrinsic value of deep learning methods lies in their ability to generalize. Via a technique called transfer learning, the model is pre-trained on a large generic dataset and then re-trained on images that we generate to represent genomic fragments. We demonstrate that this image-based approach generalizes better than the existing tools. CONCLUSIONS: We used a deep neural network and an image-based approach to detect the most out of the correct GI predictions made by other tools, in addition to making novel GI predictions. The fact that the deep neural network was re-trained on only a limited number of GI datasets and then successfully generalized indicates that this approach could be applied to other problems in the field where data is still lacking or hard to curate. |
format | Online Article Text |
id | pubmed-8170982 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-81709822021-06-03 Identifying genomic islands with deep neural networks Assaf, Rida Xia, Fangfang Stevens, Rick BMC Genomics Methodology BACKGROUND: Horizontal gene transfer is the main source of adaptability for bacteria, through which genes are obtained from different sources including bacteria, archaea, viruses, and eukaryotes. This process promotes the rapid spread of genetic information across lineages, typically in the form of clusters of genes referred to as genomic islands (GIs). Different types of GIs exist, and are often classified by the content of their cargo genes or their means of integration and mobility. While various computational methods have been devised to detect different types of GIs, no single method is capable of detecting all types. RESULTS: We propose a method, which we call Shutter Island, that uses a deep learning model (Inception V3, widely used in computer vision) to detect genomic islands. The intrinsic value of deep learning methods lies in their ability to generalize. Via a technique called transfer learning, the model is pre-trained on a large generic dataset and then re-trained on images that we generate to represent genomic fragments. We demonstrate that this image-based approach generalizes better than the existing tools. CONCLUSIONS: We used a deep neural network and an image-based approach to detect the most out of the correct GI predictions made by other tools, in addition to making novel GI predictions. The fact that the deep neural network was re-trained on only a limited number of GI datasets and then successfully generalized indicates that this approach could be applied to other problems in the field where data is still lacking or hard to curate. BioMed Central 2021-06-02 /pmc/articles/PMC8170982/ /pubmed/34078279 http://dx.doi.org/10.1186/s12864-021-07575-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Assaf, Rida Xia, Fangfang Stevens, Rick Identifying genomic islands with deep neural networks |
title | Identifying genomic islands with deep neural networks |
title_full | Identifying genomic islands with deep neural networks |
title_fullStr | Identifying genomic islands with deep neural networks |
title_full_unstemmed | Identifying genomic islands with deep neural networks |
title_short | Identifying genomic islands with deep neural networks |
title_sort | identifying genomic islands with deep neural networks |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8170982/ https://www.ncbi.nlm.nih.gov/pubmed/34078279 http://dx.doi.org/10.1186/s12864-021-07575-5 |
work_keys_str_mv | AT assafrida identifyinggenomicislandswithdeepneuralnetworks AT xiafangfang identifyinggenomicislandswithdeepneuralnetworks AT stevensrick identifyinggenomicislandswithdeepneuralnetworks |