Cargando…
Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks
Analysis of eukaryotic genomes requires the detection and classification of transposable elements (TEs), a crucial but complex and time-consuming task. To improve the performance of tools that accomplish these tasks, Machine Learning approaches (ML) that leverage computer resources, such as GPUs (Gr...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10513252/ https://www.ncbi.nlm.nih.gov/pubmed/37733731 http://dx.doi.org/10.1371/journal.pone.0291925 |
_version_ | 1785108527286583296 |
---|---|
author | Orozco-Arias, Simon Lopez-Murillo, Luis Humberto Piña, Johan S. Valencia-Castrillon, Estiven Tabares-Soto, Reinel Castillo-Ossa, Luis Isaza, Gustavo Guyot, Romain |
author_facet | Orozco-Arias, Simon Lopez-Murillo, Luis Humberto Piña, Johan S. Valencia-Castrillon, Estiven Tabares-Soto, Reinel Castillo-Ossa, Luis Isaza, Gustavo Guyot, Romain |
author_sort | Orozco-Arias, Simon |
collection | PubMed |
description | Analysis of eukaryotic genomes requires the detection and classification of transposable elements (TEs), a crucial but complex and time-consuming task. To improve the performance of tools that accomplish these tasks, Machine Learning approaches (ML) that leverage computer resources, such as GPUs (Graphical Processing Unit) and multiple CPU (Central Processing Unit) cores, have been adopted. However, until now, the use of ML techniques has mostly been limited to classification of TEs. Herein, a detection-classification strategy (named YORO) based on convolutional neural networks is adapted from computer vision (YOLO) to genomics. This approach enables the detection of genomic objects through the prediction of the position, length, and classification in large DNA sequences such as fully sequenced genomes. As a proof of concept, the internal protein-coding domains of LTR-retrotransposons are used to train the proposed neural network. Precision, recall, accuracy, F1-score, execution times and time ratios, as well as several graphical representations were used as metrics to measure performance. These promising results open the door for a new generation of Deep Learning tools for genomics. YORO architecture is available at https://github.com/simonorozcoarias/YORO. |
format | Online Article Text |
id | pubmed-10513252 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-105132522023-09-22 Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks Orozco-Arias, Simon Lopez-Murillo, Luis Humberto Piña, Johan S. Valencia-Castrillon, Estiven Tabares-Soto, Reinel Castillo-Ossa, Luis Isaza, Gustavo Guyot, Romain PLoS One Research Article Analysis of eukaryotic genomes requires the detection and classification of transposable elements (TEs), a crucial but complex and time-consuming task. To improve the performance of tools that accomplish these tasks, Machine Learning approaches (ML) that leverage computer resources, such as GPUs (Graphical Processing Unit) and multiple CPU (Central Processing Unit) cores, have been adopted. However, until now, the use of ML techniques has mostly been limited to classification of TEs. Herein, a detection-classification strategy (named YORO) based on convolutional neural networks is adapted from computer vision (YOLO) to genomics. This approach enables the detection of genomic objects through the prediction of the position, length, and classification in large DNA sequences such as fully sequenced genomes. As a proof of concept, the internal protein-coding domains of LTR-retrotransposons are used to train the proposed neural network. Precision, recall, accuracy, F1-score, execution times and time ratios, as well as several graphical representations were used as metrics to measure performance. These promising results open the door for a new generation of Deep Learning tools for genomics. YORO architecture is available at https://github.com/simonorozcoarias/YORO. Public Library of Science 2023-09-21 /pmc/articles/PMC10513252/ /pubmed/37733731 http://dx.doi.org/10.1371/journal.pone.0291925 Text en © 2023 Orozco-Arias et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Orozco-Arias, Simon Lopez-Murillo, Luis Humberto Piña, Johan S. Valencia-Castrillon, Estiven Tabares-Soto, Reinel Castillo-Ossa, Luis Isaza, Gustavo Guyot, Romain Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks |
title | Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks |
title_full | Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks |
title_fullStr | Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks |
title_full_unstemmed | Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks |
title_short | Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks |
title_sort | genomic object detection: an improved approach for transposable elements detection and classification using convolutional neural networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10513252/ https://www.ncbi.nlm.nih.gov/pubmed/37733731 http://dx.doi.org/10.1371/journal.pone.0291925 |
work_keys_str_mv | AT orozcoariassimon genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks AT lopezmurilloluishumberto genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks AT pinajohans genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks AT valenciacastrillonestiven genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks AT tabaressotoreinel genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks AT castilloossaluis genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks AT isazagustavo genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks AT guyotromain genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks |