Cargando…

Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks

Analysis of eukaryotic genomes requires the detection and classification of transposable elements (TEs), a crucial but complex and time-consuming task. To improve the performance of tools that accomplish these tasks, Machine Learning approaches (ML) that leverage computer resources, such as GPUs (Gr...

Descripción completa

Detalles Bibliográficos
Autores principales: Orozco-Arias, Simon, Lopez-Murillo, Luis Humberto, Piña, Johan S., Valencia-Castrillon, Estiven, Tabares-Soto, Reinel, Castillo-Ossa, Luis, Isaza, Gustavo, Guyot, Romain
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10513252/
https://www.ncbi.nlm.nih.gov/pubmed/37733731
http://dx.doi.org/10.1371/journal.pone.0291925
_version_ 1785108527286583296
author Orozco-Arias, Simon
Lopez-Murillo, Luis Humberto
Piña, Johan S.
Valencia-Castrillon, Estiven
Tabares-Soto, Reinel
Castillo-Ossa, Luis
Isaza, Gustavo
Guyot, Romain
author_facet Orozco-Arias, Simon
Lopez-Murillo, Luis Humberto
Piña, Johan S.
Valencia-Castrillon, Estiven
Tabares-Soto, Reinel
Castillo-Ossa, Luis
Isaza, Gustavo
Guyot, Romain
author_sort Orozco-Arias, Simon
collection PubMed
description Analysis of eukaryotic genomes requires the detection and classification of transposable elements (TEs), a crucial but complex and time-consuming task. To improve the performance of tools that accomplish these tasks, Machine Learning approaches (ML) that leverage computer resources, such as GPUs (Graphical Processing Unit) and multiple CPU (Central Processing Unit) cores, have been adopted. However, until now, the use of ML techniques has mostly been limited to classification of TEs. Herein, a detection-classification strategy (named YORO) based on convolutional neural networks is adapted from computer vision (YOLO) to genomics. This approach enables the detection of genomic objects through the prediction of the position, length, and classification in large DNA sequences such as fully sequenced genomes. As a proof of concept, the internal protein-coding domains of LTR-retrotransposons are used to train the proposed neural network. Precision, recall, accuracy, F1-score, execution times and time ratios, as well as several graphical representations were used as metrics to measure performance. These promising results open the door for a new generation of Deep Learning tools for genomics. YORO architecture is available at https://github.com/simonorozcoarias/YORO.
format Online
Article
Text
id pubmed-10513252
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105132522023-09-22 Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks Orozco-Arias, Simon Lopez-Murillo, Luis Humberto Piña, Johan S. Valencia-Castrillon, Estiven Tabares-Soto, Reinel Castillo-Ossa, Luis Isaza, Gustavo Guyot, Romain PLoS One Research Article Analysis of eukaryotic genomes requires the detection and classification of transposable elements (TEs), a crucial but complex and time-consuming task. To improve the performance of tools that accomplish these tasks, Machine Learning approaches (ML) that leverage computer resources, such as GPUs (Graphical Processing Unit) and multiple CPU (Central Processing Unit) cores, have been adopted. However, until now, the use of ML techniques has mostly been limited to classification of TEs. Herein, a detection-classification strategy (named YORO) based on convolutional neural networks is adapted from computer vision (YOLO) to genomics. This approach enables the detection of genomic objects through the prediction of the position, length, and classification in large DNA sequences such as fully sequenced genomes. As a proof of concept, the internal protein-coding domains of LTR-retrotransposons are used to train the proposed neural network. Precision, recall, accuracy, F1-score, execution times and time ratios, as well as several graphical representations were used as metrics to measure performance. These promising results open the door for a new generation of Deep Learning tools for genomics. YORO architecture is available at https://github.com/simonorozcoarias/YORO. Public Library of Science 2023-09-21 /pmc/articles/PMC10513252/ /pubmed/37733731 http://dx.doi.org/10.1371/journal.pone.0291925 Text en © 2023 Orozco-Arias et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Orozco-Arias, Simon
Lopez-Murillo, Luis Humberto
Piña, Johan S.
Valencia-Castrillon, Estiven
Tabares-Soto, Reinel
Castillo-Ossa, Luis
Isaza, Gustavo
Guyot, Romain
Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks
title Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks
title_full Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks
title_fullStr Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks
title_full_unstemmed Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks
title_short Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks
title_sort genomic object detection: an improved approach for transposable elements detection and classification using convolutional neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10513252/
https://www.ncbi.nlm.nih.gov/pubmed/37733731
http://dx.doi.org/10.1371/journal.pone.0291925
work_keys_str_mv AT orozcoariassimon genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks
AT lopezmurilloluishumberto genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks
AT pinajohans genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks
AT valenciacastrillonestiven genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks
AT tabaressotoreinel genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks
AT castilloossaluis genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks
AT isazagustavo genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks
AT guyotromain genomicobjectdetectionanimprovedapproachfortransposableelementsdetectionandclassificationusingconvolutionalneuralnetworks