Cargando…
Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures
Modification mapping from cDNA data has become a tremendously important approach in epitranscriptomics. So-called reverse transcription signatures in cDNA contain information on the position and nature of their causative RNA modifications. Data mining of, e.g. Illumina-based high-throughput sequenci...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6774277/ https://www.ncbi.nlm.nih.gov/pubmed/31608115 http://dx.doi.org/10.3389/fgene.2019.00876 |
_version_ | 1783456050655002624 |
---|---|
author | Schmidt, Lukas Werner, Stephan Kemmer, Thomas Niebler, Stefan Kristen, Marco Ayadi, Lilia Johe, Patrick Marchand, Virginie Schirmeister, Tanja Motorin, Yuri Hildebrandt, Andreas Schmidt, Bertil Helm, Mark |
author_facet | Schmidt, Lukas Werner, Stephan Kemmer, Thomas Niebler, Stefan Kristen, Marco Ayadi, Lilia Johe, Patrick Marchand, Virginie Schirmeister, Tanja Motorin, Yuri Hildebrandt, Andreas Schmidt, Bertil Helm, Mark |
author_sort | Schmidt, Lukas |
collection | PubMed |
description | Modification mapping from cDNA data has become a tremendously important approach in epitranscriptomics. So-called reverse transcription signatures in cDNA contain information on the position and nature of their causative RNA modifications. Data mining of, e.g. Illumina-based high-throughput sequencing data, is therefore fast growing in importance, and the field is still lacking effective tools. Here we present a versatile user-friendly graphical workflow system for modification calling based on machine learning. The workflow commences with a principal module for trimming, mapping, and postprocessing. The latter includes a quantification of mismatch and arrest rates with single-nucleotide resolution across the mapped transcriptome. Further downstream modules include tools for visualization, machine learning, and modification calling. From the machine-learning module, quality assessment parameters are provided to gauge the suitability of the initial dataset for effective machine learning and modification calling. This output is useful to improve the experimental parameters for library preparation and sequencing. In summary, the automation of the bioinformatics workflow allows a faster turnaround of the optimization cycles in modification calling. |
format | Online Article Text |
id | pubmed-6774277 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-67742772019-10-13 Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures Schmidt, Lukas Werner, Stephan Kemmer, Thomas Niebler, Stefan Kristen, Marco Ayadi, Lilia Johe, Patrick Marchand, Virginie Schirmeister, Tanja Motorin, Yuri Hildebrandt, Andreas Schmidt, Bertil Helm, Mark Front Genet Genetics Modification mapping from cDNA data has become a tremendously important approach in epitranscriptomics. So-called reverse transcription signatures in cDNA contain information on the position and nature of their causative RNA modifications. Data mining of, e.g. Illumina-based high-throughput sequencing data, is therefore fast growing in importance, and the field is still lacking effective tools. Here we present a versatile user-friendly graphical workflow system for modification calling based on machine learning. The workflow commences with a principal module for trimming, mapping, and postprocessing. The latter includes a quantification of mismatch and arrest rates with single-nucleotide resolution across the mapped transcriptome. Further downstream modules include tools for visualization, machine learning, and modification calling. From the machine-learning module, quality assessment parameters are provided to gauge the suitability of the initial dataset for effective machine learning and modification calling. This output is useful to improve the experimental parameters for library preparation and sequencing. In summary, the automation of the bioinformatics workflow allows a faster turnaround of the optimization cycles in modification calling. Frontiers Media S.A. 2019-09-25 /pmc/articles/PMC6774277/ /pubmed/31608115 http://dx.doi.org/10.3389/fgene.2019.00876 Text en Copyright © 2019 Schmidt, Werner, Kemmer, Niebler, Kristen, Ayadi, Johe, Marchand, Schirmeister, Motorin, Hildebrandt, Schmidt and Helm http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Schmidt, Lukas Werner, Stephan Kemmer, Thomas Niebler, Stefan Kristen, Marco Ayadi, Lilia Johe, Patrick Marchand, Virginie Schirmeister, Tanja Motorin, Yuri Hildebrandt, Andreas Schmidt, Bertil Helm, Mark Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures |
title | Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures |
title_full | Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures |
title_fullStr | Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures |
title_full_unstemmed | Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures |
title_short | Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures |
title_sort | graphical workflow system for modification calling by machine learning of reverse transcription signatures |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6774277/ https://www.ncbi.nlm.nih.gov/pubmed/31608115 http://dx.doi.org/10.3389/fgene.2019.00876 |
work_keys_str_mv | AT schmidtlukas graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT wernerstephan graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT kemmerthomas graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT nieblerstefan graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT kristenmarco graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT ayadililia graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT johepatrick graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT marchandvirginie graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT schirmeistertanja graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT motorinyuri graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT hildebrandtandreas graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT schmidtbertil graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures AT helmmark graphicalworkflowsystemformodificationcallingbymachinelearningofreversetranscriptionsignatures |