Cargando…

Proposal of supervised data analysis strategy of plasma miRNAs from hybridisation array data with an application to assess hemolysis-related deregulation

BACKGROUND: Plasma miRNAs have the potential as cancer biomarkers but no consolidated guidelines for data mining in this field are available. The purpose of the study was to apply a supervised data analysis strategy in a context where prior knowledge is available, i.e., that of hemolysis-related miR...

Descripción completa

Detalles Bibliográficos
Autores principales: Landoni, Elena, Miceli, Rosalba, Callari, Maurizio, Tiberio, Paola, Appierto, Valentina, Angeloni, Valentina, Mariani, Luigi, Daidone, Maria Grazia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4650369/
https://www.ncbi.nlm.nih.gov/pubmed/26581577
http://dx.doi.org/10.1186/s12859-015-0820-9
_version_ 1782401482702716928
author Landoni, Elena
Miceli, Rosalba
Callari, Maurizio
Tiberio, Paola
Appierto, Valentina
Angeloni, Valentina
Mariani, Luigi
Daidone, Maria Grazia
author_facet Landoni, Elena
Miceli, Rosalba
Callari, Maurizio
Tiberio, Paola
Appierto, Valentina
Angeloni, Valentina
Mariani, Luigi
Daidone, Maria Grazia
author_sort Landoni, Elena
collection PubMed
description BACKGROUND: Plasma miRNAs have the potential as cancer biomarkers but no consolidated guidelines for data mining in this field are available. The purpose of the study was to apply a supervised data analysis strategy in a context where prior knowledge is available, i.e., that of hemolysis-related miRNAs deregulation, so as to compare our results with existing evidence. RESULTS: We developed a structured strategy with innovative applications of existing bioinformatics methods for supervised analyses including: 1) the combination of two statistical (t- and Anderson-Darling) test results to detect miRNAs with significant fold change or general distributional differences in class comparison, which could reveal hidden differential biological processes worth to be considered for building predictive tools; 2) a bootstrap selection procedure together with machine learning techniques in class prediction to guarantee the transferability of results and explore the interconnections among the selected miRNAs, which is important for highlighting their inherent biological dependences. The strategy was applied to develop a classifier for discriminating between hemolyzed and not hemolyzed plasma samples, defined according to a recently published hemolysis score. We identified five miRNAs with increased expression in hemolyzed plasma samples (miR-486-5p, miR-92a, miR-451, miR-16, miR-22). CONCLUSIONS: We identified four miRNAs previously reported in the literature as hemolysis related together with a new one (miR-22).which needs further investigations. Our findings confirm the validity of the proposed strategy and, in parallel, the hemolysis score capability to be used as pre-analytic hemolysis detector. R codes for implementing the approaches are provided. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0820-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4650369
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46503692015-11-19 Proposal of supervised data analysis strategy of plasma miRNAs from hybridisation array data with an application to assess hemolysis-related deregulation Landoni, Elena Miceli, Rosalba Callari, Maurizio Tiberio, Paola Appierto, Valentina Angeloni, Valentina Mariani, Luigi Daidone, Maria Grazia BMC Bioinformatics Research Article BACKGROUND: Plasma miRNAs have the potential as cancer biomarkers but no consolidated guidelines for data mining in this field are available. The purpose of the study was to apply a supervised data analysis strategy in a context where prior knowledge is available, i.e., that of hemolysis-related miRNAs deregulation, so as to compare our results with existing evidence. RESULTS: We developed a structured strategy with innovative applications of existing bioinformatics methods for supervised analyses including: 1) the combination of two statistical (t- and Anderson-Darling) test results to detect miRNAs with significant fold change or general distributional differences in class comparison, which could reveal hidden differential biological processes worth to be considered for building predictive tools; 2) a bootstrap selection procedure together with machine learning techniques in class prediction to guarantee the transferability of results and explore the interconnections among the selected miRNAs, which is important for highlighting their inherent biological dependences. The strategy was applied to develop a classifier for discriminating between hemolyzed and not hemolyzed plasma samples, defined according to a recently published hemolysis score. We identified five miRNAs with increased expression in hemolyzed plasma samples (miR-486-5p, miR-92a, miR-451, miR-16, miR-22). CONCLUSIONS: We identified four miRNAs previously reported in the literature as hemolysis related together with a new one (miR-22).which needs further investigations. Our findings confirm the validity of the proposed strategy and, in parallel, the hemolysis score capability to be used as pre-analytic hemolysis detector. R codes for implementing the approaches are provided. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0820-9) contains supplementary material, which is available to authorized users. BioMed Central 2015-11-18 /pmc/articles/PMC4650369/ /pubmed/26581577 http://dx.doi.org/10.1186/s12859-015-0820-9 Text en © Landoni et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Landoni, Elena
Miceli, Rosalba
Callari, Maurizio
Tiberio, Paola
Appierto, Valentina
Angeloni, Valentina
Mariani, Luigi
Daidone, Maria Grazia
Proposal of supervised data analysis strategy of plasma miRNAs from hybridisation array data with an application to assess hemolysis-related deregulation
title Proposal of supervised data analysis strategy of plasma miRNAs from hybridisation array data with an application to assess hemolysis-related deregulation
title_full Proposal of supervised data analysis strategy of plasma miRNAs from hybridisation array data with an application to assess hemolysis-related deregulation
title_fullStr Proposal of supervised data analysis strategy of plasma miRNAs from hybridisation array data with an application to assess hemolysis-related deregulation
title_full_unstemmed Proposal of supervised data analysis strategy of plasma miRNAs from hybridisation array data with an application to assess hemolysis-related deregulation
title_short Proposal of supervised data analysis strategy of plasma miRNAs from hybridisation array data with an application to assess hemolysis-related deregulation
title_sort proposal of supervised data analysis strategy of plasma mirnas from hybridisation array data with an application to assess hemolysis-related deregulation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4650369/
https://www.ncbi.nlm.nih.gov/pubmed/26581577
http://dx.doi.org/10.1186/s12859-015-0820-9
work_keys_str_mv AT landonielena proposalofsuperviseddataanalysisstrategyofplasmamirnasfromhybridisationarraydatawithanapplicationtoassesshemolysisrelatedderegulation
AT micelirosalba proposalofsuperviseddataanalysisstrategyofplasmamirnasfromhybridisationarraydatawithanapplicationtoassesshemolysisrelatedderegulation
AT callarimaurizio proposalofsuperviseddataanalysisstrategyofplasmamirnasfromhybridisationarraydatawithanapplicationtoassesshemolysisrelatedderegulation
AT tiberiopaola proposalofsuperviseddataanalysisstrategyofplasmamirnasfromhybridisationarraydatawithanapplicationtoassesshemolysisrelatedderegulation
AT appiertovalentina proposalofsuperviseddataanalysisstrategyofplasmamirnasfromhybridisationarraydatawithanapplicationtoassesshemolysisrelatedderegulation
AT angelonivalentina proposalofsuperviseddataanalysisstrategyofplasmamirnasfromhybridisationarraydatawithanapplicationtoassesshemolysisrelatedderegulation
AT marianiluigi proposalofsuperviseddataanalysisstrategyofplasmamirnasfromhybridisationarraydatawithanapplicationtoassesshemolysisrelatedderegulation
AT daidonemariagrazia proposalofsuperviseddataanalysisstrategyofplasmamirnasfromhybridisationarraydatawithanapplicationtoassesshemolysisrelatedderegulation