Cargando…

multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles

Clustering is carried out to identify patterns in transcriptomics profiles to determine clinically relevant subgroups of patients. Feature (gene) selection is a critical and an integral part of the process. Currently, there are many feature selection and clustering methods to identify the relevant g...

Descripción completa

Detalles Bibliográficos
Autores principales: Lawlor, Nathan, Fabbri, Alec, Guan, Peiyong, George, Joshy, Karuturi, R. Krishna Murthy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4907340/
https://www.ncbi.nlm.nih.gov/pubmed/27330269
http://dx.doi.org/10.4137/CIN.S38000
_version_ 1782437530503741440
author Lawlor, Nathan
Fabbri, Alec
Guan, Peiyong
George, Joshy
Karuturi, R. Krishna Murthy
author_facet Lawlor, Nathan
Fabbri, Alec
Guan, Peiyong
George, Joshy
Karuturi, R. Krishna Murthy
author_sort Lawlor, Nathan
collection PubMed
description Clustering is carried out to identify patterns in transcriptomics profiles to determine clinically relevant subgroups of patients. Feature (gene) selection is a critical and an integral part of the process. Currently, there are many feature selection and clustering methods to identify the relevant genes and perform clustering of samples. However, choosing an appropriate methodology is difficult. In addition, extensive feature selection methods have not been supported by the available packages. Hence, we developed an integrative R-package called multiClust that allows researchers to experiment with the choice of combination of methods for gene selection and clustering with ease. Using multiClust, we identified the best performing clustering methodology in the context of clinical outcome. Our observations demonstrate that simple methods such as variance-based ranking perform well on the majority of data sets, provided that the appropriate number of genes is selected. However, different gene ranking and selection methods remain relevant as no methodology works for all studies.
format Online
Article
Text
id pubmed-4907340
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-49073402016-06-17 multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles Lawlor, Nathan Fabbri, Alec Guan, Peiyong George, Joshy Karuturi, R. Krishna Murthy Cancer Inform Software or Database Review Clustering is carried out to identify patterns in transcriptomics profiles to determine clinically relevant subgroups of patients. Feature (gene) selection is a critical and an integral part of the process. Currently, there are many feature selection and clustering methods to identify the relevant genes and perform clustering of samples. However, choosing an appropriate methodology is difficult. In addition, extensive feature selection methods have not been supported by the available packages. Hence, we developed an integrative R-package called multiClust that allows researchers to experiment with the choice of combination of methods for gene selection and clustering with ease. Using multiClust, we identified the best performing clustering methodology in the context of clinical outcome. Our observations demonstrate that simple methods such as variance-based ranking perform well on the majority of data sets, provided that the appropriate number of genes is selected. However, different gene ranking and selection methods remain relevant as no methodology works for all studies. Libertas Academica 2016-06-12 /pmc/articles/PMC4907340/ /pubmed/27330269 http://dx.doi.org/10.4137/CIN.S38000 Text en © 2016 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 license.
spellingShingle Software or Database Review
Lawlor, Nathan
Fabbri, Alec
Guan, Peiyong
George, Joshy
Karuturi, R. Krishna Murthy
multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles
title multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles
title_full multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles
title_fullStr multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles
title_full_unstemmed multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles
title_short multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles
title_sort multiclust: an r-package for identifying biologically relevant clusters in cancer transcriptome profiles
topic Software or Database Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4907340/
https://www.ncbi.nlm.nih.gov/pubmed/27330269
http://dx.doi.org/10.4137/CIN.S38000
work_keys_str_mv AT lawlornathan multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles
AT fabbrialec multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles
AT guanpeiyong multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles
AT georgejoshy multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles
AT karuturirkrishnamurthy multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles