Cargando…
multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles
Clustering is carried out to identify patterns in transcriptomics profiles to determine clinically relevant subgroups of patients. Feature (gene) selection is a critical and an integral part of the process. Currently, there are many feature selection and clustering methods to identify the relevant g...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4907340/ https://www.ncbi.nlm.nih.gov/pubmed/27330269 http://dx.doi.org/10.4137/CIN.S38000 |
_version_ | 1782437530503741440 |
---|---|
author | Lawlor, Nathan Fabbri, Alec Guan, Peiyong George, Joshy Karuturi, R. Krishna Murthy |
author_facet | Lawlor, Nathan Fabbri, Alec Guan, Peiyong George, Joshy Karuturi, R. Krishna Murthy |
author_sort | Lawlor, Nathan |
collection | PubMed |
description | Clustering is carried out to identify patterns in transcriptomics profiles to determine clinically relevant subgroups of patients. Feature (gene) selection is a critical and an integral part of the process. Currently, there are many feature selection and clustering methods to identify the relevant genes and perform clustering of samples. However, choosing an appropriate methodology is difficult. In addition, extensive feature selection methods have not been supported by the available packages. Hence, we developed an integrative R-package called multiClust that allows researchers to experiment with the choice of combination of methods for gene selection and clustering with ease. Using multiClust, we identified the best performing clustering methodology in the context of clinical outcome. Our observations demonstrate that simple methods such as variance-based ranking perform well on the majority of data sets, provided that the appropriate number of genes is selected. However, different gene ranking and selection methods remain relevant as no methodology works for all studies. |
format | Online Article Text |
id | pubmed-4907340 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-49073402016-06-17 multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles Lawlor, Nathan Fabbri, Alec Guan, Peiyong George, Joshy Karuturi, R. Krishna Murthy Cancer Inform Software or Database Review Clustering is carried out to identify patterns in transcriptomics profiles to determine clinically relevant subgroups of patients. Feature (gene) selection is a critical and an integral part of the process. Currently, there are many feature selection and clustering methods to identify the relevant genes and perform clustering of samples. However, choosing an appropriate methodology is difficult. In addition, extensive feature selection methods have not been supported by the available packages. Hence, we developed an integrative R-package called multiClust that allows researchers to experiment with the choice of combination of methods for gene selection and clustering with ease. Using multiClust, we identified the best performing clustering methodology in the context of clinical outcome. Our observations demonstrate that simple methods such as variance-based ranking perform well on the majority of data sets, provided that the appropriate number of genes is selected. However, different gene ranking and selection methods remain relevant as no methodology works for all studies. Libertas Academica 2016-06-12 /pmc/articles/PMC4907340/ /pubmed/27330269 http://dx.doi.org/10.4137/CIN.S38000 Text en © 2016 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 license. |
spellingShingle | Software or Database Review Lawlor, Nathan Fabbri, Alec Guan, Peiyong George, Joshy Karuturi, R. Krishna Murthy multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles |
title | multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles |
title_full | multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles |
title_fullStr | multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles |
title_full_unstemmed | multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles |
title_short | multiClust: An R-package for Identifying Biologically Relevant Clusters in Cancer Transcriptome Profiles |
title_sort | multiclust: an r-package for identifying biologically relevant clusters in cancer transcriptome profiles |
topic | Software or Database Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4907340/ https://www.ncbi.nlm.nih.gov/pubmed/27330269 http://dx.doi.org/10.4137/CIN.S38000 |
work_keys_str_mv | AT lawlornathan multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles AT fabbrialec multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles AT guanpeiyong multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles AT georgejoshy multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles AT karuturirkrishnamurthy multiclustanrpackageforidentifyingbiologicallyrelevantclustersincancertranscriptomeprofiles |