Cargando…

Evaluating, Filtering and Clustering Genetic Disease Cohorts Based on Human Phenotype Ontology Data with Cohort Analyzer

Exhaustive and comprehensive analysis of pathological traits is essential to understanding genetic diseases, performing precise diagnosis and prescribing personalized treatments. It is particularly important for disease cohorts, as thoroughly detailed phenotypic profiles allow patients to be compare...

Descripción completa

Detalles Bibliográficos
Autores principales: Rojano, Elena, Córdoba-Caballero, José, Jabato, Fernando M., Gallego, Diana, Serrano, Mercedes, Pérez, Belén, Parés-Aguilar, Álvaro, Perkins, James R., Ranea, Juan A. G., Seoane-Zonjic, Pedro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8398478/
https://www.ncbi.nlm.nih.gov/pubmed/34442375
http://dx.doi.org/10.3390/jpm11080730
_version_ 1783744849946607616
author Rojano, Elena
Córdoba-Caballero, José
Jabato, Fernando M.
Gallego, Diana
Serrano, Mercedes
Pérez, Belén
Parés-Aguilar, Álvaro
Perkins, James R.
Ranea, Juan A. G.
Seoane-Zonjic, Pedro
author_facet Rojano, Elena
Córdoba-Caballero, José
Jabato, Fernando M.
Gallego, Diana
Serrano, Mercedes
Pérez, Belén
Parés-Aguilar, Álvaro
Perkins, James R.
Ranea, Juan A. G.
Seoane-Zonjic, Pedro
author_sort Rojano, Elena
collection PubMed
description Exhaustive and comprehensive analysis of pathological traits is essential to understanding genetic diseases, performing precise diagnosis and prescribing personalized treatments. It is particularly important for disease cohorts, as thoroughly detailed phenotypic profiles allow patients to be compared and contrasted. However, many disease cohorts contain patients that have been ascribed low numbers of very general and relatively uninformative phenotypes. We present Cohort Analyzer, a tool that measures the phenotyping quality of patient cohorts. It calculates multiple statistics to give a general overview of the cohort status in terms of the depth and breadth of phenotyping, allowing us to detect less well-phenotyped patients for re-examining or excluding from further analyses. In addition, it performs clustering analysis to find subgroups of patients that share similar phenotypic profiles. We used it to analyse three cohorts of genetic diseases patients with very different properties. We found that cohorts with the most specific and complete phenotypic characterization give more potential insights into the disease than those that were less deeply characterised by forming more informative clusters. For two of the cohorts, we also analysed genomic data related to the patients, and linked the genomic data to the patient-subgroups by mapping shared variants to genes and functions. The work highlights the need for improved phenotyping in this era of personalized medicine. The tool itself is freely available alongside a workflow to allow the analyses shown in this work to be applied to other datasets.
format Online
Article
Text
id pubmed-8398478
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83984782021-08-29 Evaluating, Filtering and Clustering Genetic Disease Cohorts Based on Human Phenotype Ontology Data with Cohort Analyzer Rojano, Elena Córdoba-Caballero, José Jabato, Fernando M. Gallego, Diana Serrano, Mercedes Pérez, Belén Parés-Aguilar, Álvaro Perkins, James R. Ranea, Juan A. G. Seoane-Zonjic, Pedro J Pers Med Article Exhaustive and comprehensive analysis of pathological traits is essential to understanding genetic diseases, performing precise diagnosis and prescribing personalized treatments. It is particularly important for disease cohorts, as thoroughly detailed phenotypic profiles allow patients to be compared and contrasted. However, many disease cohorts contain patients that have been ascribed low numbers of very general and relatively uninformative phenotypes. We present Cohort Analyzer, a tool that measures the phenotyping quality of patient cohorts. It calculates multiple statistics to give a general overview of the cohort status in terms of the depth and breadth of phenotyping, allowing us to detect less well-phenotyped patients for re-examining or excluding from further analyses. In addition, it performs clustering analysis to find subgroups of patients that share similar phenotypic profiles. We used it to analyse three cohorts of genetic diseases patients with very different properties. We found that cohorts with the most specific and complete phenotypic characterization give more potential insights into the disease than those that were less deeply characterised by forming more informative clusters. For two of the cohorts, we also analysed genomic data related to the patients, and linked the genomic data to the patient-subgroups by mapping shared variants to genes and functions. The work highlights the need for improved phenotyping in this era of personalized medicine. The tool itself is freely available alongside a workflow to allow the analyses shown in this work to be applied to other datasets. MDPI 2021-07-27 /pmc/articles/PMC8398478/ /pubmed/34442375 http://dx.doi.org/10.3390/jpm11080730 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rojano, Elena
Córdoba-Caballero, José
Jabato, Fernando M.
Gallego, Diana
Serrano, Mercedes
Pérez, Belén
Parés-Aguilar, Álvaro
Perkins, James R.
Ranea, Juan A. G.
Seoane-Zonjic, Pedro
Evaluating, Filtering and Clustering Genetic Disease Cohorts Based on Human Phenotype Ontology Data with Cohort Analyzer
title Evaluating, Filtering and Clustering Genetic Disease Cohorts Based on Human Phenotype Ontology Data with Cohort Analyzer
title_full Evaluating, Filtering and Clustering Genetic Disease Cohorts Based on Human Phenotype Ontology Data with Cohort Analyzer
title_fullStr Evaluating, Filtering and Clustering Genetic Disease Cohorts Based on Human Phenotype Ontology Data with Cohort Analyzer
title_full_unstemmed Evaluating, Filtering and Clustering Genetic Disease Cohorts Based on Human Phenotype Ontology Data with Cohort Analyzer
title_short Evaluating, Filtering and Clustering Genetic Disease Cohorts Based on Human Phenotype Ontology Data with Cohort Analyzer
title_sort evaluating, filtering and clustering genetic disease cohorts based on human phenotype ontology data with cohort analyzer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8398478/
https://www.ncbi.nlm.nih.gov/pubmed/34442375
http://dx.doi.org/10.3390/jpm11080730
work_keys_str_mv AT rojanoelena evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT cordobacaballerojose evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT jabatofernandom evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT gallegodiana evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT serranomercedes evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT perezbelen evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT paresaguilaralvaro evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT perkinsjamesr evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT raneajuanag evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer
AT seoanezonjicpedro evaluatingfilteringandclusteringgeneticdiseasecohortsbasedonhumanphenotypeontologydatawithcohortanalyzer