Cargando…

A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information

BACKGROUND: Human mitochondrial DNA has an important role in the cellular energy production through oxidative phosphorylation. Therefore, this process may be the cause and have an effect on mitochondrial DNA mutability, functional alteration, and disease onset related to a wide range of different cl...

Descripción completa

Detalles Bibliográficos
Autores principales: Vitale, Ornella, Preste, Roberto, Palmisano, Donato, Attimonelli, Marcella
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7005629/
https://www.ncbi.nlm.nih.gov/pubmed/31821723
http://dx.doi.org/10.1002/mgg3.1085
_version_ 1783494977359183872
author Vitale, Ornella
Preste, Roberto
Palmisano, Donato
Attimonelli, Marcella
author_facet Vitale, Ornella
Preste, Roberto
Palmisano, Donato
Attimonelli, Marcella
author_sort Vitale, Ornella
collection PubMed
description BACKGROUND: Human mitochondrial DNA has an important role in the cellular energy production through oxidative phosphorylation. Therefore, this process may be the cause and have an effect on mitochondrial DNA mutability, functional alteration, and disease onset related to a wide range of different clinical expressions and phenotypes. Although a large part of the observed variations is fixed in a population and hence expected to be benign, the estimation of the degree of the pathogenicity of any possible human mitochondrial DNA variant is clinically pivotal. METHODS: In this scenario, the establishment of standard criteria based on functional studies is required. In this context, a “data and text mining” pipeline is proposed here, developed using the programming language R, capable of extracting information regarding mitochondrial DNA functional studies and related clinical assessments from the literature, thus improving the annotation of human mitochondrial variants reported in the HmtVar database. RESULTS: The data mining pipeline has produced a list of 1,073 Pubmed IDs (PMIDs) from which the text mining pipeline has retrieved information on 932 human mitochondrial variants regarding experimental validation and clinical features. CONCLUSIONS: The application of the pipeline will contribute to supporting the interpretation of pathogenicity of human mitochondrial variants by facilitating diagnosis to clinicians and researchers faced with this task.
format Online
Article
Text
id pubmed-7005629
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-70056292020-02-13 A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information Vitale, Ornella Preste, Roberto Palmisano, Donato Attimonelli, Marcella Mol Genet Genomic Med Original Articles BACKGROUND: Human mitochondrial DNA has an important role in the cellular energy production through oxidative phosphorylation. Therefore, this process may be the cause and have an effect on mitochondrial DNA mutability, functional alteration, and disease onset related to a wide range of different clinical expressions and phenotypes. Although a large part of the observed variations is fixed in a population and hence expected to be benign, the estimation of the degree of the pathogenicity of any possible human mitochondrial DNA variant is clinically pivotal. METHODS: In this scenario, the establishment of standard criteria based on functional studies is required. In this context, a “data and text mining” pipeline is proposed here, developed using the programming language R, capable of extracting information regarding mitochondrial DNA functional studies and related clinical assessments from the literature, thus improving the annotation of human mitochondrial variants reported in the HmtVar database. RESULTS: The data mining pipeline has produced a list of 1,073 Pubmed IDs (PMIDs) from which the text mining pipeline has retrieved information on 932 human mitochondrial variants regarding experimental validation and clinical features. CONCLUSIONS: The application of the pipeline will contribute to supporting the interpretation of pathogenicity of human mitochondrial variants by facilitating diagnosis to clinicians and researchers faced with this task. John Wiley and Sons Inc. 2019-12-10 /pmc/articles/PMC7005629/ /pubmed/31821723 http://dx.doi.org/10.1002/mgg3.1085 Text en © 2019 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Articles
Vitale, Ornella
Preste, Roberto
Palmisano, Donato
Attimonelli, Marcella
A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
title A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
title_full A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
title_fullStr A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
title_full_unstemmed A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
title_short A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
title_sort data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7005629/
https://www.ncbi.nlm.nih.gov/pubmed/31821723
http://dx.doi.org/10.1002/mgg3.1085
work_keys_str_mv AT vitaleornella adataandtextminingpipelinetoannotatehumanmitochondrialvariantswithfunctionalandclinicalinformation
AT presteroberto adataandtextminingpipelinetoannotatehumanmitochondrialvariantswithfunctionalandclinicalinformation
AT palmisanodonato adataandtextminingpipelinetoannotatehumanmitochondrialvariantswithfunctionalandclinicalinformation
AT attimonellimarcella adataandtextminingpipelinetoannotatehumanmitochondrialvariantswithfunctionalandclinicalinformation
AT vitaleornella dataandtextminingpipelinetoannotatehumanmitochondrialvariantswithfunctionalandclinicalinformation
AT presteroberto dataandtextminingpipelinetoannotatehumanmitochondrialvariantswithfunctionalandclinicalinformation
AT palmisanodonato dataandtextminingpipelinetoannotatehumanmitochondrialvariantswithfunctionalandclinicalinformation
AT attimonellimarcella dataandtextminingpipelinetoannotatehumanmitochondrialvariantswithfunctionalandclinicalinformation