A pipeline for RNA-seq based eQTL analysis with automated quality control procedures

BACKGROUND: Advances in the expression quantitative trait loci (eQTL) studies have provided valuable insights into the mechanism of diseases and traits-associated genetic variants. However, it remains challenging to evaluate and control the quality of multi-source heterogeneous eQTL raw data for res...

Bibliographic Details
Main authors: Wang, Tao; Liu, Yongzhuang; Ruan, Junpeng; Dong, Xianjun; Wang, Yadong; Peng, Jiajie
Format: Online Article Text
Language: English
Published: BioMed Central 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8386049/
https://www.ncbi.nlm.nih.gov/pubmed/34433407
http://dx.doi.org/10.1186/s12859-021-04307-0
_version_ 1783742190033305600
author Wang, Tao
Liu, Yongzhuang
Ruan, Junpeng
Dong, Xianjun
Wang, Yadong
Peng, Jiajie
author_sort Wang, Tao
collection PubMed
description BACKGROUND: Advances in the expression quantitative trait loci (eQTL) studies have provided valuable insights into the mechanism of diseases and traits-associated genetic variants. However, it remains challenging to evaluate and control the quality of multi-source heterogeneous eQTL raw data for researchers with limited computational background. There is an urgent need to develop a powerful and user-friendly tool to automatically process the raw datasets in various formats and perform the eQTL mapping afterward. RESULTS: In this work, we present a pipeline for eQTL analysis, termed eQTLQC, featured with automated data preprocessing for both genotype data and gene expression data. Our pipeline provides a set of quality control and normalization approaches, and utilizes automated techniques to reduce manual intervention. We demonstrate the utility and robustness of this pipeline by performing eQTL case studies using multiple independent real-world datasets with RNA-seq data and whole genome sequencing (WGS) based genotype data. CONCLUSIONS: eQTLQC provides a reliable computational workflow for eQTL analysis. It provides standard quality control and normalization as well as eQTL mapping procedures for eQTL raw data in multiple formats. The source code, demo data, and instructions are freely available at https://github.com/stormlovetao/eQTLQC.
format Online
Article
Text
id pubmed-8386049
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-83860492021-08-26 A pipeline for RNA-seq based eQTL analysis with automated quality control procedures Wang, Tao Liu, Yongzhuang Ruan, Junpeng Dong, Xianjun Wang, Yadong Peng, Jiajie BMC Bioinformatics Research BACKGROUND: Advances in the expression quantitative trait loci (eQTL) studies have provided valuable insights into the mechanism of diseases and traits-associated genetic variants. However, it remains challenging to evaluate and control the quality of multi-source heterogeneous eQTL raw data for researchers with limited computational background. There is an urgent need to develop a powerful and user-friendly tool to automatically process the raw datasets in various formats and perform the eQTL mapping afterward. RESULTS: In this work, we present a pipeline for eQTL analysis, termed eQTLQC, featured with automated data preprocessing for both genotype data and gene expression data. Our pipeline provides a set of quality control and normalization approaches, and utilizes automated techniques to reduce manual intervention. We demonstrate the utility and robustness of this pipeline by performing eQTL case studies using multiple independent real-world datasets with RNA-seq data and whole genome sequencing (WGS) based genotype data. CONCLUSIONS: eQTLQC provides a reliable computational workflow for eQTL analysis. It provides standard quality control and normalization as well as eQTL mapping procedures for eQTL raw data in multiple formats. The source code, demo data, and instructions are freely available at https://github.com/stormlovetao/eQTLQC. 
BioMed Central 2021-08-25 /pmc/articles/PMC8386049/ /pubmed/34433407 http://dx.doi.org/10.1186/s12859-021-04307-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title A pipeline for RNA-seq based eQTL analysis with automated quality control procedures
topic Research