Cargando…

A statistical framework for QTL hotspot detection

Quantitative trait loci (QTL) hotspots (genomic locations enriched in QTL) are a common and notable feature when collecting many QTL for various traits in many areas of biological studies. The QTL hotspots are important and attractive since they are highly informative and may harbor genes for the qu...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Po-Ya, Yang, Man-Hsia, Kao, Chen-Hung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8049418/
https://www.ncbi.nlm.nih.gov/pubmed/33638985
http://dx.doi.org/10.1093/g3journal/jkab056
_version_ 1783679422726930432
author Wu, Po-Ya
Yang, Man-Hsia
Kao, Chen-Hung
author_facet Wu, Po-Ya
Yang, Man-Hsia
Kao, Chen-Hung
author_sort Wu, Po-Ya
collection PubMed
description Quantitative trait loci (QTL) hotspots (genomic locations enriched in QTL) are a common and notable feature when collecting many QTL for various traits in many areas of biological studies. The QTL hotspots are important and attractive since they are highly informative and may harbor genes for the quantitative traits. So far, the current statistical methods for QTL hotspot detection use either the individual-level data from the genetical genomics experiments or the summarized data from public QTL databases to proceed with the detection analysis. These methods may suffer from the problems of ignoring the correlation structure among traits, neglecting the magnitude of LOD scores for the QTL, or paying a very high computational cost, which often lead to the detection of excessive spurious hotspots, failure to discover biologically interesting hotspots composed of a small-to-moderate number of QTL with strong LOD scores, and computational intractability, respectively, during the detection process. In this article, we describe a statistical framework that can handle both types of data as well as address all the problems at a time for QTL hotspot detection. Our statistical framework directly operates on the QTL matrix and hence has a very cheap computational cost and is deployed to take advantage of the QTL mapping results for assisting the detection analysis. Two special devices, trait grouping and top [Formula: see text] profile, are introduced into the framework. The trait grouping attempts to group the traits controlled by closely linked or pleiotropic QTL together into the same trait groups and randomly allocates these QTL together across the genomic positions separately by trait group to account for the correlation structure among traits, so as to have the ability to obtain much stricter thresholds and dismiss spurious hotspots. The top [Formula: see text] profile is designed to outline the LOD-score pattern of QTL in a hotspot across the different hotspot architectures, so that it can serve to identify and characterize the types of QTL hotspots with varying sizes and LOD-score distributions. Real examples, numerical analysis, and simulation study are performed to validate our statistical framework, investigate the detection properties, and also compare with the current methods in QTL hotspot detection. The results demonstrate that the proposed statistical framework can effectively accommodate the correlation structure among traits, identify the types of hotspots, and still keep the notable features of easy implementation and fast computation for practical QTL hotspot detection.
format Online
Article
Text
id pubmed-8049418
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80494182021-04-21 A statistical framework for QTL hotspot detection Wu, Po-Ya Yang, Man-Hsia Kao, Chen-Hung G3 (Bethesda) Investigation Quantitative trait loci (QTL) hotspots (genomic locations enriched in QTL) are a common and notable feature when collecting many QTL for various traits in many areas of biological studies. The QTL hotspots are important and attractive since they are highly informative and may harbor genes for the quantitative traits. So far, the current statistical methods for QTL hotspot detection use either the individual-level data from the genetical genomics experiments or the summarized data from public QTL databases to proceed with the detection analysis. These methods may suffer from the problems of ignoring the correlation structure among traits, neglecting the magnitude of LOD scores for the QTL, or paying a very high computational cost, which often lead to the detection of excessive spurious hotspots, failure to discover biologically interesting hotspots composed of a small-to-moderate number of QTL with strong LOD scores, and computational intractability, respectively, during the detection process. In this article, we describe a statistical framework that can handle both types of data as well as address all the problems at a time for QTL hotspot detection. Our statistical framework directly operates on the QTL matrix and hence has a very cheap computational cost and is deployed to take advantage of the QTL mapping results for assisting the detection analysis. Two special devices, trait grouping and top [Formula: see text] profile, are introduced into the framework. The trait grouping attempts to group the traits controlled by closely linked or pleiotropic QTL together into the same trait groups and randomly allocates these QTL together across the genomic positions separately by trait group to account for the correlation structure among traits, so as to have the ability to obtain much stricter thresholds and dismiss spurious hotspots. The top [Formula: see text] profile is designed to outline the LOD-score pattern of QTL in a hotspot across the different hotspot architectures, so that it can serve to identify and characterize the types of QTL hotspots with varying sizes and LOD-score distributions. Real examples, numerical analysis, and simulation study are performed to validate our statistical framework, investigate the detection properties, and also compare with the current methods in QTL hotspot detection. The results demonstrate that the proposed statistical framework can effectively accommodate the correlation structure among traits, identify the types of hotspots, and still keep the notable features of easy implementation and fast computation for practical QTL hotspot detection. Oxford University Press 2021-02-26 /pmc/articles/PMC8049418/ /pubmed/33638985 http://dx.doi.org/10.1093/g3journal/jkab056 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Wu, Po-Ya
Yang, Man-Hsia
Kao, Chen-Hung
A statistical framework for QTL hotspot detection
title A statistical framework for QTL hotspot detection
title_full A statistical framework for QTL hotspot detection
title_fullStr A statistical framework for QTL hotspot detection
title_full_unstemmed A statistical framework for QTL hotspot detection
title_short A statistical framework for QTL hotspot detection
title_sort statistical framework for qtl hotspot detection
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8049418/
https://www.ncbi.nlm.nih.gov/pubmed/33638985
http://dx.doi.org/10.1093/g3journal/jkab056
work_keys_str_mv AT wupoya astatisticalframeworkforqtlhotspotdetection
AT yangmanhsia astatisticalframeworkforqtlhotspotdetection
AT kaochenhung astatisticalframeworkforqtlhotspotdetection
AT wupoya statisticalframeworkforqtlhotspotdetection
AT yangmanhsia statisticalframeworkforqtlhotspotdetection
AT kaochenhung statisticalframeworkforqtlhotspotdetection