A statistical framework for QTL hotspot detection

Quantitative trait loci (QTL) hotspots (genomic locations enriched in QTL) are a common and notable feature when many QTL for various traits are collected, in many areas of biological study. QTL hotspots are important and attractive because they are highly informative and may harbor genes for the qu...

Bibliographic Details
Main authors:	Wu, Po-Ya; Yang, Man-Hsia; Kao, Chen-Hung
Format:	Online Article Text
Language:	English
Published:	Oxford University Press 2021
Subjects:	Investigation
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8049418/ https://www.ncbi.nlm.nih.gov/pubmed/33638985 http://dx.doi.org/10.1093/g3journal/jkab056

collection	PubMed
description	Quantitative trait loci (QTL) hotspots (genomic locations enriched in QTL) are a common and notable feature when many QTL for various traits are collected, in many areas of biological study. QTL hotspots are important and attractive because they are highly informative and may harbor genes for the quantitative traits. Current statistical methods for QTL hotspot detection use either individual-level data from genetical genomics experiments or summarized data from public QTL databases. These methods may ignore the correlation structure among traits, neglect the magnitude of the LOD scores of the QTL, or incur a very high computational cost. These problems often lead, respectively, to the detection of excessive spurious hotspots, failure to discover biologically interesting hotspots composed of a small-to-moderate number of QTL with strong LOD scores, and computational intractability. In this article, we describe a statistical framework for QTL hotspot detection that can handle both types of data and address all of these problems at once. The framework operates directly on the QTL matrix, so its computational cost is very low, and it takes advantage of the QTL mapping results to assist the detection analysis. Two special devices, trait grouping and the top [Formula: see text] profile, are introduced into the framework. Trait grouping places traits controlled by closely linked or pleiotropic QTL into the same trait group and randomly allocates these QTL together across the genomic positions, separately by trait group, to account for the correlation structure among traits; this yields much stricter thresholds and dismisses spurious hotspots. The top [Formula: see text] profile outlines the LOD-score pattern of the QTL in a hotspot across different hotspot architectures, so that it can identify and characterize types of QTL hotspots with varying sizes and LOD-score distributions. Real examples, numerical analysis, and a simulation study are used to validate the framework, investigate its detection properties, and compare it with current methods for QTL hotspot detection. The results demonstrate that the proposed framework can effectively accommodate the correlation structure among traits and identify the types of hotspots, while remaining easy to implement and fast to compute for practical QTL hotspot detection.
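
The trait-grouping idea in the abstract can be illustrated with a small permutation sketch. This is not the authors' implementation; it is a minimal reading of the description under several assumptions: the QTL matrix is taken to be a 0/1 array of traits by genomic bins, the hotspot size of a bin is its column sum, the null statistic is the largest hotspot size in a permuted matrix, and the threshold is its (1 - alpha) quantile. The function name `hotspot_threshold` and the parameters `qtl`, `groups`, `n_perm`, and `alpha` are hypothetical.

```python
import numpy as np


def hotspot_threshold(qtl, groups, n_perm=1000, alpha=0.05, seed=0):
    """Permutation threshold for hotspot size with trait grouping (sketch).

    qtl    : (n_traits, n_bins) 0/1 array, 1 where a trait has a QTL in a bin
             (assumed layout of the QTL matrix).
    groups : length n_traits sequence of trait-group labels (assumed input).
    """
    rng = np.random.default_rng(seed)
    qtl = np.asarray(qtl)
    groups = np.asarray(groups)
    n_bins = qtl.shape[1]

    # Per-bin QTL counts for each trait group. Permuting one such row moves
    # all QTL of the group with the same bin permutation, so QTL that share
    # a bin within a group stay together.
    group_counts = np.stack(
        [qtl[groups == g].sum(axis=0) for g in np.unique(groups)]
    )

    max_sizes = np.empty(n_perm, dtype=int)
    for p in range(n_perm):
        total = np.zeros(n_bins, dtype=int)
        for row in group_counts:
            # Independent bin permutation for each trait group.
            total += row[rng.permutation(n_bins)]
        max_sizes[p] = total.max()

    # Threshold: (1 - alpha) quantile of the largest null hotspot size.
    return float(np.quantile(max_sizes, 1 - alpha))


# Five traits that all have a QTL in the same bin.
qtl = np.zeros((5, 20), dtype=int)
qtl[:, 3] = 1
one_group = hotspot_threshold(qtl, [0, 0, 0, 0, 0])
separate = hotspot_threshold(qtl, [0, 1, 2, 3, 4])
print(one_group, separate)
```

When all five traits share one group, their co-located QTL always move together, so the threshold equals the observed hotspot size of 5 and the hotspot is not declared significant. When every trait is its own group, the QTL scatter independently and the threshold is lower. This mirrors the abstract's claim that grouping correlated traits gives stricter thresholds.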
id	pubmed-8049418
institution	National Center for Biotechnology Information
record_format	MEDLINE/PubMed
spelling	pubmed-8049418 2021-04-21 A statistical framework for QTL hotspot detection Wu, Po-Ya Yang, Man-Hsia Kao, Chen-Hung G3 (Bethesda) Investigation Oxford University Press 2021-02-26 /pmc/articles/PMC8049418/ /pubmed/33638985 http://dx.doi.org/10.1093/g3journal/jkab056 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
