Cargando…

Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression

BACKGROUND: Multivariate approaches have been successfully applied to genome wide association studies. Recently, a Partial Least Squares (PLS) based approach was introduced for mapping yeast genotype-phenotype relations, where background information such as gene function classification, gene dispens...

Descripción completa

Detalles Bibliográficos
Autores principales: Mehmood, Tahir, Warringer, Jonas, Snipen, Lars, Sæbø, Solve
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3598729/
https://www.ncbi.nlm.nih.gov/pubmed/23216988
http://dx.doi.org/10.1186/1471-2105-13-327
_version_ 1782262810102726656
author Mehmood, Tahir
Warringer, Jonas
Snipen, Lars
Sæbø, Solve
author_facet Mehmood, Tahir
Warringer, Jonas
Snipen, Lars
Sæbø, Solve
author_sort Mehmood, Tahir
collection PubMed
description BACKGROUND: Multivariate approaches have been successfully applied to genome wide association studies. Recently, a Partial Least Squares (PLS) based approach was introduced for mapping yeast genotype-phenotype relations, where background information such as gene function classification, gene dispensability, recent or ancient gene copy number variations and the presence of premature stop codons or frameshift mutations in reading frames, were used post hoc to explain selected genes. One of the latest advancement in PLS named L-Partial Least Squares (L-PLS), where ‘L’ presents the used data structure, enables the use of background information at the modeling level. Here, a modification of L-PLS with variable importance on projection (VIP) was implemented using a stepwise regularized procedure for gene and background information selection. Results were compared to PLS-based procedures, where no background information was used. RESULTS: Applying the proposed methodology to yeast Saccharomyces cerevisiae data, we found the relationship between genotype-phenotype to have improved understandability. Phenotypic variations were explained by the variations of relatively stable genes and stable background variations. The suggested procedure provides an automatic way for genotype-phenotype mapping. The selected phenotype influencing genes were evolving 29% faster than non-influential genes, and the current results are supported by a recently conducted study. Further power analysis on simulated data verified that the proposed methodology selects relevant variables. CONCLUSIONS: A modification of L-PLS with VIP in a stepwise regularized elimination procedure can improve the understandability and stability of selected genes and background information. The approach is recommended for genome wide association studies where background information is available.
format Online
Article
Text
id pubmed-3598729
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35987292013-03-26 Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression Mehmood, Tahir Warringer, Jonas Snipen, Lars Sæbø, Solve BMC Bioinformatics Research Article BACKGROUND: Multivariate approaches have been successfully applied to genome wide association studies. Recently, a Partial Least Squares (PLS) based approach was introduced for mapping yeast genotype-phenotype relations, where background information such as gene function classification, gene dispensability, recent or ancient gene copy number variations and the presence of premature stop codons or frameshift mutations in reading frames, were used post hoc to explain selected genes. One of the latest advancement in PLS named L-Partial Least Squares (L-PLS), where ‘L’ presents the used data structure, enables the use of background information at the modeling level. Here, a modification of L-PLS with variable importance on projection (VIP) was implemented using a stepwise regularized procedure for gene and background information selection. Results were compared to PLS-based procedures, where no background information was used. RESULTS: Applying the proposed methodology to yeast Saccharomyces cerevisiae data, we found the relationship between genotype-phenotype to have improved understandability. Phenotypic variations were explained by the variations of relatively stable genes and stable background variations. The suggested procedure provides an automatic way for genotype-phenotype mapping. The selected phenotype influencing genes were evolving 29% faster than non-influential genes, and the current results are supported by a recently conducted study. Further power analysis on simulated data verified that the proposed methodology selects relevant variables. CONCLUSIONS: A modification of L-PLS with VIP in a stepwise regularized elimination procedure can improve the understandability and stability of selected genes and background information. The approach is recommended for genome wide association studies where background information is available. BioMed Central 2012-12-08 /pmc/articles/PMC3598729/ /pubmed/23216988 http://dx.doi.org/10.1186/1471-2105-13-327 Text en Copyright ©2012 Mehmood et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mehmood, Tahir
Warringer, Jonas
Snipen, Lars
Sæbø, Solve
Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression
title Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression
title_full Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression
title_fullStr Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression
title_full_unstemmed Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression
title_short Improving stability and understandability of genotype-phenotype mapping in Saccharomyces using regularized variable selection in L-PLS regression
title_sort improving stability and understandability of genotype-phenotype mapping in saccharomyces using regularized variable selection in l-pls regression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3598729/
https://www.ncbi.nlm.nih.gov/pubmed/23216988
http://dx.doi.org/10.1186/1471-2105-13-327
work_keys_str_mv AT mehmoodtahir improvingstabilityandunderstandabilityofgenotypephenotypemappinginsaccharomycesusingregularizedvariableselectioninlplsregression
AT warringerjonas improvingstabilityandunderstandabilityofgenotypephenotypemappinginsaccharomycesusingregularizedvariableselectioninlplsregression
AT snipenlars improvingstabilityandunderstandabilityofgenotypephenotypemappinginsaccharomycesusingregularizedvariableselectioninlplsregression
AT sæbøsolve improvingstabilityandunderstandabilityofgenotypephenotypemappinginsaccharomycesusingregularizedvariableselectioninlplsregression