Cargando…

Mining for genotype-phenotype relations in Saccharomyces using partial least squares

BACKGROUND: Multivariate approaches are important due to their versatility and applications in many fields as it provides decisive advantages over univariate analysis in many ways. Genome wide association studies are rapidly emerging, but approaches in hand pay less attention to multivariate relatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Mehmood, Tahir, Martens, Harald, Sæbø, Solve, Warringer, Jonas, Snipen, Lars
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3175482/
https://www.ncbi.nlm.nih.gov/pubmed/21812956
http://dx.doi.org/10.1186/1471-2105-12-318
_version_ 1782212161756463104
author Mehmood, Tahir
Martens, Harald
Sæbø, Solve
Warringer, Jonas
Snipen, Lars
author_facet Mehmood, Tahir
Martens, Harald
Sæbø, Solve
Warringer, Jonas
Snipen, Lars
author_sort Mehmood, Tahir
collection PubMed
description BACKGROUND: Multivariate approaches are important due to their versatility and applications in many fields as it provides decisive advantages over univariate analysis in many ways. Genome wide association studies are rapidly emerging, but approaches in hand pay less attention to multivariate relation between genotype and phenotype. We introduce a methodology based on a BLAST approach for extracting information from genomic sequences and Soft- Thresholding Partial Least Squares (ST-PLS) for mapping genotype-phenotype relations. RESULTS: Applying this methodology to an extensive data set for the model yeast Saccharomyces cerevisiae, we found that the relationship between genotype-phenotype involves surprisingly few genes in the sense that an overwhelmingly large fraction of the phenotypic variation can be explained by variation in less than 1% of the full gene reference set containing 5791 genes. These phenotype influencing genes were evolving 20% faster than non-influential genes and were unevenly distributed over cellular functions, with strong enrichments in functions such as cellular respiration and transposition. These genes were also enriched with known paralogs, stop codon variations and copy number variations, suggesting that such molecular adjustments have had a disproportionate influence on Saccharomyces yeasts recent adaptation to environmental changes in its ecological niche. CONCLUSIONS: BLAST and PLS based multivariate approach derived results that adhere to the known yeast phylogeny and gene ontology and thus verify that the methodology extracts a set of fast evolving genes that capture the phylogeny of the yeast strains. The approach is worth pursuing, and future investigations should be made to improve the computations of genotype signals as well as variable selection procedure within the PLS framework.
format Online
Article
Text
id pubmed-3175482
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31754822011-09-19 Mining for genotype-phenotype relations in Saccharomyces using partial least squares Mehmood, Tahir Martens, Harald Sæbø, Solve Warringer, Jonas Snipen, Lars BMC Bioinformatics Research Article BACKGROUND: Multivariate approaches are important due to their versatility and applications in many fields as it provides decisive advantages over univariate analysis in many ways. Genome wide association studies are rapidly emerging, but approaches in hand pay less attention to multivariate relation between genotype and phenotype. We introduce a methodology based on a BLAST approach for extracting information from genomic sequences and Soft- Thresholding Partial Least Squares (ST-PLS) for mapping genotype-phenotype relations. RESULTS: Applying this methodology to an extensive data set for the model yeast Saccharomyces cerevisiae, we found that the relationship between genotype-phenotype involves surprisingly few genes in the sense that an overwhelmingly large fraction of the phenotypic variation can be explained by variation in less than 1% of the full gene reference set containing 5791 genes. These phenotype influencing genes were evolving 20% faster than non-influential genes and were unevenly distributed over cellular functions, with strong enrichments in functions such as cellular respiration and transposition. These genes were also enriched with known paralogs, stop codon variations and copy number variations, suggesting that such molecular adjustments have had a disproportionate influence on Saccharomyces yeasts recent adaptation to environmental changes in its ecological niche. CONCLUSIONS: BLAST and PLS based multivariate approach derived results that adhere to the known yeast phylogeny and gene ontology and thus verify that the methodology extracts a set of fast evolving genes that capture the phylogeny of the yeast strains. The approach is worth pursuing, and future investigations should be made to improve the computations of genotype signals as well as variable selection procedure within the PLS framework. BioMed Central 2011-08-03 /pmc/articles/PMC3175482/ /pubmed/21812956 http://dx.doi.org/10.1186/1471-2105-12-318 Text en Copyright ©2011 Mehmood et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mehmood, Tahir
Martens, Harald
Sæbø, Solve
Warringer, Jonas
Snipen, Lars
Mining for genotype-phenotype relations in Saccharomyces using partial least squares
title Mining for genotype-phenotype relations in Saccharomyces using partial least squares
title_full Mining for genotype-phenotype relations in Saccharomyces using partial least squares
title_fullStr Mining for genotype-phenotype relations in Saccharomyces using partial least squares
title_full_unstemmed Mining for genotype-phenotype relations in Saccharomyces using partial least squares
title_short Mining for genotype-phenotype relations in Saccharomyces using partial least squares
title_sort mining for genotype-phenotype relations in saccharomyces using partial least squares
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3175482/
https://www.ncbi.nlm.nih.gov/pubmed/21812956
http://dx.doi.org/10.1186/1471-2105-12-318
work_keys_str_mv AT mehmoodtahir miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares
AT martensharald miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares
AT sæbøsolve miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares
AT warringerjonas miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares
AT snipenlars miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares