Cargando…
Mining for genotype-phenotype relations in Saccharomyces using partial least squares
BACKGROUND: Multivariate approaches are important due to their versatility and applications in many fields as it provides decisive advantages over univariate analysis in many ways. Genome wide association studies are rapidly emerging, but approaches in hand pay less attention to multivariate relatio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3175482/ https://www.ncbi.nlm.nih.gov/pubmed/21812956 http://dx.doi.org/10.1186/1471-2105-12-318 |
_version_ | 1782212161756463104 |
---|---|
author | Mehmood, Tahir Martens, Harald Sæbø, Solve Warringer, Jonas Snipen, Lars |
author_facet | Mehmood, Tahir Martens, Harald Sæbø, Solve Warringer, Jonas Snipen, Lars |
author_sort | Mehmood, Tahir |
collection | PubMed |
description | BACKGROUND: Multivariate approaches are important due to their versatility and applications in many fields as it provides decisive advantages over univariate analysis in many ways. Genome wide association studies are rapidly emerging, but approaches in hand pay less attention to multivariate relation between genotype and phenotype. We introduce a methodology based on a BLAST approach for extracting information from genomic sequences and Soft- Thresholding Partial Least Squares (ST-PLS) for mapping genotype-phenotype relations. RESULTS: Applying this methodology to an extensive data set for the model yeast Saccharomyces cerevisiae, we found that the relationship between genotype-phenotype involves surprisingly few genes in the sense that an overwhelmingly large fraction of the phenotypic variation can be explained by variation in less than 1% of the full gene reference set containing 5791 genes. These phenotype influencing genes were evolving 20% faster than non-influential genes and were unevenly distributed over cellular functions, with strong enrichments in functions such as cellular respiration and transposition. These genes were also enriched with known paralogs, stop codon variations and copy number variations, suggesting that such molecular adjustments have had a disproportionate influence on Saccharomyces yeasts recent adaptation to environmental changes in its ecological niche. CONCLUSIONS: BLAST and PLS based multivariate approach derived results that adhere to the known yeast phylogeny and gene ontology and thus verify that the methodology extracts a set of fast evolving genes that capture the phylogeny of the yeast strains. The approach is worth pursuing, and future investigations should be made to improve the computations of genotype signals as well as variable selection procedure within the PLS framework. |
format | Online Article Text |
id | pubmed-3175482 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31754822011-09-19 Mining for genotype-phenotype relations in Saccharomyces using partial least squares Mehmood, Tahir Martens, Harald Sæbø, Solve Warringer, Jonas Snipen, Lars BMC Bioinformatics Research Article BACKGROUND: Multivariate approaches are important due to their versatility and applications in many fields as it provides decisive advantages over univariate analysis in many ways. Genome wide association studies are rapidly emerging, but approaches in hand pay less attention to multivariate relation between genotype and phenotype. We introduce a methodology based on a BLAST approach for extracting information from genomic sequences and Soft- Thresholding Partial Least Squares (ST-PLS) for mapping genotype-phenotype relations. RESULTS: Applying this methodology to an extensive data set for the model yeast Saccharomyces cerevisiae, we found that the relationship between genotype-phenotype involves surprisingly few genes in the sense that an overwhelmingly large fraction of the phenotypic variation can be explained by variation in less than 1% of the full gene reference set containing 5791 genes. These phenotype influencing genes were evolving 20% faster than non-influential genes and were unevenly distributed over cellular functions, with strong enrichments in functions such as cellular respiration and transposition. These genes were also enriched with known paralogs, stop codon variations and copy number variations, suggesting that such molecular adjustments have had a disproportionate influence on Saccharomyces yeasts recent adaptation to environmental changes in its ecological niche. CONCLUSIONS: BLAST and PLS based multivariate approach derived results that adhere to the known yeast phylogeny and gene ontology and thus verify that the methodology extracts a set of fast evolving genes that capture the phylogeny of the yeast strains. The approach is worth pursuing, and future investigations should be made to improve the computations of genotype signals as well as variable selection procedure within the PLS framework. BioMed Central 2011-08-03 /pmc/articles/PMC3175482/ /pubmed/21812956 http://dx.doi.org/10.1186/1471-2105-12-318 Text en Copyright ©2011 Mehmood et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Mehmood, Tahir Martens, Harald Sæbø, Solve Warringer, Jonas Snipen, Lars Mining for genotype-phenotype relations in Saccharomyces using partial least squares |
title | Mining for genotype-phenotype relations in Saccharomyces using partial least squares |
title_full | Mining for genotype-phenotype relations in Saccharomyces using partial least squares |
title_fullStr | Mining for genotype-phenotype relations in Saccharomyces using partial least squares |
title_full_unstemmed | Mining for genotype-phenotype relations in Saccharomyces using partial least squares |
title_short | Mining for genotype-phenotype relations in Saccharomyces using partial least squares |
title_sort | mining for genotype-phenotype relations in saccharomyces using partial least squares |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3175482/ https://www.ncbi.nlm.nih.gov/pubmed/21812956 http://dx.doi.org/10.1186/1471-2105-12-318 |
work_keys_str_mv | AT mehmoodtahir miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares AT martensharald miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares AT sæbøsolve miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares AT warringerjonas miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares AT snipenlars miningforgenotypephenotyperelationsinsaccharomycesusingpartialleastsquares |