Cargando…
Protein Database and Quantitative Analysis Considerations when Integrating Genetics and Proteomics to Compare Mouse Strains
[Image: see text] Decades of genetics research comparing mouse strains has identified many regions of the genome associated with quantitative traits. Microarrays have been used to identify which genes in those regions are differentially expressed and are therefore potentially causal; however, geneti...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2011
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3128464/ https://www.ncbi.nlm.nih.gov/pubmed/21553863 http://dx.doi.org/10.1021/pr200133p |
_version_ | 1782207443303923712 |
---|---|
author | Fei, Suzanne S. Wilmarth, Phillip A. Hitzemann, Robert J. McWeeney, Shannon K. Belknap, John K. David, Larry L. |
author_facet | Fei, Suzanne S. Wilmarth, Phillip A. Hitzemann, Robert J. McWeeney, Shannon K. Belknap, John K. David, Larry L. |
author_sort | Fei, Suzanne S. |
collection | PubMed |
description | [Image: see text] Decades of genetics research comparing mouse strains has identified many regions of the genome associated with quantitative traits. Microarrays have been used to identify which genes in those regions are differentially expressed and are therefore potentially causal; however, genetic variants that affect probe hybridization lead to many false conclusions. Here we used spectral counting to compare brain striata between two mouse strains. Using strain-specific protein databases, we concluded that proteomics was more robust to sequence differences than microarrays; however, some proteins were still significantly affected. To generate strain-specific databases, we used a complete database that contained all of the putative genetic isoforms for each protein. While the increased proteome coverage in the databases led to a 6.8% gain in peptide assignments compared to a nonredundant database, it also necessitated the development of a strategy for grouping similar proteins due to a large number of shared peptides. Of the 4563 identified proteins (2.1% FDR), there were 1807 quantifiable proteins/groups that exceeded minimum count cutoffs. With four pooled biological replicates per strain, we used quantile normalization, ComBat (a package that adjusts for batch effects), and edgeR (a package for differential expression analysis of count data) to identify 101 differentially expressed proteins/groups, 84 of which had a coding region within one of the genomic regions of interest identified by the Portland Alcohol Research Center. |
format | Online Article Text |
id | pubmed-3128464 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-31284642011-07-01 Protein Database and Quantitative Analysis Considerations when Integrating Genetics and Proteomics to Compare Mouse Strains Fei, Suzanne S. Wilmarth, Phillip A. Hitzemann, Robert J. McWeeney, Shannon K. Belknap, John K. David, Larry L. J Proteome Res [Image: see text] Decades of genetics research comparing mouse strains has identified many regions of the genome associated with quantitative traits. Microarrays have been used to identify which genes in those regions are differentially expressed and are therefore potentially causal; however, genetic variants that affect probe hybridization lead to many false conclusions. Here we used spectral counting to compare brain striata between two mouse strains. Using strain-specific protein databases, we concluded that proteomics was more robust to sequence differences than microarrays; however, some proteins were still significantly affected. To generate strain-specific databases, we used a complete database that contained all of the putative genetic isoforms for each protein. While the increased proteome coverage in the databases led to a 6.8% gain in peptide assignments compared to a nonredundant database, it also necessitated the development of a strategy for grouping similar proteins due to a large number of shared peptides. Of the 4563 identified proteins (2.1% FDR), there were 1807 quantifiable proteins/groups that exceeded minimum count cutoffs. With four pooled biological replicates per strain, we used quantile normalization, ComBat (a package that adjusts for batch effects), and edgeR (a package for differential expression analysis of count data) to identify 101 differentially expressed proteins/groups, 84 of which had a coding region within one of the genomic regions of interest identified by the Portland Alcohol Research Center. American Chemical Society 2011-05-09 2011-07-01 /pmc/articles/PMC3128464/ /pubmed/21553863 http://dx.doi.org/10.1021/pr200133p Text en Copyright © 2011 American Chemical Society http://pubs.acs.org This is an open-access article distributed under the ACS AuthorChoice Terms & Conditions. Any use of this article, must conform to the terms of that license which are available at http://pubs.acs.org. |
spellingShingle | Fei, Suzanne S. Wilmarth, Phillip A. Hitzemann, Robert J. McWeeney, Shannon K. Belknap, John K. David, Larry L. Protein Database and Quantitative Analysis Considerations when Integrating Genetics and Proteomics to Compare Mouse Strains |
title | Protein Database and Quantitative Analysis Considerations when Integrating Genetics and Proteomics to Compare Mouse Strains |
title_full | Protein Database and Quantitative Analysis Considerations when Integrating Genetics and Proteomics to Compare Mouse Strains |
title_fullStr | Protein Database and Quantitative Analysis Considerations when Integrating Genetics and Proteomics to Compare Mouse Strains |
title_full_unstemmed | Protein Database and Quantitative Analysis Considerations when Integrating Genetics and Proteomics to Compare Mouse Strains |
title_short | Protein Database and Quantitative Analysis Considerations when Integrating Genetics and Proteomics to Compare Mouse Strains |
title_sort | protein database and quantitative analysis considerations when integrating genetics and proteomics to compare mouse strains |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3128464/ https://www.ncbi.nlm.nih.gov/pubmed/21553863 http://dx.doi.org/10.1021/pr200133p |
work_keys_str_mv | AT feisuzannes proteindatabaseandquantitativeanalysisconsiderationswhenintegratinggeneticsandproteomicstocomparemousestrains AT wilmarthphillipa proteindatabaseandquantitativeanalysisconsiderationswhenintegratinggeneticsandproteomicstocomparemousestrains AT hitzemannrobertj proteindatabaseandquantitativeanalysisconsiderationswhenintegratinggeneticsandproteomicstocomparemousestrains AT mcweeneyshannonk proteindatabaseandquantitativeanalysisconsiderationswhenintegratinggeneticsandproteomicstocomparemousestrains AT belknapjohnk proteindatabaseandquantitativeanalysisconsiderationswhenintegratinggeneticsandproteomicstocomparemousestrains AT davidlarryl proteindatabaseandquantitativeanalysisconsiderationswhenintegratinggeneticsandproteomicstocomparemousestrains |