Incorporation of a unified protein abundance dataset into the Saccharomyces genome database

The identification and accurate quantitation of protein abundance has been a major objective of proteomics research. Abundance studies have the potential to provide users with data that can be used to gain a deeper understanding of protein function and regulation and can also help identify cellular...

Bibliographic Details
Main Authors: Nash, Robert S; Weng, Shuai; Karra, Kalpana; Wong, Edith D; Engel, Stacia R; Cherry, J Michael
Format: Online Article Text
Language: English
Published: Oxford University Press 2020
Subjects: Database Update
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7054198/
https://www.ncbi.nlm.nih.gov/pubmed/32128557
http://dx.doi.org/10.1093/database/baaa008
author Nash, Robert S
Weng, Shuai
Karra, Kalpana
Wong, Edith D
Engel, Stacia R
Cherry, J Michael
collection PubMed
description The identification and accurate quantitation of protein abundance has been a major objective of proteomics research. Abundance studies have the potential to provide users with data that can be used to gain a deeper understanding of protein function and regulation and can also help identify cellular pathways and modules that operate under various environmental stress conditions. One of the central missions of the Saccharomyces Genome Database (SGD; https://www.yeastgenome.org) is to work with researchers to identify and incorporate datasets of interest to the wider scientific community, thereby enabling hypothesis-driven research. A large number of studies have detailed efforts to generate proteome-wide abundance data, but deeper analyses of these data have been hampered by the inability to compare results between studies. Recently, a unified protein abundance dataset was generated through the evaluation of more than 20 abundance datasets, which were normalized and converted to common measurement units, in this case molecules per cell. We have incorporated these normalized protein abundance data and associated metadata into the SGD database, as well as the SGD YeastMine data warehouse, resulting in the addition of 56 487 values for untreated cells grown in either rich or defined media and 28 335 values for cells treated with environmental stressors. Abundance data for protein-coding genes are displayed in a sortable, filterable table on Protein pages, available through Locus Summary pages. A median abundance value was incorporated, and a median absolute deviation was calculated for each protein-coding gene and incorporated into SGD. These values are displayed in the Protein section of the Locus Summary page. The inclusion of these data has enhanced the quality and quantity of protein experimental information presented at SGD and provides opportunities for researchers to access and utilize the data to further their research.
format Online
Article
Text
id pubmed-7054198
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-7054198 2020-03-09 Database (Oxford) Database Update Oxford University Press 2020-03-04 /pmc/articles/PMC7054198/ /pubmed/32128557 http://dx.doi.org/10.1093/database/baaa008 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Incorporation of a unified protein abundance dataset into the Saccharomyces genome database
topic Database Update
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7054198/
https://www.ncbi.nlm.nih.gov/pubmed/32128557
http://dx.doi.org/10.1093/database/baaa008