Cargando…

MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures

Tandem mass tag (TMT) is a multiplexing technology widely-used in proteomic research. It enables relative quantification of proteins from multiple biological samples in a single MS run with high efficiency and high throughput. However, experiments often require more biological replicates or conditio...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Ting, Choi, Meena, Tzouros, Manuel, Golling, Sabrina, Pandya, Nikhil Janak, Banfai, Balazs, Dunkley, Tom, Vitek, Olga
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Biochemistry and Molecular Biology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015007/
https://www.ncbi.nlm.nih.gov/pubmed/32680918
http://dx.doi.org/10.1074/mcp.RA120.002105
_version_ 1783673598025662464
author Huang, Ting
Choi, Meena
Tzouros, Manuel
Golling, Sabrina
Pandya, Nikhil Janak
Banfai, Balazs
Dunkley, Tom
Vitek, Olga
author_facet Huang, Ting
Choi, Meena
Tzouros, Manuel
Golling, Sabrina
Pandya, Nikhil Janak
Banfai, Balazs
Dunkley, Tom
Vitek, Olga
author_sort Huang, Ting
collection PubMed
description Tandem mass tag (TMT) is a multiplexing technology widely-used in proteomic research. It enables relative quantification of proteins from multiple biological samples in a single MS run with high efficiency and high throughput. However, experiments often require more biological replicates or conditions than can be accommodated by a single run, and involve multiple TMT mixtures and multiple runs. Such larger-scale experiments combine sources of biological and technical variation in patterns that are complex, unique to TMT-based workflows, and challenging for the downstream statistical analysis. These patterns cannot be adequately characterized by statistical methods designed for other technologies, such as label-free proteomics or transcriptomics. This manuscript proposes a general statistical approach for relative protein quantification in MS- based experiments with TMT labeling. It is applicable to experiments with multiple conditions, multiple biological replicate runs and multiple technical replicate runs, and unbalanced designs. It is based on a flexible family of linear mixed-effects models that handle complex patterns of technical artifacts and missing values. The approach is implemented in MSstatsTMT, a freely available open-source R/Bioconductor package compatible with data processing tools such as Proteome Discoverer, MaxQuant, OpenMS, and SpectroMine. Evaluation on a controlled mixture, simulated datasets, and three biological investigations with diverse designs demonstrated that MSstatsTMT balanced the sensitivity and the specificity of detecting differentially abundant proteins, in large-scale experiments with multiple biological mixtures.
format Online
Article
Text
id pubmed-8015007
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-80150072021-04-12 MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures Huang, Ting Choi, Meena Tzouros, Manuel Golling, Sabrina Pandya, Nikhil Janak Banfai, Balazs Dunkley, Tom Vitek, Olga Mol Cell Proteomics Technological Innovation and Resources Tandem mass tag (TMT) is a multiplexing technology widely-used in proteomic research. It enables relative quantification of proteins from multiple biological samples in a single MS run with high efficiency and high throughput. However, experiments often require more biological replicates or conditions than can be accommodated by a single run, and involve multiple TMT mixtures and multiple runs. Such larger-scale experiments combine sources of biological and technical variation in patterns that are complex, unique to TMT-based workflows, and challenging for the downstream statistical analysis. These patterns cannot be adequately characterized by statistical methods designed for other technologies, such as label-free proteomics or transcriptomics. This manuscript proposes a general statistical approach for relative protein quantification in MS- based experiments with TMT labeling. It is applicable to experiments with multiple conditions, multiple biological replicate runs and multiple technical replicate runs, and unbalanced designs. It is based on a flexible family of linear mixed-effects models that handle complex patterns of technical artifacts and missing values. The approach is implemented in MSstatsTMT, a freely available open-source R/Bioconductor package compatible with data processing tools such as Proteome Discoverer, MaxQuant, OpenMS, and SpectroMine. Evaluation on a controlled mixture, simulated datasets, and three biological investigations with diverse designs demonstrated that MSstatsTMT balanced the sensitivity and the specificity of detecting differentially abundant proteins, in large-scale experiments with multiple biological mixtures. American Society for Biochemistry and Molecular Biology 2020-11-25 /pmc/articles/PMC8015007/ /pubmed/32680918 http://dx.doi.org/10.1074/mcp.RA120.002105 Text en © 2020 © 2020 Huang et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Technological Innovation and Resources
Huang, Ting
Choi, Meena
Tzouros, Manuel
Golling, Sabrina
Pandya, Nikhil Janak
Banfai, Balazs
Dunkley, Tom
Vitek, Olga
MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures
title MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures
title_full MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures
title_fullStr MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures
title_full_unstemmed MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures
title_short MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures
title_sort msstatstmt: statistical detection of differentially abundant proteins in experiments with isobaric labeling and multiple mixtures
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015007/
https://www.ncbi.nlm.nih.gov/pubmed/32680918
http://dx.doi.org/10.1074/mcp.RA120.002105
work_keys_str_mv AT huangting msstatstmtstatisticaldetectionofdifferentiallyabundantproteinsinexperimentswithisobariclabelingandmultiplemixtures
AT choimeena msstatstmtstatisticaldetectionofdifferentiallyabundantproteinsinexperimentswithisobariclabelingandmultiplemixtures
AT tzourosmanuel msstatstmtstatisticaldetectionofdifferentiallyabundantproteinsinexperimentswithisobariclabelingandmultiplemixtures
AT gollingsabrina msstatstmtstatisticaldetectionofdifferentiallyabundantproteinsinexperimentswithisobariclabelingandmultiplemixtures
AT pandyanikhiljanak msstatstmtstatisticaldetectionofdifferentiallyabundantproteinsinexperimentswithisobariclabelingandmultiplemixtures
AT banfaibalazs msstatstmtstatisticaldetectionofdifferentiallyabundantproteinsinexperimentswithisobariclabelingandmultiplemixtures
AT dunkleytom msstatstmtstatisticaldetectionofdifferentiallyabundantproteinsinexperimentswithisobariclabelingandmultiplemixtures
AT vitekolga msstatstmtstatisticaldetectionofdifferentiallyabundantproteinsinexperimentswithisobariclabelingandmultiplemixtures