Cargando…

A statistical framework for analyzing deep mutational scanning data

Deep mutational scanning is a widely used method for multiplex measurement of functional consequences of protein variants. We developed a new deep mutational scanning statistical model that generates error estimates for each measurement, capturing both sampling error and consistency between replicat...

Descripción completa

Detalles Bibliográficos
Autores principales: Rubin, Alan F., Gelman, Hannah, Lucas, Nathan, Bajjalieh, Sandra M., Papenfuss, Anthony T., Speed, Terence P., Fowler, Douglas M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5547491/
https://www.ncbi.nlm.nih.gov/pubmed/28784151
http://dx.doi.org/10.1186/s13059-017-1272-5
_version_ 1783255702021603328
author Rubin, Alan F.
Gelman, Hannah
Lucas, Nathan
Bajjalieh, Sandra M.
Papenfuss, Anthony T.
Speed, Terence P.
Fowler, Douglas M.
author_facet Rubin, Alan F.
Gelman, Hannah
Lucas, Nathan
Bajjalieh, Sandra M.
Papenfuss, Anthony T.
Speed, Terence P.
Fowler, Douglas M.
author_sort Rubin, Alan F.
collection PubMed
description Deep mutational scanning is a widely used method for multiplex measurement of functional consequences of protein variants. We developed a new deep mutational scanning statistical model that generates error estimates for each measurement, capturing both sampling error and consistency between replicates. We apply our model to one novel and five published datasets comprising 243,732 variants and demonstrate its superiority in removing noisy variants and conducting hypothesis testing. Simulations show our model applies to scans based on cell growth or binding and handles common experimental errors. We implemented our model in Enrich2, software that can empower researchers analyzing deep mutational scanning data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1272-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5547491
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55474912017-08-09 A statistical framework for analyzing deep mutational scanning data Rubin, Alan F. Gelman, Hannah Lucas, Nathan Bajjalieh, Sandra M. Papenfuss, Anthony T. Speed, Terence P. Fowler, Douglas M. Genome Biol Method Deep mutational scanning is a widely used method for multiplex measurement of functional consequences of protein variants. We developed a new deep mutational scanning statistical model that generates error estimates for each measurement, capturing both sampling error and consistency between replicates. We apply our model to one novel and five published datasets comprising 243,732 variants and demonstrate its superiority in removing noisy variants and conducting hypothesis testing. Simulations show our model applies to scans based on cell growth or binding and handles common experimental errors. We implemented our model in Enrich2, software that can empower researchers analyzing deep mutational scanning data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1272-5) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-07 /pmc/articles/PMC5547491/ /pubmed/28784151 http://dx.doi.org/10.1186/s13059-017-1272-5 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Method
Rubin, Alan F.
Gelman, Hannah
Lucas, Nathan
Bajjalieh, Sandra M.
Papenfuss, Anthony T.
Speed, Terence P.
Fowler, Douglas M.
A statistical framework for analyzing deep mutational scanning data
title A statistical framework for analyzing deep mutational scanning data
title_full A statistical framework for analyzing deep mutational scanning data
title_fullStr A statistical framework for analyzing deep mutational scanning data
title_full_unstemmed A statistical framework for analyzing deep mutational scanning data
title_short A statistical framework for analyzing deep mutational scanning data
title_sort statistical framework for analyzing deep mutational scanning data
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5547491/
https://www.ncbi.nlm.nih.gov/pubmed/28784151
http://dx.doi.org/10.1186/s13059-017-1272-5
work_keys_str_mv AT rubinalanf astatisticalframeworkforanalyzingdeepmutationalscanningdata
AT gelmanhannah astatisticalframeworkforanalyzingdeepmutationalscanningdata
AT lucasnathan astatisticalframeworkforanalyzingdeepmutationalscanningdata
AT bajjaliehsandram astatisticalframeworkforanalyzingdeepmutationalscanningdata
AT papenfussanthonyt astatisticalframeworkforanalyzingdeepmutationalscanningdata
AT speedterencep astatisticalframeworkforanalyzingdeepmutationalscanningdata
AT fowlerdouglasm astatisticalframeworkforanalyzingdeepmutationalscanningdata
AT rubinalanf statisticalframeworkforanalyzingdeepmutationalscanningdata
AT gelmanhannah statisticalframeworkforanalyzingdeepmutationalscanningdata
AT lucasnathan statisticalframeworkforanalyzingdeepmutationalscanningdata
AT bajjaliehsandram statisticalframeworkforanalyzingdeepmutationalscanningdata
AT papenfussanthonyt statisticalframeworkforanalyzingdeepmutationalscanningdata
AT speedterencep statisticalframeworkforanalyzingdeepmutationalscanningdata
AT fowlerdouglasm statisticalframeworkforanalyzingdeepmutationalscanningdata