Cargando…

Analysis of the first genetic engineering attribution challenge

The ability to identify the designer of engineered biological sequences—termed genetic engineering attribution (GEA)—would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Eng...

Descripción completa

Detalles Bibliográficos
Autores principales: Crook, Oliver M., Warmbrod, Kelsey Lane, Lipstein, Greg, Chung, Christine, Bakerlee, Christopher W., McKelvey, T. Greg, Holland, Shelly R., Swett, Jacob L., Esvelt, Kevin M., Alley, Ethan C., Bradshaw, William J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9712580/
https://www.ncbi.nlm.nih.gov/pubmed/36450726
http://dx.doi.org/10.1038/s41467-022-35032-8
_version_ 1784841819066990592
author Crook, Oliver M.
Warmbrod, Kelsey Lane
Lipstein, Greg
Chung, Christine
Bakerlee, Christopher W.
McKelvey, T. Greg
Holland, Shelly R.
Swett, Jacob L.
Esvelt, Kevin M.
Alley, Ethan C.
Bradshaw, William J.
author_facet Crook, Oliver M.
Warmbrod, Kelsey Lane
Lipstein, Greg
Chung, Christine
Bakerlee, Christopher W.
McKelvey, T. Greg
Holland, Shelly R.
Swett, Jacob L.
Esvelt, Kevin M.
Alley, Ethan C.
Bradshaw, William J.
author_sort Crook, Oliver M.
collection PubMed
description The ability to identify the designer of engineered biological sequences—termed genetic engineering attribution (GEA)—would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Engineering Attribution Challenge, a public data-science competition to advance GEA techniques. Top-scoring teams dramatically outperformed previous models at identifying the true lab-of-origin of engineered plasmid sequences, including an increase in top-1 and top-10 accuracy of 10 percentage points. A simple ensemble of prizewinning models further increased performance. New metrics, designed to assess a model’s ability to confidently exclude candidate labs, also showed major improvements, especially for the ensemble. Most winning teams adopted CNN-based machine-learning approaches; however, one team achieved very high accuracy with an extremely fast neural-network-free approach. Future work, including future competitions, should further explore a wide diversity of approaches for bringing GEA technology into practical use.
format Online
Article
Text
id pubmed-9712580
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97125802022-12-02 Analysis of the first genetic engineering attribution challenge Crook, Oliver M. Warmbrod, Kelsey Lane Lipstein, Greg Chung, Christine Bakerlee, Christopher W. McKelvey, T. Greg Holland, Shelly R. Swett, Jacob L. Esvelt, Kevin M. Alley, Ethan C. Bradshaw, William J. Nat Commun Article The ability to identify the designer of engineered biological sequences—termed genetic engineering attribution (GEA)—would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Engineering Attribution Challenge, a public data-science competition to advance GEA techniques. Top-scoring teams dramatically outperformed previous models at identifying the true lab-of-origin of engineered plasmid sequences, including an increase in top-1 and top-10 accuracy of 10 percentage points. A simple ensemble of prizewinning models further increased performance. New metrics, designed to assess a model’s ability to confidently exclude candidate labs, also showed major improvements, especially for the ensemble. Most winning teams adopted CNN-based machine-learning approaches; however, one team achieved very high accuracy with an extremely fast neural-network-free approach. Future work, including future competitions, should further explore a wide diversity of approaches for bringing GEA technology into practical use. Nature Publishing Group UK 2022-11-30 /pmc/articles/PMC9712580/ /pubmed/36450726 http://dx.doi.org/10.1038/s41467-022-35032-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Crook, Oliver M.
Warmbrod, Kelsey Lane
Lipstein, Greg
Chung, Christine
Bakerlee, Christopher W.
McKelvey, T. Greg
Holland, Shelly R.
Swett, Jacob L.
Esvelt, Kevin M.
Alley, Ethan C.
Bradshaw, William J.
Analysis of the first genetic engineering attribution challenge
title Analysis of the first genetic engineering attribution challenge
title_full Analysis of the first genetic engineering attribution challenge
title_fullStr Analysis of the first genetic engineering attribution challenge
title_full_unstemmed Analysis of the first genetic engineering attribution challenge
title_short Analysis of the first genetic engineering attribution challenge
title_sort analysis of the first genetic engineering attribution challenge
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9712580/
https://www.ncbi.nlm.nih.gov/pubmed/36450726
http://dx.doi.org/10.1038/s41467-022-35032-8
work_keys_str_mv AT crookoliverm analysisofthefirstgeneticengineeringattributionchallenge
AT warmbrodkelseylane analysisofthefirstgeneticengineeringattributionchallenge
AT lipsteingreg analysisofthefirstgeneticengineeringattributionchallenge
AT chungchristine analysisofthefirstgeneticengineeringattributionchallenge
AT bakerleechristopherw analysisofthefirstgeneticengineeringattributionchallenge
AT mckelveytgreg analysisofthefirstgeneticengineeringattributionchallenge
AT hollandshellyr analysisofthefirstgeneticengineeringattributionchallenge
AT swettjacobl analysisofthefirstgeneticengineeringattributionchallenge
AT esveltkevinm analysisofthefirstgeneticengineeringattributionchallenge
AT alleyethanc analysisofthefirstgeneticengineeringattributionchallenge
AT bradshawwilliamj analysisofthefirstgeneticengineeringattributionchallenge