Cargando…
Analysis of the first genetic engineering attribution challenge
The ability to identify the designer of engineered biological sequences—termed genetic engineering attribution (GEA)—would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Eng...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9712580/ https://www.ncbi.nlm.nih.gov/pubmed/36450726 http://dx.doi.org/10.1038/s41467-022-35032-8 |
_version_ | 1784841819066990592 |
---|---|
author | Crook, Oliver M. Warmbrod, Kelsey Lane Lipstein, Greg Chung, Christine Bakerlee, Christopher W. McKelvey, T. Greg Holland, Shelly R. Swett, Jacob L. Esvelt, Kevin M. Alley, Ethan C. Bradshaw, William J. |
author_facet | Crook, Oliver M. Warmbrod, Kelsey Lane Lipstein, Greg Chung, Christine Bakerlee, Christopher W. McKelvey, T. Greg Holland, Shelly R. Swett, Jacob L. Esvelt, Kevin M. Alley, Ethan C. Bradshaw, William J. |
author_sort | Crook, Oliver M. |
collection | PubMed |
description | The ability to identify the designer of engineered biological sequences—termed genetic engineering attribution (GEA)—would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Engineering Attribution Challenge, a public data-science competition to advance GEA techniques. Top-scoring teams dramatically outperformed previous models at identifying the true lab-of-origin of engineered plasmid sequences, including an increase in top-1 and top-10 accuracy of 10 percentage points. A simple ensemble of prizewinning models further increased performance. New metrics, designed to assess a model’s ability to confidently exclude candidate labs, also showed major improvements, especially for the ensemble. Most winning teams adopted CNN-based machine-learning approaches; however, one team achieved very high accuracy with an extremely fast neural-network-free approach. Future work, including future competitions, should further explore a wide diversity of approaches for bringing GEA technology into practical use. |
format | Online Article Text |
id | pubmed-9712580 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-97125802022-12-02 Analysis of the first genetic engineering attribution challenge Crook, Oliver M. Warmbrod, Kelsey Lane Lipstein, Greg Chung, Christine Bakerlee, Christopher W. McKelvey, T. Greg Holland, Shelly R. Swett, Jacob L. Esvelt, Kevin M. Alley, Ethan C. Bradshaw, William J. Nat Commun Article The ability to identify the designer of engineered biological sequences—termed genetic engineering attribution (GEA)—would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Engineering Attribution Challenge, a public data-science competition to advance GEA techniques. Top-scoring teams dramatically outperformed previous models at identifying the true lab-of-origin of engineered plasmid sequences, including an increase in top-1 and top-10 accuracy of 10 percentage points. A simple ensemble of prizewinning models further increased performance. New metrics, designed to assess a model’s ability to confidently exclude candidate labs, also showed major improvements, especially for the ensemble. Most winning teams adopted CNN-based machine-learning approaches; however, one team achieved very high accuracy with an extremely fast neural-network-free approach. Future work, including future competitions, should further explore a wide diversity of approaches for bringing GEA technology into practical use. Nature Publishing Group UK 2022-11-30 /pmc/articles/PMC9712580/ /pubmed/36450726 http://dx.doi.org/10.1038/s41467-022-35032-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Crook, Oliver M. Warmbrod, Kelsey Lane Lipstein, Greg Chung, Christine Bakerlee, Christopher W. McKelvey, T. Greg Holland, Shelly R. Swett, Jacob L. Esvelt, Kevin M. Alley, Ethan C. Bradshaw, William J. Analysis of the first genetic engineering attribution challenge |
title | Analysis of the first genetic engineering attribution challenge |
title_full | Analysis of the first genetic engineering attribution challenge |
title_fullStr | Analysis of the first genetic engineering attribution challenge |
title_full_unstemmed | Analysis of the first genetic engineering attribution challenge |
title_short | Analysis of the first genetic engineering attribution challenge |
title_sort | analysis of the first genetic engineering attribution challenge |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9712580/ https://www.ncbi.nlm.nih.gov/pubmed/36450726 http://dx.doi.org/10.1038/s41467-022-35032-8 |
work_keys_str_mv | AT crookoliverm analysisofthefirstgeneticengineeringattributionchallenge AT warmbrodkelseylane analysisofthefirstgeneticengineeringattributionchallenge AT lipsteingreg analysisofthefirstgeneticengineeringattributionchallenge AT chungchristine analysisofthefirstgeneticengineeringattributionchallenge AT bakerleechristopherw analysisofthefirstgeneticengineeringattributionchallenge AT mckelveytgreg analysisofthefirstgeneticengineeringattributionchallenge AT hollandshellyr analysisofthefirstgeneticengineeringattributionchallenge AT swettjacobl analysisofthefirstgeneticengineeringattributionchallenge AT esveltkevinm analysisofthefirstgeneticengineeringattributionchallenge AT alleyethanc analysisofthefirstgeneticengineeringattributionchallenge AT bradshawwilliamj analysisofthefirstgeneticengineeringattributionchallenge |