Cargando…
Artificial Intelligence Approach for Variant Reporting
PURPOSE: Next-generation sequencing technologies are actively applied in clinical oncology. Bioinformatics pipeline analysis is an integral part of this process; however, humans cannot yet realize the full potential of the highly complex pipeline output. As a result, the decision to include a varian...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society of Clinical Oncology
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198661/ https://www.ncbi.nlm.nih.gov/pubmed/30364844 http://dx.doi.org/10.1200/CCI.16.00079 |
_version_ | 1783365005814530048 |
---|---|
author | Zomnir, Michael G. Lipkin, Lev Pacula, Maciej Dominguez Meneses, Enrique MacLeay, Allison Duraisamy, Sekhar Nadhamuni, Nishchal Al Turki, Saeed H. Zheng, Zongli Rivera, Miguel Nardi, Valentina Dias-Santagata, Dora Iafrate, A. John Le, Long P. Lennerz, Jochen K. |
author_facet | Zomnir, Michael G. Lipkin, Lev Pacula, Maciej Dominguez Meneses, Enrique MacLeay, Allison Duraisamy, Sekhar Nadhamuni, Nishchal Al Turki, Saeed H. Zheng, Zongli Rivera, Miguel Nardi, Valentina Dias-Santagata, Dora Iafrate, A. John Le, Long P. Lennerz, Jochen K. |
author_sort | Zomnir, Michael G. |
collection | PubMed |
description | PURPOSE: Next-generation sequencing technologies are actively applied in clinical oncology. Bioinformatics pipeline analysis is an integral part of this process; however, humans cannot yet realize the full potential of the highly complex pipeline output. As a result, the decision to include a variant in the final report during routine clinical sign-out remains challenging. METHODS: We used an artificial intelligence approach to capture the collective clinical sign-out experience of six board-certified molecular pathologists to build and validate a decision support tool for variant reporting. We extracted all reviewed and reported variants from our clinical database and tested several machine learning models. We used 10-fold cross-validation for our variant call prediction model, which derives a contiguous prediction score from 0 to 1 (no to yes) for clinical reporting. RESULTS: For each of the 19,594 initial training variants, our pipeline generates approximately 500 features, which results in a matrix of > 9 million data points. From a comparison of naive Bayes, decision trees, random forests, and logistic regression models, we selected models that allow human interpretability of the prediction score. The logistic regression model demonstrated 1% false negativity and 2% false positivity. The final models’ Youden indices were 0.87 and 0.77 for screening and confirmatory cutoffs, respectively. Retraining on a new assay and performance assessment in 16,123 independent variants validated our approach (Youden index, 0.93). We also derived individual pathologist-centric models (virtual consensus conference function), and a visual drill-down functionality allows assessment of how underlying features contributed to a particular score or decision branch for clinical implementation. CONCLUSION: Our decision support tool for variant reporting is a practically relevant artificial intelligence approach to harness the next-generation sequencing bioinformatics pipeline output when the complexity of data interpretation exceeds human capabilities. |
format | Online Article Text |
id | pubmed-6198661 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | American Society of Clinical Oncology |
record_format | MEDLINE/PubMed |
spelling | pubmed-61986612018-10-23 Artificial Intelligence Approach for Variant Reporting Zomnir, Michael G. Lipkin, Lev Pacula, Maciej Dominguez Meneses, Enrique MacLeay, Allison Duraisamy, Sekhar Nadhamuni, Nishchal Al Turki, Saeed H. Zheng, Zongli Rivera, Miguel Nardi, Valentina Dias-Santagata, Dora Iafrate, A. John Le, Long P. Lennerz, Jochen K. JCO Clin Cancer Inform Original Reports PURPOSE: Next-generation sequencing technologies are actively applied in clinical oncology. Bioinformatics pipeline analysis is an integral part of this process; however, humans cannot yet realize the full potential of the highly complex pipeline output. As a result, the decision to include a variant in the final report during routine clinical sign-out remains challenging. METHODS: We used an artificial intelligence approach to capture the collective clinical sign-out experience of six board-certified molecular pathologists to build and validate a decision support tool for variant reporting. We extracted all reviewed and reported variants from our clinical database and tested several machine learning models. We used 10-fold cross-validation for our variant call prediction model, which derives a contiguous prediction score from 0 to 1 (no to yes) for clinical reporting. RESULTS: For each of the 19,594 initial training variants, our pipeline generates approximately 500 features, which results in a matrix of > 9 million data points. From a comparison of naive Bayes, decision trees, random forests, and logistic regression models, we selected models that allow human interpretability of the prediction score. The logistic regression model demonstrated 1% false negativity and 2% false positivity. The final models’ Youden indices were 0.87 and 0.77 for screening and confirmatory cutoffs, respectively. Retraining on a new assay and performance assessment in 16,123 independent variants validated our approach (Youden index, 0.93). We also derived individual pathologist-centric models (virtual consensus conference function), and a visual drill-down functionality allows assessment of how underlying features contributed to a particular score or decision branch for clinical implementation. CONCLUSION: Our decision support tool for variant reporting is a practically relevant artificial intelligence approach to harness the next-generation sequencing bioinformatics pipeline output when the complexity of data interpretation exceeds human capabilities. American Society of Clinical Oncology 2018-03-22 /pmc/articles/PMC6198661/ /pubmed/30364844 http://dx.doi.org/10.1200/CCI.16.00079 Text en © 2018 by American Society of Clinical Oncology https://creativecommons.org/licenses/by-nc-nd/4.0/ Creative Commons Attribution Non-Commercial No Derivatives 4.0 License: https://creativecommons.org/licenses/by-nc-nd/4.0/ |
spellingShingle | Original Reports Zomnir, Michael G. Lipkin, Lev Pacula, Maciej Dominguez Meneses, Enrique MacLeay, Allison Duraisamy, Sekhar Nadhamuni, Nishchal Al Turki, Saeed H. Zheng, Zongli Rivera, Miguel Nardi, Valentina Dias-Santagata, Dora Iafrate, A. John Le, Long P. Lennerz, Jochen K. Artificial Intelligence Approach for Variant Reporting |
title | Artificial Intelligence Approach for Variant Reporting |
title_full | Artificial Intelligence Approach for Variant Reporting |
title_fullStr | Artificial Intelligence Approach for Variant Reporting |
title_full_unstemmed | Artificial Intelligence Approach for Variant Reporting |
title_short | Artificial Intelligence Approach for Variant Reporting |
title_sort | artificial intelligence approach for variant reporting |
topic | Original Reports |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198661/ https://www.ncbi.nlm.nih.gov/pubmed/30364844 http://dx.doi.org/10.1200/CCI.16.00079 |
work_keys_str_mv | AT zomnirmichaelg artificialintelligenceapproachforvariantreporting AT lipkinlev artificialintelligenceapproachforvariantreporting AT paculamaciej artificialintelligenceapproachforvariantreporting AT dominguezmenesesenrique artificialintelligenceapproachforvariantreporting AT macleayallison artificialintelligenceapproachforvariantreporting AT duraisamysekhar artificialintelligenceapproachforvariantreporting AT nadhamuninishchal artificialintelligenceapproachforvariantreporting AT alturkisaeedh artificialintelligenceapproachforvariantreporting AT zhengzongli artificialintelligenceapproachforvariantreporting AT riveramiguel artificialintelligenceapproachforvariantreporting AT nardivalentina artificialintelligenceapproachforvariantreporting AT diassantagatadora artificialintelligenceapproachforvariantreporting AT iafrateajohn artificialintelligenceapproachforvariantreporting AT lelongp artificialintelligenceapproachforvariantreporting AT lennerzjochenk artificialintelligenceapproachforvariantreporting |