Cargando…

Artificial Intelligence Approach for Variant Reporting

PURPOSE: Next-generation sequencing technologies are actively applied in clinical oncology. Bioinformatics pipeline analysis is an integral part of this process; however, humans cannot yet realize the full potential of the highly complex pipeline output. As a result, the decision to include a varian...

Descripción completa

Detalles Bibliográficos
Autores principales: Zomnir, Michael G., Lipkin, Lev, Pacula, Maciej, Dominguez Meneses, Enrique, MacLeay, Allison, Duraisamy, Sekhar, Nadhamuni, Nishchal, Al Turki, Saeed H., Zheng, Zongli, Rivera, Miguel, Nardi, Valentina, Dias-Santagata, Dora, Iafrate, A. John, Le, Long P., Lennerz, Jochen K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Clinical Oncology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198661/
https://www.ncbi.nlm.nih.gov/pubmed/30364844
http://dx.doi.org/10.1200/CCI.16.00079
_version_ 1783365005814530048
author Zomnir, Michael G.
Lipkin, Lev
Pacula, Maciej
Dominguez Meneses, Enrique
MacLeay, Allison
Duraisamy, Sekhar
Nadhamuni, Nishchal
Al Turki, Saeed H.
Zheng, Zongli
Rivera, Miguel
Nardi, Valentina
Dias-Santagata, Dora
Iafrate, A. John
Le, Long P.
Lennerz, Jochen K.
author_facet Zomnir, Michael G.
Lipkin, Lev
Pacula, Maciej
Dominguez Meneses, Enrique
MacLeay, Allison
Duraisamy, Sekhar
Nadhamuni, Nishchal
Al Turki, Saeed H.
Zheng, Zongli
Rivera, Miguel
Nardi, Valentina
Dias-Santagata, Dora
Iafrate, A. John
Le, Long P.
Lennerz, Jochen K.
author_sort Zomnir, Michael G.
collection PubMed
description PURPOSE: Next-generation sequencing technologies are actively applied in clinical oncology. Bioinformatics pipeline analysis is an integral part of this process; however, humans cannot yet realize the full potential of the highly complex pipeline output. As a result, the decision to include a variant in the final report during routine clinical sign-out remains challenging. METHODS: We used an artificial intelligence approach to capture the collective clinical sign-out experience of six board-certified molecular pathologists to build and validate a decision support tool for variant reporting. We extracted all reviewed and reported variants from our clinical database and tested several machine learning models. We used 10-fold cross-validation for our variant call prediction model, which derives a contiguous prediction score from 0 to 1 (no to yes) for clinical reporting. RESULTS: For each of the 19,594 initial training variants, our pipeline generates approximately 500 features, which results in a matrix of > 9 million data points. From a comparison of naive Bayes, decision trees, random forests, and logistic regression models, we selected models that allow human interpretability of the prediction score. The logistic regression model demonstrated 1% false negativity and 2% false positivity. The final models’ Youden indices were 0.87 and 0.77 for screening and confirmatory cutoffs, respectively. Retraining on a new assay and performance assessment in 16,123 independent variants validated our approach (Youden index, 0.93). We also derived individual pathologist-centric models (virtual consensus conference function), and a visual drill-down functionality allows assessment of how underlying features contributed to a particular score or decision branch for clinical implementation. CONCLUSION: Our decision support tool for variant reporting is a practically relevant artificial intelligence approach to harness the next-generation sequencing bioinformatics pipeline output when the complexity of data interpretation exceeds human capabilities.
format Online
Article
Text
id pubmed-6198661
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher American Society of Clinical Oncology
record_format MEDLINE/PubMed
spelling pubmed-61986612018-10-23 Artificial Intelligence Approach for Variant Reporting Zomnir, Michael G. Lipkin, Lev Pacula, Maciej Dominguez Meneses, Enrique MacLeay, Allison Duraisamy, Sekhar Nadhamuni, Nishchal Al Turki, Saeed H. Zheng, Zongli Rivera, Miguel Nardi, Valentina Dias-Santagata, Dora Iafrate, A. John Le, Long P. Lennerz, Jochen K. JCO Clin Cancer Inform Original Reports PURPOSE: Next-generation sequencing technologies are actively applied in clinical oncology. Bioinformatics pipeline analysis is an integral part of this process; however, humans cannot yet realize the full potential of the highly complex pipeline output. As a result, the decision to include a variant in the final report during routine clinical sign-out remains challenging. METHODS: We used an artificial intelligence approach to capture the collective clinical sign-out experience of six board-certified molecular pathologists to build and validate a decision support tool for variant reporting. We extracted all reviewed and reported variants from our clinical database and tested several machine learning models. We used 10-fold cross-validation for our variant call prediction model, which derives a contiguous prediction score from 0 to 1 (no to yes) for clinical reporting. RESULTS: For each of the 19,594 initial training variants, our pipeline generates approximately 500 features, which results in a matrix of > 9 million data points. From a comparison of naive Bayes, decision trees, random forests, and logistic regression models, we selected models that allow human interpretability of the prediction score. The logistic regression model demonstrated 1% false negativity and 2% false positivity. The final models’ Youden indices were 0.87 and 0.77 for screening and confirmatory cutoffs, respectively. Retraining on a new assay and performance assessment in 16,123 independent variants validated our approach (Youden index, 0.93). We also derived individual pathologist-centric models (virtual consensus conference function), and a visual drill-down functionality allows assessment of how underlying features contributed to a particular score or decision branch for clinical implementation. CONCLUSION: Our decision support tool for variant reporting is a practically relevant artificial intelligence approach to harness the next-generation sequencing bioinformatics pipeline output when the complexity of data interpretation exceeds human capabilities. American Society of Clinical Oncology 2018-03-22 /pmc/articles/PMC6198661/ /pubmed/30364844 http://dx.doi.org/10.1200/CCI.16.00079 Text en © 2018 by American Society of Clinical Oncology https://creativecommons.org/licenses/by-nc-nd/4.0/ Creative Commons Attribution Non-Commercial No Derivatives 4.0 License: https://creativecommons.org/licenses/by-nc-nd/4.0/
spellingShingle Original Reports
Zomnir, Michael G.
Lipkin, Lev
Pacula, Maciej
Dominguez Meneses, Enrique
MacLeay, Allison
Duraisamy, Sekhar
Nadhamuni, Nishchal
Al Turki, Saeed H.
Zheng, Zongli
Rivera, Miguel
Nardi, Valentina
Dias-Santagata, Dora
Iafrate, A. John
Le, Long P.
Lennerz, Jochen K.
Artificial Intelligence Approach for Variant Reporting
title Artificial Intelligence Approach for Variant Reporting
title_full Artificial Intelligence Approach for Variant Reporting
title_fullStr Artificial Intelligence Approach for Variant Reporting
title_full_unstemmed Artificial Intelligence Approach for Variant Reporting
title_short Artificial Intelligence Approach for Variant Reporting
title_sort artificial intelligence approach for variant reporting
topic Original Reports
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198661/
https://www.ncbi.nlm.nih.gov/pubmed/30364844
http://dx.doi.org/10.1200/CCI.16.00079
work_keys_str_mv AT zomnirmichaelg artificialintelligenceapproachforvariantreporting
AT lipkinlev artificialintelligenceapproachforvariantreporting
AT paculamaciej artificialintelligenceapproachforvariantreporting
AT dominguezmenesesenrique artificialintelligenceapproachforvariantreporting
AT macleayallison artificialintelligenceapproachforvariantreporting
AT duraisamysekhar artificialintelligenceapproachforvariantreporting
AT nadhamuninishchal artificialintelligenceapproachforvariantreporting
AT alturkisaeedh artificialintelligenceapproachforvariantreporting
AT zhengzongli artificialintelligenceapproachforvariantreporting
AT riveramiguel artificialintelligenceapproachforvariantreporting
AT nardivalentina artificialintelligenceapproachforvariantreporting
AT diassantagatadora artificialintelligenceapproachforvariantreporting
AT iafrateajohn artificialintelligenceapproachforvariantreporting
AT lelongp artificialintelligenceapproachforvariantreporting
AT lennerzjochenk artificialintelligenceapproachforvariantreporting