Cargando…

Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks

BACKGROUND: Real-time analysis of patient data during medical procedures can provide vital diagnostic feedback that significantly improves chances of success. With sensors becoming increasingly fast, frameworks such as Deep Neural Networks are required to perform calculations within the strict timin...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sanaullah, Ahmed, Yang, Chen, Alexeev, Yuri, Yoshii, Kazutomo, Herbordt, Martin C.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2018
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302367/ https://www.ncbi.nlm.nih.gov/pubmed/30577751 http://dx.doi.org/10.1186/s12859-018-2505-7

_version_	1783381962353803264
author	Sanaullah, Ahmed Yang, Chen Alexeev, Yuri Yoshii, Kazutomo Herbordt, Martin C.
author_facet	Sanaullah, Ahmed Yang, Chen Alexeev, Yuri Yoshii, Kazutomo Herbordt, Martin C.
author_sort	Sanaullah, Ahmed
collection	PubMed
description	BACKGROUND: Real-time analysis of patient data during medical procedures can provide vital diagnostic feedback that significantly improves chances of success. With sensors becoming increasingly fast, frameworks such as Deep Neural Networks are required to perform calculations within the strict timing constraints for real-time operation. However, traditional computing platforms responsible for running these algorithms incur a large overhead due to communication protocols, memory accesses, and static (often generic) architectures. In this work, we implement a low-latency Multi-Layer Perceptron (MLP) processor using Field Programmable Gate Arrays (FPGAs). Unlike CPUs and Graphics Processing Units (GPUs), our FPGA-based design can directly interface sensors, storage devices, display devices and even actuators, thus reducing the delays of data movement between ports and compute pipelines. Moreover, the compute pipelines themselves are tailored specifically to the application, improving resource utilization and reducing idle cycles. We demonstrate the effectiveness of our approach using mass-spectrometry data sets for real-time cancer detection. RESULTS: We demonstrate that correct parameter sizing, based on the application, can reduce latency by 20% on average. Furthermore, we show that in an application with tightly coupled data-path and latency constraints, having a large amount of computing resources can actually reduce performance. Using mass-spectrometry benchmarks, we show that our proposed FPGA design outperforms both CPU and GPU implementations, with an average speedup of 144x and 21x, respectively. CONCLUSION: In our work, we demonstrate the importance of application-specific optimizations in order to minimize latency and maximize resource utilization for MLP inference. By directly interfacing and processing sensor data with ultra-low latency, FPGAs can perform real-time analysis during procedures and provide diagnostic feedback that can be critical to achieving higher percentages of successful patient outcomes.
format	Online Article Text
id	pubmed-6302367
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-63023672018-12-31 Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks Sanaullah, Ahmed Yang, Chen Alexeev, Yuri Yoshii, Kazutomo Herbordt, Martin C. BMC Bioinformatics Research BACKGROUND: Real-time analysis of patient data during medical procedures can provide vital diagnostic feedback that significantly improves chances of success. With sensors becoming increasingly fast, frameworks such as Deep Neural Networks are required to perform calculations within the strict timing constraints for real-time operation. However, traditional computing platforms responsible for running these algorithms incur a large overhead due to communication protocols, memory accesses, and static (often generic) architectures. In this work, we implement a low-latency Multi-Layer Perceptron (MLP) processor using Field Programmable Gate Arrays (FPGAs). Unlike CPUs and Graphics Processing Units (GPUs), our FPGA-based design can directly interface sensors, storage devices, display devices and even actuators, thus reducing the delays of data movement between ports and compute pipelines. Moreover, the compute pipelines themselves are tailored specifically to the application, improving resource utilization and reducing idle cycles. We demonstrate the effectiveness of our approach using mass-spectrometry data sets for real-time cancer detection. RESULTS: We demonstrate that correct parameter sizing, based on the application, can reduce latency by 20% on average. Furthermore, we show that in an application with tightly coupled data-path and latency constraints, having a large amount of computing resources can actually reduce performance. Using mass-spectrometry benchmarks, we show that our proposed FPGA design outperforms both CPU and GPU implementations, with an average speedup of 144x and 21x, respectively. CONCLUSION: In our work, we demonstrate the importance of application-specific optimizations in order to minimize latency and maximize resource utilization for MLP inference. By directly interfacing and processing sensor data with ultra-low latency, FPGAs can perform real-time analysis during procedures and provide diagnostic feedback that can be critical to achieving higher percentages of successful patient outcomes. BioMed Central 2018-12-21 /pmc/articles/PMC6302367/ /pubmed/30577751 http://dx.doi.org/10.1186/s12859-018-2505-7 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Sanaullah, Ahmed Yang, Chen Alexeev, Yuri Yoshii, Kazutomo Herbordt, Martin C. Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks
title	Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks
title_full	Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks
title_fullStr	Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks
title_full_unstemmed	Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks
title_short	Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks
title_sort	real-time data analysis for medical diagnosis using fpga-accelerated neural networks
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302367/ https://www.ncbi.nlm.nih.gov/pubmed/30577751 http://dx.doi.org/10.1186/s12859-018-2505-7
work_keys_str_mv	AT sanaullahahmed realtimedataanalysisformedicaldiagnosisusingfpgaacceleratedneuralnetworks AT yangchen realtimedataanalysisformedicaldiagnosisusingfpgaacceleratedneuralnetworks AT alexeevyuri realtimedataanalysisformedicaldiagnosisusingfpgaacceleratedneuralnetworks AT yoshiikazutomo realtimedataanalysisformedicaldiagnosisusingfpgaacceleratedneuralnetworks AT herbordtmartinc realtimedataanalysisformedicaldiagnosisusingfpgaacceleratedneuralnetworks

Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks

Ejemplares similares