Cargando…

Analyzing the impact of radiation-induced failures in flash-based APSoC with and without fault tolerance techniques at CERN environment

All Programmable System-on-Chip (APSoC) devices are designed to provide higher overall programmable flexibility and system performance at lower costs. Such characteristics make APSoCs very suitable and attractive for critical environments, such as the one encountered in the accelerators chain of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Tambara, L A, Chielle, E, Kastensmidt, F L, Tsiligiannis, G, Danzeca, S, Brugger, M, Masi, A
Lenguaje:eng
Publicado: 2017
Materias:
Acceso en línea:https://dx.doi.org/10.1016/j.microrel.2017.06.049
http://cds.cern.ch/record/2320001
_version_ 1780958558713544704
author Tambara, L A
Chielle, E
Kastensmidt, F L
Tsiligiannis, G
Danzeca, S
Brugger, M
Masi, A
author_facet Tambara, L A
Chielle, E
Kastensmidt, F L
Tsiligiannis, G
Danzeca, S
Brugger, M
Masi, A
author_sort Tambara, L A
collection CERN
description All Programmable System-on-Chip (APSoC) devices are designed to provide higher overall programmable flexibility and system performance at lower costs. Such characteristics make APSoCs very suitable and attractive for critical environments, such as the one encountered in the accelerators chain of the European Organization for Nuclear Research (CERN), where electronic components can be exposed to high-energy hadrons (protons, neutrons, pions), heavy ions, and other particles, at the same time. However, APSoCs may be prone to experience Single Event Effects (SEE). We investigate how the configuration of the Processing System (PS) influences the reliability of a FLASH-based APSoC. We experimentally study the differences in the radiation-induced error rate of the PS, under various configurations while executing an application. We also propose two approaches for increasing the reliability of programs running on the embedded processor. Furthermore, we analyze the sensitivity of the system taking into account not only the cross section, but also the system reliability and the Mean Workload Between Failures (MWBF). Preliminary results show that it is possible to double the performance and to increase the system reliability up to one order of magnitude by managing processor features such as cache memory usage, error correcting codes, and processor exception handlers.
id oai-inspirehep.net-1673794
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling oai-inspirehep.net-16737942019-09-30T06:29:59Zdoi:10.1016/j.microrel.2017.06.049http://cds.cern.ch/record/2320001engTambara, L AChielle, EKastensmidt, F LTsiligiannis, GDanzeca, SBrugger, MMasi, AAnalyzing the impact of radiation-induced failures in flash-based APSoC with and without fault tolerance techniques at CERN environmentDetectors and Experimental TechniquesAll Programmable System-on-Chip (APSoC) devices are designed to provide higher overall programmable flexibility and system performance at lower costs. Such characteristics make APSoCs very suitable and attractive for critical environments, such as the one encountered in the accelerators chain of the European Organization for Nuclear Research (CERN), where electronic components can be exposed to high-energy hadrons (protons, neutrons, pions), heavy ions, and other particles, at the same time. However, APSoCs may be prone to experience Single Event Effects (SEE). We investigate how the configuration of the Processing System (PS) influences the reliability of a FLASH-based APSoC. We experimentally study the differences in the radiation-induced error rate of the PS, under various configurations while executing an application. We also propose two approaches for increasing the reliability of programs running on the embedded processor. Furthermore, we analyze the sensitivity of the system taking into account not only the cross section, but also the system reliability and the Mean Workload Between Failures (MWBF). Preliminary results show that it is possible to double the performance and to increase the system reliability up to one order of magnitude by managing processor features such as cache memory usage, error correcting codes, and processor exception handlers.oai:inspirehep.net:16737942017
spellingShingle Detectors and Experimental Techniques
Tambara, L A
Chielle, E
Kastensmidt, F L
Tsiligiannis, G
Danzeca, S
Brugger, M
Masi, A
Analyzing the impact of radiation-induced failures in flash-based APSoC with and without fault tolerance techniques at CERN environment
title Analyzing the impact of radiation-induced failures in flash-based APSoC with and without fault tolerance techniques at CERN environment
title_full Analyzing the impact of radiation-induced failures in flash-based APSoC with and without fault tolerance techniques at CERN environment
title_fullStr Analyzing the impact of radiation-induced failures in flash-based APSoC with and without fault tolerance techniques at CERN environment
title_full_unstemmed Analyzing the impact of radiation-induced failures in flash-based APSoC with and without fault tolerance techniques at CERN environment
title_short Analyzing the impact of radiation-induced failures in flash-based APSoC with and without fault tolerance techniques at CERN environment
title_sort analyzing the impact of radiation-induced failures in flash-based apsoc with and without fault tolerance techniques at cern environment
topic Detectors and Experimental Techniques
url https://dx.doi.org/10.1016/j.microrel.2017.06.049
http://cds.cern.ch/record/2320001
work_keys_str_mv AT tambarala analyzingtheimpactofradiationinducedfailuresinflashbasedapsocwithandwithoutfaulttolerancetechniquesatcernenvironment
AT chiellee analyzingtheimpactofradiationinducedfailuresinflashbasedapsocwithandwithoutfaulttolerancetechniquesatcernenvironment
AT kastensmidtfl analyzingtheimpactofradiationinducedfailuresinflashbasedapsocwithandwithoutfaulttolerancetechniquesatcernenvironment
AT tsiligiannisg analyzingtheimpactofradiationinducedfailuresinflashbasedapsocwithandwithoutfaulttolerancetechniquesatcernenvironment
AT danzecas analyzingtheimpactofradiationinducedfailuresinflashbasedapsocwithandwithoutfaulttolerancetechniquesatcernenvironment
AT bruggerm analyzingtheimpactofradiationinducedfailuresinflashbasedapsocwithandwithoutfaulttolerancetechniquesatcernenvironment
AT masia analyzingtheimpactofradiationinducedfailuresinflashbasedapsocwithandwithoutfaulttolerancetechniquesatcernenvironment