Cargando…

A High Performance Multi-Core FPGA Implementation for 2D Pixel Clustering for the ATLAS Fast TracKer (FTK) Processor

The high performance multi-core 2D pixel clustering FPGA implementation used for the input system of the ATLAS Fast TracKer (FTK) processor is presented. The input system for the FTK processor will receive data from the Pixel and micro-strip detectors read out drivers (RODs) at 760Gbps, the full rat...

Descripción completa

Detalles Bibliográficos
Autores principales: Sotiropoulou, C-L, Annovi, A, Beretta, M, Gkaitatzis, S, Kordas, K, Nikolaidis, S, Petridou, C, Volpi, G
Lenguaje:eng
Publicado: 2014
Materias:
Acceso en línea:http://cds.cern.ch/record/1704424
_version_ 1780936387411836928
author Sotiropoulou, C-L
Annovi, A
Beretta, M
Gkaitatzis, S
Kordas, K
Nikolaidis, S
Petridou, C
Volpi, G
author_facet Sotiropoulou, C-L
Annovi, A
Beretta, M
Gkaitatzis, S
Kordas, K
Nikolaidis, S
Petridou, C
Volpi, G
author_sort Sotiropoulou, C-L
collection CERN
description The high performance multi-core 2D pixel clustering FPGA implementation used for the input system of the ATLAS Fast TracKer (FTK) processor is presented. The input system for the FTK processor will receive data from the Pixel and micro-strip detectors read out drivers (RODs) at 760Gbps, the full rate of level 1 triggers. Clustering is required as a method to reduce the high rate of the received data before further processing, as well as to determine the cluster centroid for obtaining obtain the best spatial measurement. Our implementation targets the pixel detectors and uses a 2D-clustering algorithm that takes advantage of a moving window technique to minimize the logic required for cluster identification. The design is fully generic and the cluster detection window size can be adjusted for optimizing the cluster identification process. Τhe implementation can be parallelized by instantiating multiple cores to identify different clusters independently thus exploiting more FPGA resources. This flexibility makes the implementation suitable for a variety of demanding image processing applications. The implementation is robust against bit errors in the input data stream and drops all data that cannot be identified. In the unlikely event of missing control words, the implementation will ensure stable data processing by inserting the missing control words in the data stream. The 2D pixel clustering implementation is developed and tested in both single flow and parallel versions. The first parallel version with 16 parallel cluster identification engines is presented. The input data from the RODs are received through S-Links and a single data stream is also required by the processing units that follow the clustering implementation. Data parallelizing (demultiplexing) and serializing (multiplexing) modules are introduced in order to accommodate the parallelized version and restore the data stream to a single flow afterwards. The results of the first hardware tests of the single flow implementation on the custom FTK input mezzanine (IM) board are presented. We report on the integration of 16 parallel engines in the same FPGA, the resulting performances and the first parallel version hardware tests. The parallel 2D-clustering implementation has sufficient processing power to meet the specification for the Pixel layers of ATLAS, for up to 80 overlapping pp collisions that correspond to the maximum LHC luminosity planned until 2022.
id cern-1704424
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2014
record_format invenio
spelling cern-17044242019-09-30T06:29:59Zhttp://cds.cern.ch/record/1704424engSotiropoulou, C-LAnnovi, ABeretta, MGkaitatzis, SKordas, KNikolaidis, SPetridou, CVolpi, GA High Performance Multi-Core FPGA Implementation for 2D Pixel Clustering for the ATLAS Fast TracKer (FTK) ProcessorParticle Physics - ExperimentThe high performance multi-core 2D pixel clustering FPGA implementation used for the input system of the ATLAS Fast TracKer (FTK) processor is presented. The input system for the FTK processor will receive data from the Pixel and micro-strip detectors read out drivers (RODs) at 760Gbps, the full rate of level 1 triggers. Clustering is required as a method to reduce the high rate of the received data before further processing, as well as to determine the cluster centroid for obtaining obtain the best spatial measurement. Our implementation targets the pixel detectors and uses a 2D-clustering algorithm that takes advantage of a moving window technique to minimize the logic required for cluster identification. The design is fully generic and the cluster detection window size can be adjusted for optimizing the cluster identification process. Τhe implementation can be parallelized by instantiating multiple cores to identify different clusters independently thus exploiting more FPGA resources. This flexibility makes the implementation suitable for a variety of demanding image processing applications. The implementation is robust against bit errors in the input data stream and drops all data that cannot be identified. In the unlikely event of missing control words, the implementation will ensure stable data processing by inserting the missing control words in the data stream. The 2D pixel clustering implementation is developed and tested in both single flow and parallel versions. The first parallel version with 16 parallel cluster identification engines is presented. The input data from the RODs are received through S-Links and a single data stream is also required by the processing units that follow the clustering implementation. Data parallelizing (demultiplexing) and serializing (multiplexing) modules are introduced in order to accommodate the parallelized version and restore the data stream to a single flow afterwards. The results of the first hardware tests of the single flow implementation on the custom FTK input mezzanine (IM) board are presented. We report on the integration of 16 parallel engines in the same FPGA, the resulting performances and the first parallel version hardware tests. The parallel 2D-clustering implementation has sufficient processing power to meet the specification for the Pixel layers of ATLAS, for up to 80 overlapping pp collisions that correspond to the maximum LHC luminosity planned until 2022.ATL-DAQ-SLIDE-2014-248oai:cds.cern.ch:17044242014
spellingShingle Particle Physics - Experiment
Sotiropoulou, C-L
Annovi, A
Beretta, M
Gkaitatzis, S
Kordas, K
Nikolaidis, S
Petridou, C
Volpi, G
A High Performance Multi-Core FPGA Implementation for 2D Pixel Clustering for the ATLAS Fast TracKer (FTK) Processor
title A High Performance Multi-Core FPGA Implementation for 2D Pixel Clustering for the ATLAS Fast TracKer (FTK) Processor
title_full A High Performance Multi-Core FPGA Implementation for 2D Pixel Clustering for the ATLAS Fast TracKer (FTK) Processor
title_fullStr A High Performance Multi-Core FPGA Implementation for 2D Pixel Clustering for the ATLAS Fast TracKer (FTK) Processor
title_full_unstemmed A High Performance Multi-Core FPGA Implementation for 2D Pixel Clustering for the ATLAS Fast TracKer (FTK) Processor
title_short A High Performance Multi-Core FPGA Implementation for 2D Pixel Clustering for the ATLAS Fast TracKer (FTK) Processor
title_sort high performance multi-core fpga implementation for 2d pixel clustering for the atlas fast tracker (ftk) processor
topic Particle Physics - Experiment
url http://cds.cern.ch/record/1704424
work_keys_str_mv AT sotiropouloucl ahighperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT annovia ahighperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT berettam ahighperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT gkaitatziss ahighperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT kordask ahighperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT nikolaidiss ahighperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT petridouc ahighperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT volpig ahighperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT sotiropouloucl highperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT annovia highperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT berettam highperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT gkaitatziss highperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT kordask highperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT nikolaidiss highperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT petridouc highperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor
AT volpig highperformancemulticorefpgaimplementationfor2dpixelclusteringfortheatlasfasttrackerftkprocessor