Cargando…

Evaluation of the Intel Nehalem-EX server processor

In this paper we report on a set of benchmark results recently obtained by the CERN openlab by comparing the 4-socket, 32-core Intel Xeon X7560 server with the previous generation 4-socket server, based on the Xeon X7460 processor. The Xeon X7560 processor represents a major change in many respects,...

Descripción completa

Detalles Bibliográficos
Autores principales: Jarp, S, Lazzaro, A, Leduc, J, Nowak, A
Lenguaje:eng
Publicado: 2010
Materias:
Acceso en línea:http://cds.cern.ch/record/1325108
_version_ 1780921641765699584
author Jarp, S
Lazzaro, A
Leduc, J
Nowak, A
author_facet Jarp, S
Lazzaro, A
Leduc, J
Nowak, A
author_sort Jarp, S
collection CERN
description In this paper we report on a set of benchmark results recently obtained by the CERN openlab by comparing the 4-socket, 32-core Intel Xeon X7560 server with the previous generation 4-socket server, based on the Xeon X7460 processor. The Xeon X7560 processor represents a major change in many respects, especially the memory sub-system, so it was important to make multiple comparisons. In most benchmarks the two 4-socket servers were compared. It should be underlined that both servers represent the “top of the line” in terms of frequency. However, in some cases, it was important to compare systems that integrated the latest processor features, such as QPI links, Symmetric multithreading and over-clocking via Turbo mode, and in such situations the X7560 server was compared to a dual socket L5520 based system with an identical frequency of 2.26 GHz. Before summarizing the results we must stress the fact that benchmarking of modern processors is a very complex affair. One has to control (at least) the following features: processor frequency, overclocking via Turbo mode, the number of physical cores in use, the use of logical cores via Symmetric MultiThreading (SMT), the cache sizes available, the configured memory topology, as well as the power configuration if throughput per watt is to be measured. We have tried to do a good job of comparing like with like. In summary, we saw a broad range of results. Our variant of the SPEC benc hmark rate, “HEPSPEC”, gave a stunning 3x overall improvement on the new server, thanks to good scaling with the 32 cores and a 26% additional gain when enabling SMT. In-house data analysis and simulation benchmarks showed throughput increases in the range of 11 to 60%. Oracle database tests will follow. Finally it should be mentioned that the 4-socket server can be equipped with 32 memory cards (DIMMs) which correspond to 512 GB total memory. This is a very impressive amount of memory that also comes with a very significant thermal load.
id cern-1325108
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2010
record_format invenio
spelling cern-13251082019-09-30T06:29:59Zhttp://cds.cern.ch/record/1325108engJarp, SLazzaro, ALeduc, JNowak, AEvaluation of the Intel Nehalem-EX server processorComputing and ComputersIn this paper we report on a set of benchmark results recently obtained by the CERN openlab by comparing the 4-socket, 32-core Intel Xeon X7560 server with the previous generation 4-socket server, based on the Xeon X7460 processor. The Xeon X7560 processor represents a major change in many respects, especially the memory sub-system, so it was important to make multiple comparisons. In most benchmarks the two 4-socket servers were compared. It should be underlined that both servers represent the “top of the line” in terms of frequency. However, in some cases, it was important to compare systems that integrated the latest processor features, such as QPI links, Symmetric multithreading and over-clocking via Turbo mode, and in such situations the X7560 server was compared to a dual socket L5520 based system with an identical frequency of 2.26 GHz. Before summarizing the results we must stress the fact that benchmarking of modern processors is a very complex affair. One has to control (at least) the following features: processor frequency, overclocking via Turbo mode, the number of physical cores in use, the use of logical cores via Symmetric MultiThreading (SMT), the cache sizes available, the configured memory topology, as well as the power configuration if throughput per watt is to be measured. We have tried to do a good job of comparing like with like. In summary, we saw a broad range of results. Our variant of the SPEC benc hmark rate, “HEPSPEC”, gave a stunning 3x overall improvement on the new server, thanks to good scaling with the 32 cores and a 26% additional gain when enabling SMT. In-house data analysis and simulation benchmarks showed throughput increases in the range of 11 to 60%. Oracle database tests will follow. Finally it should be mentioned that the 4-socket server can be equipped with 32 memory cards (DIMMs) which correspond to 512 GB total memory. This is a very impressive amount of memory that also comes with a very significant thermal load.CERN-IT-Note-2011-005oai:cds.cern.ch:13251082010-05-01
spellingShingle Computing and Computers
Jarp, S
Lazzaro, A
Leduc, J
Nowak, A
Evaluation of the Intel Nehalem-EX server processor
title Evaluation of the Intel Nehalem-EX server processor
title_full Evaluation of the Intel Nehalem-EX server processor
title_fullStr Evaluation of the Intel Nehalem-EX server processor
title_full_unstemmed Evaluation of the Intel Nehalem-EX server processor
title_short Evaluation of the Intel Nehalem-EX server processor
title_sort evaluation of the intel nehalem-ex server processor
topic Computing and Computers
url http://cds.cern.ch/record/1325108
work_keys_str_mv AT jarps evaluationoftheintelnehalemexserverprocessor
AT lazzaroa evaluationoftheintelnehalemexserverprocessor
AT leducj evaluationoftheintelnehalemexserverprocessor
AT nowaka evaluationoftheintelnehalemexserverprocessor