Cargando…
Evaluation of the Intel Nehalem-EX server processor
In this paper we report on a set of benchmark results recently obtained by the CERN openlab by comparing the 4-socket, 32-core Intel Xeon X7560 server with the previous generation 4-socket server, based on the Xeon X7460 processor. The Xeon X7560 processor represents a major change in many respects,...
Autores principales: | , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2010
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/1325108 |
_version_ | 1780921641765699584 |
---|---|
author | Jarp, S Lazzaro, A Leduc, J Nowak, A |
author_facet | Jarp, S Lazzaro, A Leduc, J Nowak, A |
author_sort | Jarp, S |
collection | CERN |
description | In this paper we report on a set of benchmark results recently obtained by the CERN openlab by comparing the 4-socket, 32-core Intel Xeon X7560 server with the previous generation 4-socket server, based on the Xeon X7460 processor. The Xeon X7560 processor represents a major change in many respects, especially the memory sub-system, so it was important to make multiple comparisons. In most benchmarks the two 4-socket servers were compared. It should be underlined that both servers represent the “top of the line” in terms of frequency. However, in some cases, it was important to compare systems that integrated the latest processor features, such as QPI links, Symmetric multithreading and over-clocking via Turbo mode, and in such situations the X7560 server was compared to a dual socket L5520 based system with an identical frequency of 2.26 GHz. Before summarizing the results we must stress the fact that benchmarking of modern processors is a very complex affair. One has to control (at least) the following features: processor frequency, overclocking via Turbo mode, the number of physical cores in use, the use of logical cores via Symmetric MultiThreading (SMT), the cache sizes available, the configured memory topology, as well as the power configuration if throughput per watt is to be measured. We have tried to do a good job of comparing like with like. In summary, we saw a broad range of results. Our variant of the SPEC benc hmark rate, “HEPSPEC”, gave a stunning 3x overall improvement on the new server, thanks to good scaling with the 32 cores and a 26% additional gain when enabling SMT. In-house data analysis and simulation benchmarks showed throughput increases in the range of 11 to 60%. Oracle database tests will follow. Finally it should be mentioned that the 4-socket server can be equipped with 32 memory cards (DIMMs) which correspond to 512 GB total memory. This is a very impressive amount of memory that also comes with a very significant thermal load. |
id | cern-1325108 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2010 |
record_format | invenio |
spelling | cern-13251082019-09-30T06:29:59Zhttp://cds.cern.ch/record/1325108engJarp, SLazzaro, ALeduc, JNowak, AEvaluation of the Intel Nehalem-EX server processorComputing and ComputersIn this paper we report on a set of benchmark results recently obtained by the CERN openlab by comparing the 4-socket, 32-core Intel Xeon X7560 server with the previous generation 4-socket server, based on the Xeon X7460 processor. The Xeon X7560 processor represents a major change in many respects, especially the memory sub-system, so it was important to make multiple comparisons. In most benchmarks the two 4-socket servers were compared. It should be underlined that both servers represent the “top of the line” in terms of frequency. However, in some cases, it was important to compare systems that integrated the latest processor features, such as QPI links, Symmetric multithreading and over-clocking via Turbo mode, and in such situations the X7560 server was compared to a dual socket L5520 based system with an identical frequency of 2.26 GHz. Before summarizing the results we must stress the fact that benchmarking of modern processors is a very complex affair. One has to control (at least) the following features: processor frequency, overclocking via Turbo mode, the number of physical cores in use, the use of logical cores via Symmetric MultiThreading (SMT), the cache sizes available, the configured memory topology, as well as the power configuration if throughput per watt is to be measured. We have tried to do a good job of comparing like with like. In summary, we saw a broad range of results. Our variant of the SPEC benc hmark rate, “HEPSPEC”, gave a stunning 3x overall improvement on the new server, thanks to good scaling with the 32 cores and a 26% additional gain when enabling SMT. In-house data analysis and simulation benchmarks showed throughput increases in the range of 11 to 60%. Oracle database tests will follow. Finally it should be mentioned that the 4-socket server can be equipped with 32 memory cards (DIMMs) which correspond to 512 GB total memory. This is a very impressive amount of memory that also comes with a very significant thermal load.CERN-IT-Note-2011-005oai:cds.cern.ch:13251082010-05-01 |
spellingShingle | Computing and Computers Jarp, S Lazzaro, A Leduc, J Nowak, A Evaluation of the Intel Nehalem-EX server processor |
title | Evaluation of the Intel Nehalem-EX server processor |
title_full | Evaluation of the Intel Nehalem-EX server processor |
title_fullStr | Evaluation of the Intel Nehalem-EX server processor |
title_full_unstemmed | Evaluation of the Intel Nehalem-EX server processor |
title_short | Evaluation of the Intel Nehalem-EX server processor |
title_sort | evaluation of the intel nehalem-ex server processor |
topic | Computing and Computers |
url | http://cds.cern.ch/record/1325108 |
work_keys_str_mv | AT jarps evaluationoftheintelnehalemexserverprocessor AT lazzaroa evaluationoftheintelnehalemexserverprocessor AT leducj evaluationoftheintelnehalemexserverprocessor AT nowaka evaluationoftheintelnehalemexserverprocessor |