Re: [SAGE] SPEC CPU Benchmark
- To: Doug Hughes <doug@xxxxxxx>
- Subject: Re: [SAGE] SPEC CPU Benchmark
- From: ntwrkd <ntwrkd@xxxxxxxxx>
- Date: Sun, 15 Nov 2009 02:21:53 -0800
- Cc: SAGE <sage-members@xxxxxxxx>
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type; bh=FjTl53vvhYOkUM/PtHGmWGT0QNSRNbszgG386JQhyME=; b=c3TqjScynwjDAtQjNCVdWK/2rpeaYH83d/ssv7xX3JULAWp/EFV1Zb+vXS57wiTBqz abLp4BVm2d+E09QD51dbGKL7vm5ocVS5Gf0JaBhVqaIHPDwlkPNEjspAQvnabeE/jsB6 mTF/acFQjTTTPaVdTwH2+0CR0KePBvXiwddLE=
- Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=wLkgT33/ZSOyx6S+s03Lc5BlPDf1MAZYv2MBhKh8uqNIGrf8XwWBW8KTsk43lGEJG1 UHtFvkVR9acDMRj+tIA6NILfN837U/y263pH3bJdeWPgRGp5n4miO3oBteSUhHbJboll 9J6L6EYZG3MjU4pggjJ9kop/7c5TDSbEGs+Q4=
- In-reply-to: <4AEC5C2E.2060804@xxxxxxx>
- References: <f257b9ab0910310350h719d4d06l9d508776ee2a2165@xxxxxxxxxxxxxx> <4AEC5C2E.2060804@xxxxxxx>
After some testing with SPEC, I found that it is ok getting a general
idea of system performance, but as many have expressed it can be best
summed up by Doug's comment:
> Run *YOUR* code on the cpus in question, or a simplified version of it that
> is mathematically and structurally equivalent. Compare the results.
Perhaps any SPEC CPU engineers on the list may disagree?
On Sat, Oct 31, 2009 at 7:47 AM, Doug Hughes <doug@xxxxxxx> wrote:
> ntwrkd wrote:
>>
>> I'm trying to gather a baseline of performance testing the
>> CPU/OS/Compiler on a Linux system and I seemingly made the mistake of
>> purchasing the SPEC Benchmark. I don't know if there are any Intel
>> folk on the list but there seem to be a few problem with this
>> benchmark (granted that I know it is a respected standard):
>> - Lack of up-to date documentation
>> - Errors in documentation
>> - Requirements on a Fortran Compiler for integer testing
>> - No real way to translate results other than with a previous baseline
>> or another systems publicly published results.
>>
>>
>> I'm thinking that my own tests will be better off by using a more
>> real-world test, such as running a compute intensive process and
>> recording the results with sysstat.
>> This leaves me with the following questions:
>> - Does anyone use SPEC anymore?
>> - Are these shared concerns listed above with this benchmark?
>> - Any suggestions for alternatives?
>>
>>
>
> All the suggestions I have for benchmark involve getting a load that matches
> what you want to do in reality. If SPEC doesn't match your reality, it's of
> very little use. Even if you're doing pure floating point, there are a lot
> of different ways to do it. How many floating point operations can you have
> running in parallel on the different FP units (depends on your code)? SSE vs
> non-SSE offloading. is it 32 bit or 64 bit multiplication or is it division?
> All of these things make a big difference.
>
> Also, there are the memory effects, which cannot be ignored. Different
> architectures and chipsets handle memory copies VERY differently. We've
> noticed that a memory DIMM that is the same speed as all of the other dimms
> with measured with qcdstream --sse --mem (this is a good test by the way, I
> recommend it), runs 30% slower when run on a particular version of an MPI
> code test we use in HPC (inter and intra core copies for IPC).
>
> Run *YOUR* code on the cpus in question, or a simplified version of it that
> is mathematically and structurally equivalent. Compare the results.
>
> For our HPC results, the Nehalem architecture does much better at memory
> operations by a long shot then anything else for a certain HPC benchmark,
> but we have another app that runs as a file server where just more less
> expensive memory is a bigger win, so we went with AMD Istanbul with more
> cores and cheaper DDR2 dimms. Some jobs require the highest clock you can
> get because the marginal cost of the license allocation far exceeds that of
> the CPU, whereas others require higher throughput so having more less
> expenive, slightly slower CPUs is a bigger win.
>
> Measure cost per unit of speed of your code and decide whether more and
> slower or fewer and faster is your best bet.
>
This archive was generated by a fusion of
Pipermail (Mailman edition) and
MHonArc.