From: Schmidt, David (Performance Eng.) (d.schmidt@hp.com)
Date: Fri Dec 17 2004 - 13:40:43 CST
Below are 2-CPU STREAM results for an HP ProLiant DL145, configured as
follows:
HP ProLiant DL145
4x2.4GHz 240 Opteron processors
16GB PC2700 memory (8x2GB DIMMs)
SuSE Linux Enterprise Server 9 for AMD64
I used Revision 5.3 of the stream code and compiled with PGI C/C++ for
Linux v.5.2-4:
pgcc -O2 -Mvect=sse -Mnontemporal -mp -o ompstream stream_omp.c
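
For readers unfamiliar with the benchmark, here is a minimal sketch of the kind of loop that the -mp (OpenMP) flag parallelizes. It is not taken from stream_omp.c; the function name triad and its signature are assumptions made for illustration. Only the operation itself, a[j] = b[j] + scalar*c[j], is the standard STREAM Triad kernel.

```c
#include <stddef.h>

/* Triad: a[j] = b[j] + scalar * c[j], one of the four STREAM kernels.
   With OpenMP enabled the iterations are split across threads;
   without it the pragma is ignored and the loop runs serially.
   The name and signature are illustrative, not from stream_omp.c. */
void triad(double *a, const double *b, const double *c,
           double scalar, size_t n)
{
    long j;
    #pragma omp parallel for
    for (j = 0; j < (long)n; j++)
        a[j] = b[j] + scalar * c[j];
}
```

Copy, Scale and Add follow the same pattern with a simpler loop body.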
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 7500000, Offset = 0
Total memory required = 171.7 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
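
The "Total memory required" figure above can be checked by hand: three arrays of 7500000 eight-byte words each. A small sketch of that arithmetic follows. The helper name stream_total_mib is hypothetical, and reading "MB" in the banner as 2^20 bytes is an inference from the numbers (it reproduces 171.7, whereas 10^6-byte megabytes would give 180.0).

```c
#include <stddef.h>

/* Memory footprint of STREAM's three arrays, in units of 2^20 bytes.
   The helper name is hypothetical; the unit is inferred from the
   reported 171.7 figure. */
double stream_total_mib(size_t n, size_t bytes_per_word)
{
    const double mib = 1024.0 * 1024.0;
    return 3.0 * (double)n * (double)bytes_per_word / mib;
}
```

With n = 7500000 and 8-byte words this evaluates to roughly 171.66, which rounds to the 171.7 shown in the banner.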
Number of Threads requested = 2
Number of Threads requested = 2
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 16077 microseconds.
(= 16077 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:          4840.2797      0.0223       0.0248       0.0248
Scale:         4847.9256      0.0223       0.0248       0.0248
Add:           5940.5665      0.0273       0.0303       0.0303
Triad:         5534.1938      0.0293       0.0325       0.0326
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
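
The Rate column can be reproduced from the Min time column: bytes moved per pass divided by the best time, in units of 10^6 bytes per second. The sketch below shows that arithmetic. The function name stream_rate_mbs is hypothetical, and the word counts per iteration (2 for Copy and Scale, 3 for Add and Triad) are the usual STREAM accounting rather than something stated in this post.

```c
#include <assert.h>
#include <stddef.h>

/* Bandwidth in MB/s (1 MB = 10^6 bytes) for one kernel.
   words_per_iter: array elements touched per loop iteration
   (2 for Copy and Scale, 3 for Add and Triad).
   The function name is hypothetical. */
double stream_rate_mbs(int words_per_iter, size_t n,
                       size_t bytes_per_word, double min_time_s)
{
    double bytes = (double)words_per_iter * (double)n * (double)bytes_per_word;
    assert(min_time_s > 0.0);
    return 1.0e-6 * bytes / min_time_s;
}
```

For Copy, stream_rate_mbs(2, 7500000, 8, 0.0248) gives about 4838.7, close to the reported 4840.2797; the small gap comes from the time being printed to only four decimal places.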
David Schmidt
Hewlett-Packard Company
(281) 514-5039
D.Schmidt@hp.com