From: John D Mccalpin (mccalpin@us.ibm.com)
Date: Wed Jan 22 2003 - 07:23:53 CST
The system used was:
IBM eServer pSeries 650 Model 6M2. 8 CPUs. 1450 MHz.
SUSE's SLES 8 for pSeries GA'ed product w/2.4.19 UL 1 64-bit kernel.
The "un-tuned" results (in MB/s) were:
Copy: 6108
Scale: 5936
Add: 7429
Triad: 7588
These results were consistent with runs done on AIX on the same model
hardware.
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 66060288
Offset = 96
The total memory requirement is 1512 MB
You are running each test 100 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 115993 microseconds
(= 115993 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function      Rate (MB/s)   RMS time   Min time   Max time
Copy:           6108.2801      .1756      .1730      .1802
Scale:          5936.6008      .1944      .1780      .7617
Add:            7429.6680      .2161      .2134      .2197
Triad:          7588.9570      .2116      .2089      .2165
Sum of a is = 0.537150969561264381E+126
Sum of b is = 0.107430193911095819E+126
Sum of c is = 0.143240258548815869E+126
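
For anyone who wants to check the figures above, here is a small sketch that recomputes the memory requirement and the four rates from the array size and the Min time column. It is not part of the benchmark. It assumes the usual STREAM counting conventions, which the output does not spell out: three arrays of 8-byte words, two words moved per element for Copy and Scale, three for Add and Triad, "MB" meaning 2**20 bytes in the memory line and 10**6 bytes in the rate column. The function names are made up for this example.

```python
# Sanity check of the figures in the STREAM output above.
# Assumed conventions (standard for STREAM, not stated in the post):
#   - three arrays (a, b, c) of 8-byte words
#   - Copy and Scale move 2 words per element, Add and Triad move 3
#   - "MB" in the memory line is 2**20 bytes; "MB/s" is 10**6 bytes/s

N = 66060288          # "Array size" from the output
BYTES_PER_WORD = 8    # "Assuming 8 bytes per DOUBLE PRECISION word"

# Words read plus written per array element, per kernel.
WORDS_PER_ELEMENT = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# "Min time" column from the output, in seconds (rounded to 4 digits).
MIN_TIME = {"Copy": 0.1730, "Scale": 0.1780, "Add": 0.2134, "Triad": 0.2089}


def footprint_mib(n=N, arrays=3, bytes_per_word=BYTES_PER_WORD):
    """Total size of the arrays in units of 2**20 bytes."""
    return arrays * n * bytes_per_word / 2**20


def rate_mb_s(kernel, n=N, bytes_per_word=BYTES_PER_WORD):
    """Best-case rate in units of 10**6 bytes per second."""
    bytes_moved = WORDS_PER_ELEMENT[kernel] * n * bytes_per_word
    return 1.0e-6 * bytes_moved / MIN_TIME[kernel]


if __name__ == "__main__":
    print(f"memory: {footprint_mib():.0f} MB")
    for kernel in WORDS_PER_ELEMENT:
        print(f"{kernel + ':':<8}{rate_mb_s(kernel):10.1f} MB/s")
```

Because the Min time values are printed to only four digits, the recomputed rates agree with the reported ones to within about 0.1% rather than exactly. The memory figure comes out at exactly 1512.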
---
John D. McCalpin, Ph.D.
STSM, eServer Hardware Performance
IBM - 11400 Burnet Road, MS 045-3N098
Austin, TX 78758
(512) 838-6167 or tie line 678/6167
FAX (512) 838-6486 or 678/6486