From: Bradley Lucier (lucier@math.purdue.edu)
Date: Fri Jul 02 2004 - 15:54:53 CDT
These are the stream results with IBM's XLC 6.0 compiler with
92 22:18 /opt/ibmcmp/vac/6.0/bin/cc -O5 stream_d.c second_cpu.c
on a dual 2.0GHz G5:
[xsun21:~] lucier% ./a.out
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 40000000, Offset = 0
Total memory required = 915.5 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity/precision appears to be 9999 microseconds.
Each test below will take on the order of 220000 microseconds.
(= 22 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 3368.4258 0.1951 0.1900 0.2000
Scale: 2909.0972 0.2230 0.2200 0.2300
Add: 2526.3186 0.3810 0.3800 0.3900
Triad: 2526.3190 0.3871 0.3800 0.4100
This archive was generated by hypermail 2.1.4 : Wed Aug 11 2004 - 22:32:18 CDT