These are tuned STREAM results on an IBM System p5 550
with four 2.1 GHz cpus. This is a POWER5+ SMP machine.
Large pages were used in all cases.
Function Rate (MB/s) RMS time Min time Max time
Copy: 16255.32 .07 .07 .07
Scale: 16150.16 .07 .07 .07
Add: 20459.11 .08 .08 .08
Triad: 20721.89 .08 .08 .08
Here is the full output file:
--------------------------------------------------
Requesting Large Pages
Setting up for 2 CPUs per module
Number of segments per array = 2
CPU binding list : 0 2
Shared Segment Pointer = 504403158265495552
Shared Segment Pointer = 504403158802366464
Shared Segment Pointer = 504403159339237376
Segment Size (B) = 268435456 (MB = 256 )
Array Size (B) = 536870912 (MB = 512 )
Array Size (DW) = 67108864
Num_threads = 4
Num_threads = 4
Num_threads = 4
Num_threads = 4
rebind: num_parthds is 4
Starting Initialization
Done With Initialization
a(1) 1.00000000000000000
b(M) 1.00000000000000000
c(M) 1.00000000000000000
Incremental Offset = 2560
Number of Threads = 4
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67073024
Offset = 0
The total memory requirement is 1535 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 66588 microseconds
(= 66588 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 16255.32 .07 .07 .07
Scale: 16150.16 .07 .07 .07
Add: 20459.11 .08 .08 .08
Triad: 20721.89 .08 .08 .08
Sum of a is = 101866931943750.000
Sum of b is = 20373386388750.0000
Sum of c is = 27164515185000.0000
______________________________________________
Ly Vu
IBM Corp. - Austin, Texas.
AIX/pSeries Performance
Phone : (512) 838-8228
Email : lyvu@us.ibm.com
Received on Mon Jul 24 18:33:00 2006
This archive was generated by hypermail 2.1.8 : Tue Jul 25 2006 - 11:10:56 CST