These are standard STREAM results on an IBM Power 575
with thirty-two 4.7 GHz cores. This is a POWER6 SMP machine.
Large pages were used in all cases.
Function Rate (MB/s) Avg time Min time Max time
Copy: 142708.1898 .1204 .1204 .1205
Scale: 142612.4358 .1205 .1205 .1205
Add: 159010.6786 .1624 .1621 .1628
Triad: 162844.0220 .1585 .1582 .1588
Here is the full output file:
--------------------------------------------------
Environment variable MEMSUITE set to DETAILS
Requesting LARGE pages
Running TRIAD kernel
Setting up for 32 chips
Setting up for 32 threads
Number of segments per array = 32
Reading cpu binding list for page placement
DATA binding list : 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
21 22 23 24 25 26 27 28 29 30 31
Beginning allocation of data segments....
GETSHRSEG: requesting large pages
GETSHRSEG ENTRY: shmgetflag -2147481216
Moff = 75776
Incremental Offset = 678
504403158265495552 504403166855435560 504403175445375576
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 1073666048
The total memory requirement is 24574 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 142708.1898 .1204 .1204 .1205
Scale: 142612.4358 .1205 .1205 .1205
Add: 159010.6786 .1624 .1621 .1628
Triad: 162844.0220 .1585 .1582 .1588
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Received on Mon Apr 07 21:45:36 2008
This archive was generated by hypermail 2.1.8 : Tue Apr 15 2008 - 10:02:33 CDT