From: Frank Johnston (fjohn@us.ibm.com)
Date: Mon Jul 12 2004 - 16:08:59 CDT
IBM eServer p5 570 (1900 MHz, 8 cpu, 36MB L3 cache) with DDR2 memory.
Requesting Large Pages
Setting up for 2 CPUs per module
Number of segments per array = 4
CPU binding list : 0 2 4 6
Shared Segment Pointer = 504403158265495552
Shared Segment Pointer = 504403159339237376
Shared Segment Pointer = 504403160412979200
Segment Size (B) = 268435456 (MB = 256 )
Array Size (B) = 1073741824 (MB = 1024 )
Array Size (DW) = 134217728
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
rebind: num_parthds is 8
Starting Initialization
Done With Initialization
a(1) 1.00000000000000000
b(M) 1.00000000000000000
c(M) 1.00000000000000000
Incremental Offset = 1536
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 134155264
The total memory requirement is 3070 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 32372.1095 .0666 .0663 .0667
Scale: 31545.4466 .0682 .0680 .0684
Add: 34842.6183 .0925 .0924 .0927
Triad: 35933.6462 .0900 .0896 .0902
----------------------------------------------------
Solution Validates!
----------------------------------------------------
This archive was generated by hypermail 2.1.4 : Tue Jul 13 2004 - 08:50:46 CDT