From: Frank Johnston (fjohn@us.ibm.com)
Date: Mon Jul 12 2004 - 16:09:44 CDT
IBM eServer p5 520 (1650 MHz, 2 cpu, 36MB L3 cache)
Requesting Large Pages
Setting up for 2 CPUs per module
Number of segments per array = 1
CPU binding list : 0
Shared Segment Pointer = 504403158265495552
Shared Segment Pointer = 504403158533931008
Shared Segment Pointer = 504403158802366464
Segment Size (B) = 268435456 (MB = 256 )
Array Size (B) = 268435456 (MB = 256 )
Num_threads = 2
Num_threads = 2
rebind: num_parthds is 2
Starting Initialization
Done With Initialization
a(1) 1.00000000000000000
b(M) 1.00000000000000000
c(M) 1.00000000000000000
Incremental Offset = 1536
Number of Threads = 2
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 33537024
Offset = 0
The total memory requirement is 767 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 178269 microseconds
(= 178269 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 3076.5162 .3041 .1744 .1915
Scale: 3011.2651 .3045 .1782 .1806
Add: 4396.9786 .3088 .1831 .1858
Triad: 4510.1933 .3052 .1785 .1796
Sum of a is = 50934355200000.0000
Sum of b is = 10186871040000.0000
Sum of c is = 13582494720000.0000
This archive was generated by hypermail 2.1.4 : Tue Jul 13 2004 - 08:50:46 CDT