bsw-1 2# setenv MP_SET_NUMTHREADS 2
bsw-1 3# ./stream.mp.4e6
10036960
11EBB160
13D3FD28
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 4000000
Offset = 0
The total memory requirement is 91 MB
You are running each test 10 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds
The tests below will each take a time on the order
of 134646 microseconds
(= 33662 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 331.95 0.1928 0.2065 0.2039 0.2040 0.2062
Scale: 336.37 0.1903 0.2085 0.2054 0.2055 0.2077
Add: 373.28 0.2572 0.2609 0.2592 0.2592 0.2592
Triad: 364.67 0.2633 0.2649 0.2637 0.2637 0.2634
-----------------------------------------------------------------------------
Sum of a is = 57665039062.50000
Sum of b is = 11533007812.50000
Sum of c is = 15377343750.00000
a(1),a(n) = 1153300781250.000 1153300781250.000
b(1),b(n) = 230660156250.0000 230660156250.0000
c(1),c(n) = 307546875000.0000 307546875000.0000
bsw-1 4# !!
./stream.mp.4e6
10036960
11EBB160
13D3FD28
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 4000000
Offset = 0
The total memory requirement is 91 MB
You are running each test 10 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds
The tests below will each take a time on the order
of 134403 microseconds
(= 33601 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 333.35 0.1920 0.2063 0.2048 0.2048 0.2062
Scale: 337.55 0.1896 0.2078 0.2053 0.2053 0.2069
Add: 373.45 0.2571 0.2591 0.2588 0.2588 0.2590
Triad: 364.63 0.2633 0.2635 0.2633 0.2633 0.2633
-----------------------------------------------------------------------------
Sum of a is = 57665039062.50000
Sum of b is = 11533007812.50000
Sum of c is = 15377343750.00000
a(1),a(n) = 1153300781250.000 1153300781250.000
b(1),b(n) = 230660156250.0000 230660156250.0000
c(1),c(n) = 307546875000.0000 307546875000.0000
-- -- John D. McCalpin, Ph.D. Server System Architect Server Platform Engineering http://reality.sgi.com/mccalpin/ Silicon Graphics, Inc. mccalpin@sgi.com 650-933-7407
This archive was generated by hypermail 2b29 : Tue Apr 18 2000 - 05:23:07 CDT