bsw-1 6# f77 -n32 -mips4 -Ofast=ip27 stream.f second.o -o stream.small
bsw-1 7# ./stream.small
100149D8
1004F358
10089CD8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 30000
Offset = 0
The total memory requirement is 0 MB
You are running each test 20 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds
The tests below will each take a time on the order
of 306 microseconds
(= 77 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function  Rate (MB/s)  Min time  Max time  Mean time  RMS time  Median
Copy:         1030.04    0.0005    0.0005     0.0005    0.0005  0.0005
Scale:        1085.97    0.0004    0.0005     0.0004    0.0004  0.0004
Add:          1089.26    0.0007    0.0007     0.0007    0.0007  0.0007
Triad:        1182.27    0.0006    0.0006     0.0006    0.0006  0.0006
-----------------------------------------------------------------------------
Sum of a is = 3.3252567300810404E+22
Sum of b is = 6.6505134601593369E+21
Sum of c is = 8.8673512802117495E+21
a(1),a(n) = 6.6505134601593012E+23 6.6505134601593012E+23
b(1),b(n) = 1.3301026920318602E+23 1.3301026920318602E+23
c(1),c(n) = 1.7734702560424803E+23 1.7734702560424803E+23
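A note on the "0 MB" line in the output: with 30000 elements per array the three arrays occupy well under 1 MB, so a whole-megabyte figure comes out as zero. The sketch below redoes that arithmetic from the values printed by the run. It assumes the benchmark reports the size using integer division by 1024*1024; that detail is not visible in the transcript itself.

```python
# Values taken from the benchmark output above.
N = 30000            # "Array size = 30000"
BYTES_PER_WORD = 8   # "Assuming 8 bytes per DOUBLE PRECISION word"
N_ARRAYS = 3         # the arrays a, b and c

total_bytes = N_ARRAYS * BYTES_PER_WORD * N

# Exact size in binary megabytes.
mb_exact = total_bytes / (1024 * 1024)

# Assumption: the report truncates to a whole number of megabytes.
mb_truncated = total_bytes // (1024 * 1024)

print(total_bytes)         # 720000
print(round(mb_exact, 3))  # 0.687
print(mb_truncated)        # 0
```

Whether 720000 bytes fits in cache depends on the machine's cache size, which the transcript does not state.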
bsw-1 8# ^P
^P - Command not found
bsw-1 9# ./stream.small
100149D8
1004F358
10089CD8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 30000
Offset = 0
The total memory requirement is 0 MB
You are running each test 20 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds
The tests below will each take a time on the order
of 299 microseconds
(= 75 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function  Rate (MB/s)  Min time  Max time  Mean time  RMS time  Median
Copy:         1025.64    0.0005    0.0005     0.0005    0.0005  0.0005
Scale:        1085.97    0.0004    0.0005     0.0004    0.0004  0.0004
Add:          1089.26    0.0007    0.0007     0.0007    0.0007  0.0007
Triad:        1184.21    0.0006    0.0006     0.0006    0.0006  0.0006
-----------------------------------------------------------------------------
Sum of a is = 3.3252567300810404E+22
Sum of b is = 6.6505134601593369E+21
Sum of c is = 8.8673512802117495E+21
a(1),a(n) = 6.6505134601593012E+23 6.6505134601593012E+23
b(1),b(n) = 1.3301026920318602E+23 1.3301026920318602E+23
c(1),c(n) = 1.7734702560424803E+23 1.7734702560424803E+23
bsw-1 10# !!
./stream.small
100149D8
1004F358
10089CD8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 30000
Offset = 0
The total memory requirement is 0 MB
You are running each test 20 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds
The tests below will each take a time on the order
of 304 microseconds
(= 76 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function  Rate (MB/s)  Min time  Max time  Mean time  RMS time  Median
Copy:         1032.26    0.0005    0.0005     0.0005    0.0005  0.0005
Scale:        1088.44    0.0004    0.0005     0.0004    0.0004  0.0004
Add:          1089.26    0.0007    0.0007     0.0007    0.0007  0.0007
Triad:        1182.27    0.0006    0.0006     0.0006    0.0006  0.0006
-----------------------------------------------------------------------------
Sum of a is = 3.3252567300810404E+22
Sum of b is = 6.6505134601593369E+21
Sum of c is = 8.8673512802117495E+21
a(1),a(n) = 6.6505134601593012E+23 6.6505134601593012E+23
b(1),b(n) = 1.3301026920318602E+23 1.3301026920318602E+23
c(1),c(n) = 1.7734702560424803E+23 1.7734702560424803E+23
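The time columns are printed to only four decimal places, so the Rate column carries more information than the Min time column. As a rough cross-check, the sketch below recovers the best time implied by each rate from the first run. It assumes the usual STREAM accounting: Copy and Scale touch two arrays per iteration, Add and Triad touch three, and 1 MB means 10^6 bytes. The dictionary and function names are illustrative and do not come from stream.f.

```python
N = 30000            # "Array size = 30000"
BYTES_PER_WORD = 8   # "Assuming 8 bytes per DOUBLE PRECISION word"

# Assumed byte accounting: arrays read plus arrays written per kernel.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# Rates (MB/s) reported by the first run above.
REPORTED_RATE = {"Copy": 1030.04, "Scale": 1085.97,
                 "Add": 1089.26, "Triad": 1182.27}

def implied_time_us(kernel):
    """Best time in microseconds implied by the reported rate."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_WORD * N
    # bytes / (10^6 bytes per second) gives microseconds directly.
    return bytes_moved / REPORTED_RATE[kernel]

for kernel in ("Copy", "Scale", "Add", "Triad"):
    print(kernel, round(implied_time_us(kernel)))
```

Under these assumptions the implied times are about 466, 442, 661 and 609 microseconds, which agree with the rounded Min time values of 0.0005, 0.0004, 0.0007 and 0.0006 seconds.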
--
John D. McCalpin, Ph.D.         Server System Architect
Server Platform Engineering     http://reality.sgi.com/mccalpin/
Silicon Graphics, Inc.          mccalpin@sgi.com   650-933-7407