xlf -O3 -qarch=pwrx stream_128.f -o stream
"stream_128.f", 1500-036 (I) Optimization level 3 has the potential to alter the semantics of a program. Please refer to documentation on -O3 and the STRICT option for more information.
** stream === End of Compilation 1 ===
** realsize === End of Compilation 2 ===
** confuse === End of Compilation 3 ===
** checktick === End of Compilation 4 ===
** second === End of Compilation 5 ===
1501-51xlf -O3 -qarch=pwrx stream_128.f -o stream
"stream_128.f", 1500-036 (I) Optimization level 3 has the potential to alter the semantics of a program. Please refer to documentation on -O3 and the STRICT option for more information.
** stream === End of Compilation 1 ===
** realsize === End of Compilation 2 ===
** confuse === End of Compilation 3 ===
** checktick === End of Compilation 4 ===
** second === End of Compilation 5 ===
1501-510 Compilation successful for file stream_128.f.
grieg 21% ./stream
Hmmmm. I am unable to determine the size.
Please enter the number of Bytes per DOUBLE PRECISION number :
16
You have manually entered a size of 16 bytes per DOUBLE PRECISION number
----------------------------------------------
Array size = 2000000
Offset = 0
The total memory requirement is 91 MB
You are running each test 10 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 10000 microseconds
The tests below will each take a time on the order
of 440000 microseconds
(= 44 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know tho be 10000 microseconds
The tests below will each take a time on the order
of 440000 microseconds
(= 44 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 800.00 .0800 .1300 .1050 .1059 .1100
Scale: 130.61 .4900 .5400 .5110 .5113 .5100
Add: 118.52 .8100 .9200 .8850 .8856 .8800
Triad: 68.57 1.4000 1.5700 1.4790 1.4800 1.4100
-----------------------------------------------------------------------------
All times are
.1100 .5000 .9200 1.5400
.1000 .4900 .9000 1.4500
.1100 .5400 .8800 1.5300
.0900 .5100 .8700 1.5200
.1000 .5100 .9100 1.4000
.1200 .5100 .8500 1.4200
.1300 .5200 .9100 1.4500
.0800 .5400 .8800 1.4400
.1100 .5000 .8100 1.5700
.1000 .4900 .9200 1.4700
-----------------------------------------------------------------------------
Sum of a is = 115330078125.000000
Sum of b is = 23066015625.0000000
Sum of c is = 30754687500.0000000
This archive was generated by hypermail 2b29 : Tue Apr 18 2000 - 05:23:06 CDT