John,
I ran stream_s.f on our Cray - the compiler optimized out all
the interesting code again.
Rob.
P.S. Simply inserting
Common // a, b, c
was enough to get correct results. I also used the Cray intrinsic
second, which returns CPU time and so is appropriate on a busy machine.
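A minimal sketch of what such a change might look like in a copy kernel follows. It is not taken from stream_s.f: the program name, the array size N, and the loop are hypothetical, and the standard CPU_TIME subroutine stands in for the Cray intrinsic second. Placing the arrays in blank COMMON makes them externally visible storage, which is presumably why the compiler could no longer discard the loops as dead code.

```fortran
      PROGRAM SKETCH
!     Hypothetical array size; the post does not state the real one.
      INTEGER N
      PARAMETER (N = 100000)
      REAL A(N), B(N), C(N)
!     Blank COMMON, as in the post: the arrays live in shared storage,
!     so the compiler has less freedom to treat the stores as unused.
      COMMON // A, B, C
      REAL T0, T1
      INTEGER J
!     Initialise the arrays before timing.
      DO 10 J = 1, N
         A(J) = 1.0
         B(J) = 2.0
         C(J) = 0.0
   10 CONTINUE
!     CPU_TIME is an assumption here; the post used Cray's second.
      CALL CPU_TIME(T0)
      DO 20 J = 1, N
         C(J) = A(J)
   20 CONTINUE
      CALL CPU_TIME(T1)
      PRINT *, 'copy time (s):', T1 - T0
!     Using a result afterwards also helps keep the loop alive.
      PRINT *, 'check C(N)   :', C(N)
      END
```

Printing an element of C after the timed loop is a second, independent way to stop an optimizer from removing the work being measured.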
Here are the results.
Run on Cray Y-MP2/216, serial number 1409.
Clock period 5.998 ns.
Banks are busy for 5 clock periods per reference.
Using only one processor.
UNICOS 6.0.12
Compiler invocation: cf77 -V -o Mc -Wf"-a static" stream_s.f >& stream_s.out
Executable invocation: ./Mc >> stream_s.out
1991 Oct 01 11:36:55 Tue
Robert C. Bell, CSIRO Supercomputing Support Manager
Division of Information Technology
723 Swanston Street, Carlton VIC 3053, Australia
phone: (03) 282 2620, +61 3 282 2620
fax: (03) 282 2600, +61 3 282 2600
email: csrcb@mel.dit.csiro.au
FF0001 CFT77 VERSION 4.0.3 (386394) 12/22/90 21:03:26
FF0002 COMPILE TIME .563 SECONDS
FF0006 MAXIMUM FIELD LENGTH 333805 DECIMAL WORDS
FF0003 266 SOURCE LINES
FF0004 0 ERRORS, 0 WARNINGS
FF0005 CODE: 378 WORDS, DATA: 392 WORDS
SEGLDR version 6.0 - 08/25/91 (Chg-01/21/91 Level-0)
(c) Copyright Cray Research, Inc.
Unpublished -- All rights reserved under copyright laws of the United States
--------------------------------------
Single precision appears to have 14 digits of accuracy
Assuming 8 bytes per default REAL word
--------------------------------------
Timing calibration ; time = 0.9858252 hundredths of a second
Increase the size of the arrays if this is <30 and your clock precision is =<1/100 second
---------------------------------------------------
Function      Rate (MB/s)   RMS time   Min time   Max time
Assignment:     2426.3556     0.0033     0.0033     0.0033
Scaling   :     2426.1790     0.0033     0.0033     0.0033
Summing   :     3454.4045     0.0035     0.0035     0.0036
SAXPYing  :     3396.8949     0.0036     0.0035     0.0036