Hello, John
I am sending you results for the STREAM benchmark on Siemens' RM600E system.
The OS is Reliant Unix 5.44, and the compiler flags passed to "cc" were
"-O2 -lm". As you can see, the Add rate is higher than the Copy rate! Is this
possible at all, and if so, how can it happen?
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 1000000, Offset = 0
Total memory required = 22.9 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity/precision appears to be 16 microseconds.
Each test below will take on the order of 189254 microseconds.
(= 11828 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function     Rate (MB/s)   RMS time   Min time   Max time
Copy:           75.2676      0.2141     0.2126     0.2171
Scale:          74.9931      0.2156     0.2134     0.2180
Add:            97.4220      0.2518     0.2464     0.2656
Triad:          74.6078      0.3239     0.3217     0.3271
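
For reference, the rates above can be roughly reproduced from the Min time
column. The sketch below is only a check of the arithmetic, not part of the
benchmark itself. It assumes STREAM's usual byte counting (2 array words per
iteration for Copy and Scale, 3 for Add and Triad) and 1 MB = 10^6 bytes; the
names in it are made up for illustration.

```python
# Recompute STREAM rates from the reported minimum times.
# Assumptions (not stated in the output above): STREAM counts 2 array words
# per iteration for Copy and Scale, 3 for Add and Triad, and 1 MB = 10**6 bytes.
N = 1_000_000        # array size from the report
BYTES_PER_WORD = 8   # "8 bytes per DOUBLE PRECISION word"

WORDS_PER_ITER = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}
MIN_TIME = {"Copy": 0.2126, "Scale": 0.2134, "Add": 0.2464, "Triad": 0.3217}


def rate_mb_per_s(kernel):
    """Return the rate in MB/s implied by the best (minimum) time."""
    bytes_moved = WORDS_PER_ITER[kernel] * BYTES_PER_WORD * N
    return 1.0e-6 * bytes_moved / MIN_TIME[kernel]


for kernel in MIN_TIME:
    print(f"{kernel:6s} {rate_mb_per_s(kernel):8.2f} MB/s")
```

Under this counting, Add takes longer than Copy (0.2464 s against 0.2126 s)
but is credited with 1.5 times as many bytes per iteration. The recomputed
values differ from the printed ones only in the last digits, because the Min
times are rounded to four decimal places.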
Best wishes,
Alexander N. Andreyev
Laboratory of Parallel Information Technologies, SRCC, MSU
alexander@vvv.srcc.msu.su, http://alex.motor.ru, ICQ: #3523091
Parallel Computing: http://parallel.srcc.msu.su [Russian language]