Stream results for HP Integrity rx8620 with 1500MHz, 6MB Itanium 2 processors:
4 cells, 16 cpus, 32GB of memory (64x512MB DIMMs):
Function Rate (MB/s) Avg time Min time Max time
Copy: 19912.7274 0.0516 0.0514 0.0519
Scale: 19888.3396 0.0517 0.0515 0.0519
Add: 20118.3721 0.0766 0.0764 0.0769
Triad: 20510.8564 0.0754 0.0749 0.0760
The system was booted with half of the memory in each cell configured as local memory.
The runs used the omp version of the stream benchmark, with the following changes:
63c63
< PARAMETER (n=2000000,offset=0,ndim=n+offset,ntimes=10)
--- > PARAMETER (n=64003248,offset=0,ndim=n+offset,ntimes=10) 72c72 < INTEGER bytes(4) --- > INTEGER*8 bytes(4) 90c90 < * COMMON a,b,c --- > COMMON a,b,c compiled as follows (mysecond_wall.o was a C routine that called gettimeofday): f90 -o stream_d.omp +Ofaster +DSitanium2 -Wl,+pd,1M +DD64 +Oopenmp +autodbl4 +extend_source +noppu stream_d.f mysecond.o The resulting executable was run with the default thread launch policy (FILL) and the default local memory allocation policy (first-touch). Here is the output: ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 64003248 Offset = 0 The total memory requirement is 1464 MB You are running each test 10 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 Number of Threads = 16 ---------------------------------------------------- Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 19912.7274 0.0516 0.0514 0.0519 Scale: 19888.3396 0.0517 0.0515 0.0519 Add: 20118.3721 0.0766 0.0764 0.0769 Triad: 20510.8564 0.0754 0.0749 0.0760 ---------------------------------------------------- Solution Validates! ----------------------------------------------------Received on Mon Nov 3 15:21:45 2003
This archive was generated by hypermail 2.1.8 : Tue Nov 04 2003 - 12:25:25 CST