attached are Stream results for a Superdome with:
8 cells (in a single partition)
32 cpus (fully populated with 4 cpus/cell)
128GB of memory (fully populated with 512MB DIMMs)
750MHz PA-8700 cpus
running HP-UX 11i, September patch bundle
run Tue Sep 4 17:24:09 CDT 2001
modifications to stream source
------------------------------
63c63
< PARAMETER (n=2000000,offset=0,ndim=n+offset,ntimes=10)
--- > PARAMETER (n=53477800,offset=0,ndim=n+offset,ntimes=10) 88c88 < * COMMON a,b,c --- > COMMON a,b,coutput from make ---------------- f90 -o ../../stream_d.mp +extend_source +autodbl4 +DA2.0W +noppu +DS2.0 +O3 -Wl,+pd,L -Wl,-aarchive +Oparallel stream_d. f second_wall.o stream_d.f program STREAM external function REALSIZE external subroutine CONFUSE external function CHECKTICK external subroutine CHECKSUMS
413 Lines Compiled
output from execution ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 53477800 Offset = 0 The total memory requirement is 1224 MB You are running each test 10 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity/precision appears to be 2 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 13059.4961 0.0658 0.0655 0.0660 Scale: 12838.2655 0.0669 0.0666 0.0673 Add: 13526.4205 0.0958 0.0949 0.0981 Triad: 13591.1524 0.0949 0.0944 0.0956 ---------------------------------------------------- Solution Validates! ----------------------------------------------------
This archive was generated by hypermail 2b29 : Wed Oct 31 2001 - 11:26:47 CST