Attached are stream results for a Superdome with:
4 cells (in a single partition)
16 cpus (fully populated with 4 cpus/cell)
64GB of memory (fully populated with 512MB DIMMs)
750MHz PA-8700 cpus
HP-UX 11i (September 2001 patch bundle)
run Tue Sep 4 16:53:12 CDT 2001
modifications to stream source
------------------------------
63c63
< PARAMETER (n=2000000,offset=0,ndim=n+offset,ntimes=10)
--- > PARAMETER (n=53477800,offset=0,ndim=n+offset,ntimes=10) 88c88 < * COMMON a,b,c --- > COMMON a,b,coutput from make ---------------- f90 -o ../../stream_d.mp +extend_source +autodbl4 +DA2.0W +noppu +DS2.0 +O3 -Wl,+pd,L -Wl,-aarchive +Oparallel stream_d. f second_wall.o stream_d.f program STREAM external function REALSIZE external subroutine CONFUSE external function CHECKTICK external subroutine CHECKSUMS
413 Lines Compiled
output from execution: ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 53477800 Offset = 0 The total memory requirement is 1224 MB You are running each test 10 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity/precision appears to be 2 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6508.8813 0.1325 0.1315 0.1336 Scale: 6514.8304 0.1326 0.1313 0.1353 Add: 6844.6101 0.1884 0.1875 0.1895 Triad: 6870.0771 0.1874 0.1868 0.1886 ---------------------------------------------------- Solution Validates! ----------------------------------------------------
This archive was generated by hypermail 2b29 : Wed Oct 31 2001 - 11:26:47 CST