Hi John,
Please find below STREAM results for a 128-socket Altix UV 1000 system.
System details:
SGI Altix UV 1000 (SSI)
128 Intel X7560 processors (8-core 2.26 GHz / 24 MB L3 cache)
5 TB main memory (768x4GB + 256x8GB quad-rank DDR3-1066 DIMMs)
SUSE Linux Enterprise Server 11 SP1 + SGI ProPack 7 SP1
Run details:
Intel compiler 10.1.018
Compilation flags: -O3 -ipo -xT -fno-alias -i8 -openmp -extend_source -mcmodel=medium -i-dynamic
Standard STREAM source code, modified to handle formatting requirements
of large arrays.
The dplace tool was used to pin threads to CPUs.
OMP_NUM_THREADS = 1024
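
As a quick cross-check of the configuration above, the thread count and the memory total follow from the socket and DIMM counts. The sketch below is only illustrative: the variable names are made up for this note, and it assumes one OpenMP thread per physical core, which is what OMP_NUM_THREADS = 1024 suggests.

```python
# Cross-check of the reported configuration, using figures from the message.
sockets = 128
cores_per_socket = 8                  # Intel X7560 is listed as an 8-core part

# Assumption: one OpenMP thread per physical core.
threads = sockets * cores_per_socket

# DIMM population: 768 x 4 GB + 256 x 8 GB
total_gb = 768 * 4 + 256 * 8
total_tb = total_gb / 1024

print(f"threads = {threads}")
print(f"memory  = {total_gb} GB = {total_tb:g} TB")
```

Both values agree with the figures quoted in the system and run details (1024 threads, 5 TB).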
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 20480000000
Offset = 0
The total memory requirement is 468750 MB
You are running each test 20 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function      Rate (MB/s)   Avg time   Min time   Max time
Copy:        1928142.3878     0.1708     0.1699     0.1746
Scale:       1933831.5239     0.1704     0.1694     0.1732
Add:         2209714.7393     0.2235     0.2224     0.2270
Triad:       2212717.3205     0.2228     0.2221     0.2251
----------------------------------------------------
Solution Validates!
----------------------------------------------------
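
The figures in the output can be reproduced from the array size and the minimum times. The sketch below assumes the usual STREAM accounting (2 words moved per element for Copy and Scale, 3 for Add and Triad). It also assumes that the memory requirement is printed in units of 2^20 bytes while the rates use 10^6 bytes, which is what the numbers above are consistent with. The names in the code are illustrative, and the per-socket figure is simply the total divided by 128.

```python
# Reproduce the STREAM summary from the array size and the reported minimum times.
N = 20_480_000_000        # "Array size" from the output
BYTES_PER_WORD = 8        # "Assuming 8 bytes per DOUBLE PRECISION word"
SOCKETS = 128

# Three arrays of N words; units of 2**20 bytes (assumed from the printed value).
footprint_mb = 3 * N * BYTES_PER_WORD / 2**20
print(f"memory requirement: {footprint_mb:.0f} MB")

# Words moved per array element for each kernel (standard STREAM accounting).
words_moved = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}
# Minimum times in seconds, as printed (rounded to four decimals).
min_time = {"Copy": 0.1699, "Scale": 0.1694, "Add": 0.2224, "Triad": 0.2221}

def rate_mb_s(kernel):
    """Bandwidth in MB/s (10**6 bytes), from bytes moved and the best time."""
    return words_moved[kernel] * N * BYTES_PER_WORD / min_time[kernel] / 1e6

for kernel in words_moved:
    rate = rate_mb_s(kernel)
    print(f"{kernel:6s} {rate:14.1f} MB/s total  {rate / SOCKETS:10.1f} MB/s per socket")
```

Because the printed minimum times are rounded, the recomputed rates differ from the reported ones in the fourth significant digit or so; the memory requirement comes out at exactly 468750 MB.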
Regards,
John
Received on Mon Jun 28 09:04:01 2010