These are standard STREAM results on an IBM System p5 595
with sixty four 2.3 GHz cpus. This is a POWER5+ SMP machine.
Large pages were used in all cases.
Function Rate (MB/s) Avg time Min time Max time
Copy: 186136.8597 .0231 .0230
.0232
Scale: 179638.7612 .0239 .0239
.0241
Add: 200409.5931 .0321 .0321
.0321
Triad: 206242.7779 .0313 .0312
.0314
Here is the full output file:
--------------------------------------------------
Requesting Large Pages
Setting up for 8 CPUs per module
Number of segments per array = 8
CPU binding list : 0 8 16 24 32 40 48 56
Shared Segment Pointer = 504403158265495552
Shared Segment Pointer = 504403160412979200
Shared Segment Pointer = 504403162560462848
Segment Size (B) = 268435456 (MB = 256 )
Array Size (B) = 2147483648 (MB = 2048 )
Array Size (DW) = 268435456
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
Num_threads = 64
rebind: num_parthds is 64
GETSHRSEG: requesting large pages
GETSHRSEG ENTRY: shmgetflag -2147481216
bindprocessor successful: thread_self() 235139 cpu_id 0
bindprocessor successful: thread_self() 235139 cpu_id 8
bindprocessor successful: thread_self() 235139 cpu_id 16
bindprocessor successful: thread_self() 235139 cpu_id 24
bindprocessor successful: thread_self() 235139 cpu_id 32
bindprocessor successful: thread_self() 235139 cpu_id 40
bindprocessor successful: thread_self() 235139 cpu_id 48
bindprocessor successful: thread_self() 235139 cpu_id 56
GETSHRSEG: requesting large pages
GETSHRSEG ENTRY: shmgetflag -2147481216
bindprocessor successful: thread_self() 235139 cpu_id 0
bindprocessor successful: thread_self() 235139 cpu_id 8
bindprocessor successful: thread_self() 235139 cpu_id 16
bindprocessor successful: thread_self() 235139 cpu_id 24
bindprocessor successful: thread_self() 235139 cpu_id 32
bindprocessor successful: thread_self() 235139 cpu_id 40
bindprocessor successful: thread_self() 235139 cpu_id 48
bindprocessor successful: thread_self() 235139 cpu_id 56
GETSHRSEG: requesting large pages
GETSHRSEG ENTRY: shmgetflag -2147481216
bindprocessor successful: thread_self() 235139 cpu_id 0
bindprocessor successful: thread_self() 235139 cpu_id 8
bindprocessor successful: thread_self() 235139 cpu_id 16
bindprocessor successful: thread_self() 235139 cpu_id 24
bindprocessor successful: thread_self() 235139 cpu_id 32
bindprocessor successful: thread_self() 235139 cpu_id 40
bindprocessor successful: thread_self() 235139 cpu_id 48
bindprocessor successful: thread_self() 235139 cpu_id 56
bindprocessor successful: thread_self() 271011 cpu_id 1
bindprocessor successful: thread_self() 307655 cpu_id 5
bindprocessor successful: thread_self() 279111 cpu_id 17
bindprocessor successful: thread_self() 255351 cpu_id 22
bindprocessor successful: thread_self() 239403 cpu_id 27
bindprocessor successful: thread_self() 271797 cpu_id 54
bindprocessor successful: thread_self() 275701 cpu_id 4
bindprocessor successful: thread_self() 328081 cpu_id 53
bindprocessor successful: thread_self() 230973 cpu_id 7
bindprocessor successful: thread_self() 263849 cpu_id 47
bindprocessor successful: thread_self() 283811 cpu_id 12
bindprocessor successful: thread_self() 319857 cpu_id 37
bindprocessor successful: thread_self() 299695 cpu_id 57
bindprocessor successful: thread_self() 291471 cpu_id 41
bindprocessor successful: thread_self() 259463 cpu_id 30
bindprocessor successful: thread_self() 300233 cpu_id 44
bindprocessor successful: thread_self() 275915 cpu_id 62
bindprocessor successful: thread_self() 234915 cpu_id 6
bindprocessor successful: thread_self() 287619 cpu_id 34
bindprocessor successful: thread_self() 251513 cpu_id 23
bindprocessor successful: thread_self() 308425 cpu_id 60
bindprocessor successful: thread_self() 327687 cpu_id 16
bindprocessor successful: thread_self() 255899 cpu_id 35
bindprocessor successful: thread_self() 283507 cpu_id 26
bindprocessor successful: thread_self() 267687 cpu_id 46
bindprocessor successful: thread_self() 299955 cpu_id 58
bindprocessor successful: thread_self() 279399 cpu_id 18
bindprocessor successful: thread_self() 332193 cpu_id 61
bindprocessor successful: thread_self() 295843 cpu_id 50
bindprocessor successful: thread_self() 352427 cpu_id 24
bindprocessor successful: thread_self() 299451 cpu_id 13
bindprocessor successful: thread_self() 283247 cpu_id 25
bindprocessor successful: thread_self() 364739 cpu_id 48
bindprocessor successful: thread_self() 275015 cpu_id 9
bindprocessor successful: thread_self() 247689 cpu_id 3
bindprocessor successful: thread_self() 292021 cpu_id 28
bindprocessor successful: thread_self() 255625 cpu_id 31
bindprocessor successful: thread_self() 295583 cpu_id 49
bindprocessor successful: thread_self() 247405 cpu_id 15
bindprocessor successful: thread_self() 368855 cpu_id 40
bindprocessor successful: thread_self() 235139 cpu_id 0
bindprocessor successful: thread_self() 264123 cpu_id 51
bindprocessor successful: thread_self() 231253 cpu_id 59
bindprocessor successful: thread_self() 251261 cpu_id 14
bindprocesso Starting Initialization
Done With Initialization
a(1) 1.00000000000000000
b(M) 1.00000000000000000
c(M) 1.00000000000000000
Incremental Offset = 1536
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 267910144
The total memory requirement is 6131 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 186136.8597 .0231 .0230
.0232
Scale: 179638.7612 .0239 .0239
.0241
Add: 200409.5931 .0321 .0321
.0321
Triad: 206242.7779 .0313 .0312
.0314
----------------------------------------------------
Solution Validates!
----------------------------------------------------
______________________________________________
Ly Vu
IBM Corp. - Austin, Texas.
AIX/pSeries Performance
Phone : (512) 838-8228
Email : lyvu@us.ibm.com
Received on Mon Jul 24 18:33:00 2006
This archive was generated by hypermail 2.1.8 : Tue Jul 25 2006 - 11:10:40 CST