Fw: standard STREAM on IBM eServer p5 550 (1500 MHz, 4cpu)

From: John D Mccalpin (mccalpin@us.ibm.com)
Date: Mon Mar 21 2005 - 07:45:56 CST

    ---
    John D. McCalpin, Ph.D.
    IBM - 11400 Burnet Road, MS 045-3N098
    Austin, TX  78758
    (512) 838-6167 or IBM tie line 678/6167
    

    ----- Forwarded by John D Mccalpin/Austin/IBM on 03/21/2005 07:45 AM -----

    Ly Vu/Austin/IBM
    03/11/2005 01:37 PM

    To:      mccalpin@us.ibm.com
    cc:      jacobt@us.ibm.com, robichau@us.ibm.com
    Subject: standard STREAM on IBM eServer p5 550 (1500 MHz, 4cpu)

    These are standard STREAM results on an IBM eServer p5 550 Express with four 1500 MHz CPUs. This is a POWER5 SMP machine. Large pages were used in all cases.

    Function      Rate (MB/s)   Avg time   Min time   Max time
    Copy:          6176.3992      .1740      .1738      .1742
    Scale:         6086.5589      .1764      .1763      .1765
    Add:           7671.7113      .2099      .2098      .2099
    Triad:         7814.9729      .2061      .2060      .2061

    Here is the full output file:
    --------------------------------------------------

    Requesting Large Pages
    Setting up for 2 CPUs per module
    Number of segments per array = 2
    CPU binding list : 0 2
    Shared Segment Pointer = 504403158265495552
    Shared Segment Pointer = 504403158802366464
    Shared Segment Pointer = 504403159339237376
    Segment Size (B) = 268435456 (MB = 256 )
    Array Size (B) = 536870912 (MB = 512 )
    Array Size (DW) = 67108864
    Num_threads = 4
    Num_threads = 4
    Num_threads = 4
    Num_threads = 4
    rebind: num_parthds is 4
    Starting Initialization
    Done With Initialization
    a(1) 1.00000000000000000
    b(M) 1.00000000000000000
    c(M) 1.00000000000000000
    Incremental Offset = 512
    ----------------------------------------------
    Double precision appears to have 16 digits of accuracy
    Assuming 8 bytes per DOUBLE PRECISION word
    ----------------------------------------------
    Array size = 67079168
    The total memory requirement is 1535 MB
    You are running each test 5 times
    --
    The *best* time for each test is used
    *EXCLUDING* the first and last iterations
    ----------------------------------------------------
    Your clock granularity appears to be less than one microsecond
    Your clock granularity/precision appears to be 1 microseconds
    ----------------------------------------------------
    Function      Rate (MB/s)   Avg time   Min time   Max time
    Copy:          6176.3992      .1740      .1738      .1742
    Scale:         6086.5589      .1764      .1763      .1765
    Add:           7671.7113      .2099      .2098      .2099
    Triad:         7814.9729      .2061      .2060      .2061
    ----------------------------------------------------
    Solution Validates!
    ----------------------------------------------------
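
The reported rates can be cross-checked against the array size and the minimum times in the output above. The sketch below is not part of the original run. It assumes the usual STREAM accounting: Copy and Scale move two 8-byte words per element, Add and Triad move three, rates use 1 MB = 10^6 bytes, and the memory figure uses 1 MB = 2^20 bytes. The function names are hypothetical, and because the printed times are rounded to four digits the recomputed rates only agree to within about 0.1%.

```python
# Words moved per array element for each STREAM kernel (reads + writes).
# Assumption: the standard STREAM counting convention.
WORDS_PER_ELEMENT = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}
BYTES_PER_WORD = 8  # "Assuming 8 bytes per DOUBLE PRECISION word"


def stream_rate_mb_s(kernel, n_elements, min_time_s):
    """Rate in MB/s (1 MB = 10**6 bytes) from the best (minimum) time."""
    bytes_moved = WORDS_PER_ELEMENT[kernel] * BYTES_PER_WORD * n_elements
    return bytes_moved / min_time_s / 1.0e6


def total_memory_mb(n_elements, n_arrays=3):
    """Footprint of the a, b and c arrays in MB (1 MB = 2**20 bytes)."""
    return n_arrays * BYTES_PER_WORD * n_elements / 2**20


if __name__ == "__main__":
    n = 67079168  # "Array size" from the output file
    # (reported rate in MB/s, min time in seconds) from the results table
    reported = {
        "Copy": (6176.3992, 0.1738),
        "Scale": (6086.5589, 0.1763),
        "Add": (7671.7113, 0.2098),
        "Triad": (7814.9729, 0.2060),
    }
    print(f"memory: {total_memory_mb(n):.1f} MB")
    for kernel, (rate, t_min) in reported.items():
        recomputed = stream_rate_mb_s(kernel, n, t_min)
        print(f"{kernel:6s} reported {rate:10.4f}  recomputed {recomputed:10.4f}")
```

Run as a script, this prints the three-array memory footprint and the reported rate next to the recomputed one for each kernel.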

    ______________________________________________
    Ly Vu
    IBM Corp. - Austin, Texas.
    RS/6000 Performance Analysis.
    Phone : (512) 838-8228
    Email : lyvu@us.ibm.com



    This archive was generated by hypermail 2.1.4 : Mon Jun 13 2005 - 08:57:46 CDT