I ran the new Stream_d program 4 times on my Sparc-5 @ 110-MHz
(Solaris 2.4). I compiled the source this way:
cc -fast -xO3 -o streamc stream_d.c
Here are the results:
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds.
Each test below will take on the order of 286859 microseconds.
(= 71714 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING: The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Assignment: 93.1291 0.3451 0.3436 0.3500
Scaling : 92.2150 0.3512 0.3470 0.3533
Summing : 83.0654 0.5823 0.5779 0.5923
SAXPYing : 70.8583 0.6953 0.6774 0.7490
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds.
Each test below will take on the order of 293091 microseconds.
(= 73272 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING: The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Assignment: 93.6184 0.3486 0.3418 0.3572
Scaling : 92.6840 0.3565 0.3453 0.4033
Summing : 83.0871 0.5834 0.5777 0.5985
SAXPYing : 71.6299 0.6795 0.6701 0.7390
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds.
Each test below will take on the order of 306841 microseconds.
(= 76710 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING: The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Assignment: 92.0808 0.3511 0.3475 0.3591
Scaling : 92.6806 0.3525 0.3453 0.3694
Summing : 84.3806 0.5838 0.5689 0.6814
SAXPYing : 70.2928 0.6851 0.6829 0.6977
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity/precision appears to be 4 microseconds.
Each test below will take on the order of 288012 microseconds.
(= 72003 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING: The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Assignment: 93.4809 0.3459 0.3423 0.3605
Scaling : 92.2884 0.3676 0.3467 0.4787
Summing : 82.3000 0.6307 0.5832 0.9068
SAXPYing : 70.0694 0.6972 0.6850 0.7184
This archive was generated by hypermail 2b29 : Tue Apr 18 2000 - 05:23:05 CDT