Hardware (PC-compatible):
Motherboard: Gigabyte GA-8SQ800 Ultra
CPU: Intel Pentium 4, 2.4 GHz, 533 MHz FSB
RAM: 512 MB dual-channel 400 MHz DDR
Operating system:
Red Hat Linux 9.0.93, kernel 2.4.21
Compiler:
GCC 3.3.1
Compiler flags:
gcc -o stream -O3 -funroll-loops -mfpmath=sse -march=pentium4 -Wall -W \
    stream_d.c second_wall.c -lm
Results:
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 20000000, Offset = 0
Total memory required = 457.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 128674 microseconds.
(= 128674 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function      Rate (MB/s)   RMS time     Min time     Max time
Copy:           1975.2236     0.1621       0.1620       0.1623
Scale:          1985.5560     0.1612       0.1612       0.1613
Add:            2357.8408     0.2036       0.2036       0.2037
Triad:          2358.8961     0.2036       0.2035       0.2037
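
As a sanity check on the figures above, here is a minimal sketch of how the reported numbers relate to each other. It assumes the usual STREAM accounting: Copy and Scale touch two arrays per pass, Add and Triad touch three, rates are in units of 10^6 bytes/s, and the memory total is in units of 2^20 bytes. The function and variable names are made up for illustration and are not taken from stream_d.c.

```python
# Arrays read or written per element by each kernel (assumed STREAM accounting).
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

BYTES_PER_WORD = 8        # "8 bytes per DOUBLE PRECISION word"
ARRAY_SIZE = 20_000_000   # "Array size = 20000000"


def total_memory_mb(n=ARRAY_SIZE, word=BYTES_PER_WORD):
    """Footprint of the three arrays, in units of 2**20 bytes."""
    return 3 * n * word / 2**20


def rate_mb_per_s(kernel, min_time_s, n=ARRAY_SIZE, word=BYTES_PER_WORD):
    """Bytes moved in one pass divided by the best time, in 10**6 bytes/s."""
    return ARRAYS_TOUCHED[kernel] * n * word / min_time_s / 1e6


if __name__ == "__main__":
    print(f"Total memory: {total_memory_mb():.1f} MB")
    # Min times as printed in the table (rounded to four decimals).
    for kernel, t in [("Copy", 0.1620), ("Scale", 0.1612),
                      ("Add", 0.2036), ("Triad", 0.2035)]:
        print(f"{kernel:6s} {rate_mb_per_s(kernel, t):10.1f} MB/s")
```

Because the table prints the minimum times rounded to four decimals, the recomputed rates match the reported ones only to within a fraction of a percent.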
-- Markku Kolkka markku.kolkka@iki.fi
Received on Thu Aug 14 08:07:39 2003