Pentium 4 1.5GHz or Athlon Thunderbird 1.2GHz (DDR)
tnaw_xtennis
Though there has been so much mixed viewpoints and feelings expressed by CPU review reporters
and readers on the just released Pentium 4, the rational way that judges a new CPU by yourself
remains the same. What is the Sweetspot of Pentium 4 and Athlon respectively? To set up a dreaming
PC, is a Pentium 4 or an Athlon your best choice?
By reviewing carefully the benchmarking results of the Pentium 4 1.5GHz and
the Athlon Thunderbird
1.2 GHz (DDR), we can get the conclusion that: (1). The Integer and
Floating-Point computing
performances of Athlon 1.2 are more stronger than those of P4 1.5. (2).
Comparing with Athlon (DDR)
system, the FSB (Front Side Bus) and memory bandwidth of P4 system are much more powerful.
(3). SSE2 (Streaming SIMD Extensions 2) optimization of multimedia
software could speed up
performances of P4 system markedly. And the optimizations are not as difficult
as you may imagine.
(4). Performances of different type applications
may be sensitive to one of the system components
mentioned above, P4 1.5 or Athlon 1.2 could beat away its competitor easily in the field be
good at itself.
1. Performances of different type applications may be sensitive to different
system components
We all know that to run software requires the coordinated interactions of numerous components
in the computer. When the instructions and data are sent to the CPU, CPU handles the incoming
instructions and data, performs necessary computations and sends instruction and data to RAM
and peripheral devices. The video card takes video data from the CPU and process it into a format
that the monitor or display device can understand and we get the final results. The CPU communicates
with the main memory and peripheral devices through FSB (Front Side Bus). The CPU can only
manipulate data that is held in the main memory (RAM), so if it needs to access data on a storage
device the CPU instructs the device to send the necessary data and instructions to RAM.
In CPU benchmarking nowadays, three main factors from the hardware side that could determine the
benchmarking performances are: (1). CPU, (2). FSB and MBW (memory bandwidth). (3). Video card.
The three parts work coordinately. If one lags behind, it becomes the bottleneck that limits the overall
performance. Let us suggest that CPU, FSB and MBW could handle over 100M units of instructions
and data/s respectively, while the video card could only process an amount of video data corresponding
to 80M units of instructions and data/s, then video card becomes the bottleneck. If CPU and video
card could process over 100M units of instructions and data/s, while FSB and MBW could handle
80M units/s only, then the bottleneck comes from FSB and MBW. And CPU would be the bottleneck
if it could only process 80M units of instructions and data/s, while the FSB, MBW and video card can
handle 100M units/s.
Most of the applications today don't request evenly the strength of the three components
mentioned
above. Different types of applications may depend
on different system components to bring performances
into full play. Some type of applications may depend on strength of the CPU,
i.e., CPU as bottleneck.
and don't stress FSB, MBW and video cards. While other type of applications may
not stress the strength
of CPU and video cards, but depend on the FSB and MBW, i.e., FSB and MBW as
bottleneck.
It is also noticeable that even for the same application the factor that
determines the performance may
changes with changing of the condition applied to the application..
The figures below show an example of shifting of the bottleneck with the changing of the condition applied.
* source: www.anandtech.com
At the screen resolution of 640x480, Quake 3 performance depends mainly on the
CPU, FSB and
MBW. The Quake 3 performance varies on the computer with different CPU (types, core
speed), FSB
and MBW. At the screen resolution of 1024x768, the amount of video data that
video card need to
process increases, and video card seems can't handle of all them and becomes the
bottleneck. Since
the GeForce 2 GTS were applied equably by all testing systems, we got the same
benchmarking result.
* source: www.anandtech.com
When video card becomes the bottleneck of a system, Quake 3 performance can't be
improved by
changing a more powerful CPU, FSB or MBW. Only a faster video card can do the trick
at this
moment, just as an example showed above. It is obviously for 3D gamers that a powerful video card
is
a better choice than a high-end CPU, e.g., a P4 1.5 or a Athlon 1.2. Since no seriously 3D gamers
play at a screen resolution of 640x480 nowadays, and the 3D game performances of
a PC at a higher
resolution is depend mainly on the video card.
2. The sweetspots of Athlon Thunderbird 1.2GHz (DDR)
The SiSoft Sandra CPU benchmarking results in Table 1 indicates clearly that
the integer and floating
-point computing performances of Athlon 1.2 are more stronger than those of P4
1.5. The SiSoft
Sandra CPU Benchmark is exclusively measures of integer and floating-point performances of a CPU.
Factors such as FSB frequency
or memory bandwidth don't affect the score as you
can see in the case
of Athlon.
Table 1. SiSoft Sandra CPU Benchmark
Pentium 4 1.5GHz | Athlon T-B 1.2GHz (DDR) | Athlon T-B 1.2GHz | |
Dhrystone ALU (MIPS) | 2807 | 3422 | 3425 |
Whetstone FPU (MFLOPS) | 895 | 1670 | 1675 |
Since most of popular commercial applications and games nowadays are integer
or floating-point
intensive, and don't deal with large stream data that are stressful on the FSB and memory
bandwidth,
Athlon Thunderbird 1.2GHz (DDR) runs these applications and games faster than Pentium 4
1.5GHz
with taking the advantage of its stronger integer and floating-point computing
ability over P4.
Below are benchmarking results of two applications that flaunt the victory of
Athlon over P4 thoroughly.
RC5 and 3D Studio Max2 Rendering test respectively and exclusively the integer performance
and
floating-point performance of the CPU. In both tests, FSB and memory bandwidth
don't affect the results
absolutely
* source: www.anandtech.com
* souce: www.tomshardware.com
3. The sweetspots of Pentium 4
The sweetspots of Pentium 4 are on those applications dealing with large stream data that
could take the
advantage of the very powerful FSB and memory bandwidth of the Pentium 4, and on
those SSE2
optimized multimedia applications. These has been reviewed in the last
posted, here are the key point:
(1). In the specialized memory bandwidth benchmarking, Pentium 4 1.5GHz get
ahead Athlon Thunderbird
1.2GHz (DDR) by +79 ~ +161%. To the applications
performing on large data streams such as image
recognition, 3D computer model
review and Video 2000 MPEG2-Encoding, with taking the advantage
of the 400MHz
FSB and 3.2GB/s memory bandwidth, Pentium 4 1.5GHz runs up to +41% faster
than
Athlon Thunderbird 1.2GHz (DDR).
(2). In the specialized
SSE2 impact benchmarking, SSE2 optimized boosts Pentium 4 1.5GHz performances
up to +104%. To applications such as 2D/3D image viewer, video and audio
editing /encoding /playing,
and speech recognition with taking the advantage of
SSE2 optimized, Pentium 4 1.5GHz could run up
to +243% faster than Athlon
Thunderbird 1.2GHz (DDR).
The poor performance of Pentium 4 in the integer and floating-point computing
exceeded all expectations
indubitably. This is certainly far way from the design goal of Pentium 4, i.e.,
"a micro-architecture
maintaining an average IPC (Instructions Executed Per Clock ) that was within approximately 10% to
20% of the
P6 micro-architecture. In this design, although the IPC is lower, the increase in frequency
capability more than makes up (Performance = frequency x IPC) and deliver
overall higher performance
capability to the end user". What is the reason for this in nature?
According to analytic information at
present, one of the most possible key factors may be execution characteristics
of software code at present
don't cooperate well effectively with the new micro-architecture of Pentium 4.
If this is the case indeed,
software re-compiled and optimized may make Pentium 4 show better performance.
The figure below show
the results of MPEG4-encoding benchmark with an original version of FlasK MPEG
and a re-compiled
version of FlasK MPEG. With the re-compiled and SSE2 optimized FlasK
MPEG, Pentium 4 have up
to +266% and +395% MPEG4 encoding performance boosts
respectively. The work concerning the FlasK
MPEG re-compiled and SSE2 optimized was done with effort
of one night only by a guy called himself
'Alex' (not so hard as you may imagine,
right?).
* souce: www.tomshardware.com
1.4GHz
Pentium 4 vs. 1.2GHz Athlon Thunderbird with DDR SDRAM ( 05/11/2000 )
400MHz FSB Makes the Pentium 4 Shine ( 08/11/2000)
Intel flaunts FPU strength of Pentium 4 by SPEC CFP2000 ( 17/11/2000 )
The Sweetspots of Pentium 4 at Present ( 24/11/2000 )