by Kristopher Kubicki on 9/19/2004 8:00:00 PM
Posted in Linux
Buy the Intel BX80623I52400 Processor 1 x Core
Newegg
$189.99
TigerDirect
$220.99
CompUSA
$220.99

Database Tests

MySQL 4.0.20d has been a staple of our Linux tests since its inception. Even though it does not carry high relevance for a workstation test, we still regard it as the de facto free, open sourced benchmark for Linux. Below, you can see our results for sql-bench on both the 64-bit and 32-bit kernels for SuSE 9.1.



Hold your mouse over for the 64-bit graph.




Hold your mouse over for the 64-bit graph.


We already see some exciting trends with this benchmark. For one, the 64-bit MySQLd appears to be much faster than the 32-bit one (mouse over the graphs to see the difference). Since the above benchmarks were done without HyperThreading, we enabled it in the graph below.

HyperThreading: 32-bit MySQL 4.0.20d - Test-Select

HyperThreading: 32-bit MySQL 4.0.20d - Test-Insert

We expect to see a performance increase with HyperThreading - SQL servers must thread well. Unfortunately, the sql-bench benchmark is more to blame than anything else, and it does not thread realistically. As we validate a new benchmark for this portion of our Linux benchmark suite, sql-bench will do, but keep in mind that its extremely synthetic behavior.

Index Rendering Benchmarks
sell:nike shoes$32,ed hardy(items),jean$30,handbag$35,polo shirt$13,shox$34
No Subject by Hugh R on Thursday, September 23, 2004
Thanks for this article. It has been needed for about a year. Every previous benchmark of AMD 64 seemed to be 32-bit mode which is rather missing the point.

Firefox 1.0PR on LINUX did not show the 64-bit results until I went to edit:preferences:web features:enable java advanced... and turned on lots of crap (I don't know which item made the difference).
The information was fascinating but the presentation was very awkward.

When you see a surprising benchmark result, it is a good idea to analyze why you were surprised. For example, I would guess that the poor showing for 64-bit code on John the Ripper might be due to hand-coded x86 assembly code. Note: just a guess.

The fact that Wine is only 32-bit seems pretty uninteresting/unsurprising: Win32 binaries are also only 32-bit.

Few things in the LINUX world are binary-only, so almost anything for which CPU performance matters can and should be run in 64-bit mode on a 64-bit processor.
Hugh R
No Subject by bobbozzo on Tuesday, September 21, 2004
You should be running all the compilation test with -j2 or higher, as otherwise the CPU is waiting for the disk more often.
bobbozzo
No Subject by uyu on Tuesday, September 21, 2004
Consider re-evaluating the test with the icc compiler:

http://www.intel.com/software/products/compilers/c...

I do not think it will only favor the result of intel processors..
uyu
No Subject by Zebo on Tuesday, September 21, 2004
Why separate the graphs? Afriad of people easily visualizing major A64 ownage? Gawd that's hard to compare that way... I had to get out pen and paper.
Zebo
No Subject by Shalmanese on Tuesday, September 21, 2004
"throw an alternative opterating system"

I like the attempt at subliminal advertising :D.
Shalmanese
No Subject by TrogdorJW on Monday, September 20, 2004
On the LAME encoding benchmark, isn't the actual value really "Play time divided by encoding time"? Or perhaps "Relative encoding rate"? Anyway, the text explains the graph better (in 1 second the 64-bit FX-53 encoded 25 seconds of audio). Otherwise, good stuff.
TrogdorJW
No Subject by injinj on Monday, September 20, 2004
Crafty does have a bit of hand tuned asm for both x86 and x86_64. Most of the operations are done with boards packed into bit representations. For example, like this:

while (moves) {
to=FirstOne(moves);
*move++=temp|(to<<6)|(PcOnSq(to)<<15);
Clear(to,moves);
}

The FirstOne() function utilizes the bitscan ops of x86 (bsr = bit scan reverse), but notice the cmpl at the top:

cmpl $1, 8(%esp)
sbbl %eax, %eax
movl 8(%esp,%eax,4), %edx
bsr %edx, %ecx
jz l4
andl $32, %eax
subl $31, %ecx
subl %ecx, %eax
ret
l4: movl $64, %eax

The cmpl splits a 64 bit word into a 32 bit hi and lo words, so crafty will naturally exploit 64 bit instructions.

This same function on x86_64 can be done much fewer instructions:

asm (
" bsrq %0, %1" "\n\t"
" jnz 1f" "\n\t"
" movq $-1, %1" "\n\t"
"1: movq $63, %0" "\n\t"
" subq %1, %0" "\n\t"
: "=r&" (dummy), "=r&" (dummy2)
: "0" ((long) (word))
: "cc");

These are critical functions in crafty and if you see benchmarks comparing 64 bit crafty to 32 bit crafty, this is primarily why 64 bits is faster.
injinj
No Subject by mczak on Monday, September 20, 2004
what's up with the encryption benchmarks? "OpenSSL's crypt libraries are probably heavily optimized for 32-bit operation; we see the difference in the two architectures very clearly."
But the results show that 64bit mode is more than two times as fast as 32bit mode in one case (RSA), and 50% faster in the other case (AES)?
(and btw I haven't looked at johntheripper, but it might contain hand-optimized assembly for x86, but only generic c code for other architectures such as x86_64.)
mczak
No Subject by PrinceGaz on Monday, September 20, 2004
The mouseover images work fine for me (Firefox 0.9.3)
PrinceGaz
Latest from AnandTech