ansaurus

Question

Does using xor reg, reg give advantage over mov reg, 0?

Answer 1

+1 A:

I think on earlier architectures mov eax, 0 also used to take a little longer than xor eax, eax, though I cannot recall exactly why. Unless you have a great many of these movs, however, I would imagine you're unlikely to cause cache misses because of that one literal stored in the code.

Also note that, from memory, the two methods do not leave the flags in the same state, but I may be misremembering this.

jerryjvl 2009-07-16 06:10:58

Answer 2

+6 A:

I stopped being able to fix my own cars after I sold my 1966 HR station wagon. I'm in a similar fix with modern CPUs :-)

It really will depend on the underlying microcode or circuitry. It's quite possible that the CPU could recognize "XOR Rn,Rn" and simply zero all bits without worrying about the contents. But of course, it may do the same thing with a "MOV Rn, 0". A good compiler will choose the best variant for the target platform anyway so this is usually only an issue if you're coding in assembler.

If the CPU is smart enough, your XOR dependency disappears since it knows the value is irrelevant and will set it to zero anyway (again this depends on the actual CPU being used).
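To make the "dependency disappears" point concrete, here is a toy Python model, not a description of any real CPU. The function name `source_registers` and the `recognize_zero_idiom` switch are made up for illustration; the sketch only answers the question "which old register values does this instruction have to wait for?".

```python
def source_registers(mnemonic, dst, src, recognize_zero_idiom=True):
    """Return the registers whose previous values the instruction must wait on.

    Toy model only: `src` is either a register name (str) or an immediate (int).
    `recognize_zero_idiom` is a hypothetical switch standing in for a CPU that
    special-cases `xor reg, reg`.
    """
    if mnemonic == "mov":
        # mov overwrites dst and reads only its source operand.
        return {src} if isinstance(src, str) else set()
    if mnemonic == "xor":
        if recognize_zero_idiom and dst == src:
            # The result is zero whatever the register held, so nothing to wait on.
            return set()
        # A plain xor reads both operands (dst is also an input).
        return {dst, src} if isinstance(src, str) else {dst}
    raise ValueError(f"unsupported mnemonic: {mnemonic}")


if __name__ == "__main__":
    print(source_registers("mov", "eax", 0))
    print(source_registers("xor", "eax", "eax"))
    print(source_registers("xor", "eax", "eax", recognize_zero_idiom=False))
```

In this model, mov eax, 0 never waits on anything, while xor eax, eax only gets the same treatment if the (assumed) idiom recognition is switched on. Whether a given CPU actually does this is, as said above, up to that CPU.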

However, I'm long past caring about a few bytes or a few clock cycles in my code - this seems like micro-optimization gone mad.

paxdiablo 2009-07-16 06:14:28

Regardless of whether it is excessive optimization for practical use, there may be value to understanding that not all similar instructions are created equal. ;)

jerryjvl 2009-07-16 06:18:26

@jerryjvl - It's also useful to realize that modern desktop x86 CPUs don't run x86 machine code directly - they decode the x86 into RISC-like internal instructions to execute. As such, they can recognize common code sequences (like xor eax, eax) and translate them into simpler instructions, like maybe some "clear reg" instruction instead. An actual xor is probably not done in this case.

Michael 2009-07-16 06:35:44

Answer 3

+1 A:

As others have noted, the answer is "who cares?". Are you writing a compiler?

And on a second note, your benchmarking probably won't work, since you have a branch in there that probably takes all the time anyway. (unless your compiler unrolls the loop for you)

Another reason that you can't benchmark a single instruction in a loop is that all your code will be cached (unlike real code). So you have taken much of the size difference between mov eax,0 and xor eax,eax out of the picture by keeping the code in the L1 cache the whole time.

My guess is that any measurable performance difference in the real world would be due to the size difference eating up the cache, and not due to execution time of the two options.

Thomas 2009-07-16 06:48:50

Answer 4

+7 A:

An actual answer for you, from the Intel 64 and IA-32 Architectures Optimization Reference Manual:

Section 3.5.1.6 is where you want to look, page 122.

In short there are situations where an xor or a mov may be preferred. The issues center around dependency chains and preservation of condition codes.
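The condition-code part can be sketched with a small Python model. This is only an illustration, not anything taken from the manual: the helper names (`cmp_u32`, `mov_zero`, `xor_zero`) are invented here, and only ZF and CF are tracked. It relies on the architectural facts that MOV does not modify the flags while XOR writes them (for a zeroing xor, ZF is set and CF is cleared).

```python
def cmp_u32(a, b):
    """Model an unsigned 32-bit CMP, tracking only ZF and CF."""
    a &= 0xFFFFFFFF
    b &= 0xFFFFFFFF
    return {"ZF": a == b, "CF": a < b}


def mov_zero(regs, flags, reg):
    """mov reg, 0: zero the register, leave the flags exactly as they were."""
    regs[reg] = 0
    return dict(flags)


def xor_zero(regs, flags, reg):
    """xor reg, reg: zero the register and overwrite the flags."""
    regs[reg] = 0
    new_flags = dict(flags)
    new_flags.update(ZF=True, CF=False)  # result is zero, carry is cleared
    return new_flags


if __name__ == "__main__":
    regs = {"eax": 7}
    flags = cmp_u32(1, 2)                 # pretend a compare just ran
    print(mov_zero(regs, flags, "eax"))   # compare result still available
    print(xor_zero(regs, flags, "eax"))   # compare result is gone
```

So if the flags from an earlier compare are still needed after the register is zeroed, the mov form keeps them and the xor form does not. The usual way around that is to do the xor before the compare.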

Mark 2009-07-16 07:31:17

Answer 5

+1 A:

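x86 has variable-length instructions. In 32-bit code, MOV EAX, 0 encodes as five bytes (opcode B8 plus a 32-bit immediate), while XOR EAX, EAX takes two (31 C0), so the MOV costs three more bytes of code space. In 16-bit code, MOV AX, 0 is three bytes against two for XOR AX, AX.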
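To check the size difference yourself, a short Python sketch that just counts bytes of hand-assembled encodings will do. The byte values are the standard encodings (B8 plus immediate for the mov, 31 C0 for the xor); the dictionary names and the `extra_bytes` helper are made up for this example.

```python
# Hand-assembled machine code, written out as raw bytes.
ENCODINGS_32BIT = {
    "mov eax, 0": bytes([0xB8, 0x00, 0x00, 0x00, 0x00]),  # B8 + imm32
    "xor eax, eax": bytes([0x31, 0xC0]),                  # 31 /r
}

# The same opcodes with 16-bit operand size (as in 16-bit mode).
ENCODINGS_16BIT = {
    "mov ax, 0": bytes([0xB8, 0x00, 0x00]),               # B8 + imm16
    "xor ax, ax": bytes([0x31, 0xC0]),                    # 31 /r
}


def extra_bytes(encodings, longer, shorter):
    """How many more bytes `longer` occupies than `shorter`."""
    return len(encodings[longer]) - len(encodings[shorter])


if __name__ == "__main__":
    print(extra_bytes(ENCODINGS_32BIT, "mov eax, 0", "xor eax, eax"))
    print(extra_bytes(ENCODINGS_16BIT, "mov ax, 0", "xor ax, ax"))
```

The gap is larger in 32-bit code because the mov has to carry a full four-byte immediate, whereas the xor needs no immediate at all.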

ajs410 2009-10-15 13:42:30
