ansaurus

Question

How well does the Visual C++ 2008/2010 compiler optimize?

Answer 1

+3 A:

You might want to try Intel's ICC compiler - it generates a lot better code than Visual C++, especially for SSE code. You can get a free 30 day evaluation license from intel.com.

Paul R 2010-07-14 22:28:46

It's also caught a lot of flak for generating needlessly inefficient code for AMD cpus

jalf 2010-07-15 12:24:28

@jalf: I guess that's a moot point, since SSE on AMD CPUs is pretty much useless - you probably want to use Intel CPUs if you're doing serious SIMD work.

Paul R 2010-07-15 14:49:08

@Paul: most people write software that has to run on multiple CPU's. Also, I'm not really sure what your problem with AMD's SSE performance is. I'm not aware of any significant limitations on AMD CPU's. Care to elaborate?

jalf 2010-07-15 15:07:42

@jalf: AMD still has no support for SSSE3, and its SSE implementation is still 64 bits under the hood (like on pre "Core" Intel CPUs - it takes two clocks to perform a 128 bit operation) so there is a severe performance limitation compared to current generation Intel CPUs which have SSSE3 and full 128 bit execution units.

Paul R 2010-07-15 16:30:07

Answer 2

+2 A:

You can activate asm view of the compiled code and see yourself what is generated.

Klaim 2010-07-14 22:29:40

i did it (well, i have written it this way, PTEST is an asm instruction), but the question was just why the compiler didn't use this optimization... maybe because the MSVC++ guys didn't thought about such an use/abuse...

Quonux 2010-07-14 22:40:24

Answer 3

A:

Check the presentation at http://lambda-the-ultimate.org/node/3674

Summary: Compilers generally do lots of amazing tricks now, even things that doesn't seem to be generally related to imperative programming, like tail-call optimization. MSVC++ is not the best, still it seems pretty good.

liori 2010-07-14 22:46:44

Answer 4

+3 A:

The default for the compiler is set to generate code that wil run on a 'lowest common denominator' CPU - ie one without SSE 4.1 instructions.

You can change that by targetting later CPUs only in the build options.

That said, the MS compiler is traditionally 'not the best' when it comes to SSE optimisation. I'm not even sure if it supports SSE 4 at all. That link gives good credit to GCC for SSE optimisation:

As a side note about GCC’s near perfection in code generation – I was quite surprised seeing it surpass even Intel’s own compiler

Perhaps you need to change compiler!

gbjbaanb 2010-07-14 23:04:05

ok, i had forget to mention that i setted it to SSE2, maybe there need to be a SSE4.1 switch ;). And thx for the GCC hint, ill check it out soon and try to squeeze it out :P

Quonux 2010-07-14 23:18:41

ansaurus

tags:

views:

answers:

How well does the Visual C++ 2008/2010 compiler optimize?

related questions