ansaurus

Question

Why is the following program 15% slower when compiled with g++? [solved]

Answer 1

+2 A:

To figure out why it's slower, you'll probably need to look at the assembly produced by each compiler. The g++ compiler must be doing something different from the gcc compiler.

mezoid 2009-04-09 02:52:40

Same compiler - different flags. In particular, g++ sets the "compile as C++" flag to GCC.

MSalters 2009-04-09 08:50:47

Answer 2

A:

Oh, that is a fun one. But the code you gave us doesn't compile. You need

(int argc, char** argv)

Charlie Martin 2009-04-09 02:59:06

see comments section on question... thanks...

ojblass 2009-04-09 03:00:30

Answer 3

+7 A:

When compiled with gcc and g++ the only difference I see is within the first 4 lines.

gcc:

    .file "loops.c"
    .def ___main; .scl 2; .type 32; .endef
    .text
.globl _main

g++:

    .file "loops.c"
    .def ___main; .scl 2; .type 32; .endef
    .text
    .align 2
.globl _main

As you can see, the only difference is that g++ emits an .align 2 directive, so main is aligned on a word boundary. This tiny difference seems to be responsible for the significant difference in performance.

Here is a page explaining structure alignment. Although it is written for ARM/NetWinder, it is still applicable, as it discusses how alignment works on modern CPUs. You will want to read section 7 specifically, "What are the disadvantages of word alignment?":

http://netwinder.osuosl.org/users/b/brianbr/public_html/alignment.html

and here is a reference on the .align operation:

http://www.nersc.gov/vendor_docs/ibm/asm/align.htm

Benchmarks as requested:

gcc:

john@awesome:~$ time ./loopsC

real    0m21.212s
user    0m20.957s
sys     0m0.004s

g++:

john@awesome:~$ time ./loopsGPP

real    0m22.111s
user    0m21.817s
sys     0m0.000s

I reduced the innermost iteration count to 1200. The results aren't as far apart as I had hoped, but then again the assembly output was generated on Windows and the timings were done on Linux. Maybe MinGW handles alignment differently behind the scenes than gcc for Linux does.

John T 2009-04-09 03:03:30

What version of gcc are you using?

ojblass 2009-04-09 03:07:06

Could the align negatively impact performance?

ojblass 2009-04-09 03:07:44

4.4.0 (latest as of this post)

John T 2009-04-09 03:10:05

Can you run time on both versions of the exe? Each and every time I got a really significant difference.

ojblass 2009-04-09 03:13:03

@John, g++ is doing proper alignment. So, shouldn't that be faster?

chappar 2009-04-09 03:21:43

Maybe it's the setup and teardown code instead?

ojblass 2009-04-09 03:24:16

hmm.. 72 billion iterations might take a little while

John T 2009-04-09 03:25:05

I think my box is a bit polluted with libraries newer than others, for reasons I have not gotten to the bottom of... eager to see the numbers.

ojblass 2009-04-09 03:25:19

I tried to eliminate startup time to noise... you can reduce them.

ojblass 2009-04-09 03:25:50

Alright I am going to try it on a cleaner box tomorrow. Thank you so much.

ojblass 2009-04-09 03:42:10

Answer 4

+1 A:

One reason could be that gcc optimized the assignments of m and n so that they can run in parallel.

That can be done like this:

m = i+j+k;
n = i+j+k+2;

I am not sure this can improve performance by 15%. It might give a bit of a performance boost on a multicore CPU. The best way to find out is to compare the assembly code from the two compilers.

chappar 2009-04-09 03:03:54

Maybe an optimized alignment?

ojblass 2009-04-09 03:19:11
