ansaurus

Question

Why the performance difference between C# (quite a bit slower) and Win32/C?

Answer 1

+11 A:

I believe you are seeing the results of bounds checks on the arrays. You can avoid the bounds checks by using unsafe code.

I believe the JITer can recognize patterns like for loops that go up to array.Length and avoid the bounds check, but it doesn't look like your code can utilizate that.

Michael 2009-06-29 19:33:33

I see a lot of these apples-oranges "identical code" attempts at perf comparisons with toy code. Yet I never see a negative comparison with full, product-quality code of comparable quality. Maybe because c# isn't actually slower.

Greg D 2009-06-29 19:37:32

@Greg D: I agree. I work almost exclusively on high performance, scientific oriented numerical processing. C# does have a very different perf. profile than C++, though, so profiling is critical - but in general, you can get C# to be just as fast as C++ with the right profiling and adjustments to the code.

Reed Copsey 2009-06-29 19:42:39

@Greg, Reed - Most of the issues that I see with managed code performance aren't around CPU time like this, but things like load time and memory footprint. For these, C++ still has a huge advantage (though bad programmers can easily negate that advantage :)

Michael 2009-06-29 19:45:36

@Michael: True. Startup time, in particular, tends to suffer in a managed world. Memory limits on 32bit are another issue where managed doesn't always live up to native (managed typically caps at 1.2-1.4gb/process, although the compacting GC can make up for this in most cases).

Reed Copsey 2009-06-29 19:47:57

Answer 2

A:

I am sure the optimization for C is different than C#. Also you have to expect at least a little bit of performance slow down. .NET adds another layer to the application with the framework.

The trade off is more rapid development, huge libraries and functions, for (what should be) a small amount of speed.

bdwakefield 2009-06-29 19:34:54

Answer 3

+2 A:

C# is doing bounds checking

when running the calculation part in C# unsafe code does it perform as well as the native implementation?

SQLMenace 2009-06-29 19:35:19

Answer 4

+14 A:

I believe your main issue in this code is going to be bounds checking on your arrays.

If you switch to using unsafe code in C#, and use pointer math, you should be able to achieve the same (or potentially faster) code.

This same issue was previously discussed in detail in this question.

Reed Copsey 2009-06-29 19:35:35

Answer 5

+6 A:

As others have said, one of the aspects is bounds checking. There's also some redundancy in your code in terms of array access. I've managed to improve the performance somewhat by changing the inner block to:

int tmp1 = array1[i];
int tmp2 = array2[k];
if (tmp1 == tmp2)
{
    calc = calc - array2[i] + array1[k];
}
else
{
    calc = calc + tmp1 - tmp2;
}

That change knocked the total time down from ~8.8s to ~5s.

Jon Skeet 2009-06-29 19:40:05

@Jon: Maybe I'm missing something, but I cannot measure any significant performance difference between your version and the OP's version. In fact, I also would not expect such a rather minimal change to have such an impact on performance.

0xA3 2009-06-29 20:31:40

Neither would I particularly, but it certainly does for me, on both .NET 3.5 and 4.0b1. Compiled with /o+ /debug- on 32 bit Vista as a console app. I've also changed the scope of the i and k variables, but I doubt that that's significant.

Jon Skeet 2009-06-29 20:58:05

(I've tested it enough times to make sure it's not just a fluke, btw :)

Jon Skeet 2009-06-29 20:58:37

@Jon: "I've also changed the scope of the i and k variables, but I doubt that that's significant." I checked this and it seems that a limited scope for i and k is actually the reason for the performance improvement. If i and k are just local to the for-loop the optimizer is probably able to remove the bounds check as it can determine that i and k are always within the bounds of the array (I checked this on XP/.NET 3.5).

0xA3 2009-06-30 07:38:04

It's not *just* that though - when I first just changed the scope, it made no difference - making the change specified in the answer then made a huge difference. I guess it's the two things combined.

Jon Skeet 2009-06-30 08:29:36

Answer 6

A:

To answer these extremely rapid responses (as a complete newbie I am impressed!)

Tried

VS 2005 / .Net 2

VS 2008 / .Net 2 and 3.5

VS 2010 Beta / .Net 4

Unsafe and safe

unfortunately no difference at all in c# performance.

Actually it gets worse - my colleague who is working on the complete application is getting factors of 4 slower - albeit on a T7200.

Sorry about the name change - remembered I had an OpenID (don't ask!)

2009-06-29 19:40:23

You'd normally put additional question information in an edit to your question. The "answer" part of the page is for... answers.

Daniel Earwicker 2009-06-29 19:43:35

OK - thanks for the pointer. Just a bit late over here and brain not in gear.

2009-06-29 20:01:14

Answer 7

+1 A:

If your application's performance critical path consists entirely of unchecked array processing, I'd advise you not to rewrite it in C#.

But then, if your application already works fine in language X, I'd advise you not to rewrite it in language Y.

What do you want to achieve from the rewrite? At the very least, give serious consideration to a mixed language solution, using your already-debugged C code for the high performance sections and using C# to get a nice user interface or convenient integration with the latest rich .NET libraries.

A longer answer on a possibly related theme.

Daniel Earwicker 2009-06-29 19:47:25

Answer 8

+3 A:

Just for fun, I tried building this in C# in Visual Studio 2010, and took a look at the JITed disassembly:

                    else 
                        calc = calc + array1[i] - array2[k];
000000cf  mov         eax,dword ptr [ebp-10h] 
000000d2  add         eax,dword ptr [ebp-14h] 
000000d5  sub         eax,edx 
000000d7  mov         dword ptr [ebp-10h],eax

They made a number of improvements to the jitter in 4.0 of the CLR.

2009-06-29 20:04:28

Answer 9

A:

Sorry everyone but my edit link seems to have disappeared. A big thanks for all the answers - without doubt it is the bounds checking. I obviously have far too much faith in the optimiser.

I note the link by Reed (thanks). Strangely both myself and my colleague had searched extensively on Google without coming up with this solution! Hope I can help someone else at some point in the future.

Again many thanks.

2009-06-29 20:11:45

To be fair, the optimiser isn't at fault. C# is not C, so making what amounts to a copy-paste of optimized C code into C# is not going to automatically result in optimized C# code. Another thing that a colleague of mine noticed about your code is that you left your arrays as instance values instead of making them static. I believe that's what results in those extra "ecx" additions in your assembly-- you're dereferencing the "this" pointer in c#, but there is no such thing happening in your C. These are fundamental semantic diffs that the optimizer must maintain b/c you wrote it that way.

Greg D 2009-06-29 20:33:31

ansaurus

tags:

views:

answers:

Why the performance difference between C# (quite a bit slower) and Win32/C?

related questions