ansaurus

Question

HashSet performance Add vs Contains for existing elements

Answer 1

A:

Interesting, on my machine (Dell Latitude D630, dual-core 2.2 Ghz) I am getting nearly identical results for both tests unless I run the stopwatch against a null action before the tests. For example:

I run the tests with the exact code you have given in the question:

Without Contains(): 8205794
With Contains():    8207596

If I modify the code in this manner:

After:

Stopwatch watch = new Stopwatch();
int size = 10000;
int iterations = 10000;

Add:

watch.Time(null, 0);

My results become:

Without Contains(): 8019129
With Contains():    8275771

This seems to me like something odd is going on inside of the Stopwatch that is causing these fluctuations.

John Rasch 2009-03-09 22:20:38

jeez if you cant trust stopwatch who can you trust :) I just repeated all my tests and am getting results consistent with my original, the gap even gets bigger in release mode.

Sam Saffron 2009-03-09 22:28:35

This could be related to me having an X64 version of the framework, testing in a VM

Sam Saffron 2009-03-09 22:49:02

Confirmed the issue appears to be X64 related

Sam Saffron 2009-03-09 23:22:56

Answer 2

A:

My guess is that you ran your test from Visual Studio which caused the inlining of AddIfNotPresent into Add to be suppressed, so you're seeing the result of an extra level of indirection in the method calls.

If I compile and run from the command line to remove any VS trickery...

> csc /o+ /t:exe Program.cs
> Program.exe

...then there is no performance difference.

Sample outputs (representative of a larger number of tests):

Greg Beech 2009-03-09 22:29:40

nope, just ran in release outside of VS and am finding contains to be 15% faster

Sam Saffron 2009-03-09 22:35:02

have you tried the plain old command line version as above?

Greg Beech 2009-03-09 22:37:55

One big difference that my machine has is that im running Vista X64, will quickly test in a VM on 32 bit

Sam Saffron 2009-03-09 22:42:40

Also my machine is Quad Core which may mean something about the way the GC is operating

Sam Saffron 2009-03-09 22:43:31

I'm on a Vista x86 dual core at the moment.

Greg Beech 2009-03-09 22:45:22

confirmed, the issue only shows up on X64

Sam Saffron 2009-03-09 23:21:38

Answer 3

+1 A:

AddIfNotPresent does an additional divide that Contains doesn't perform. Take a look at the IL for Contains:

IL_000a:  call       instance int32 class System.Collections.Generic.HashSet`1<!T>::InternalGetHashCode(!0)
  IL_000f:  stloc.0
  IL_0010:  ldarg.0
  IL_0011:  ldfld      int32[] class System.Collections.Generic.HashSet`1<!T>::m_buckets
  IL_0016:  ldloc.0
  IL_0017:  ldarg.0
  IL_0018:  ldfld      int32[] class System.Collections.Generic.HashSet`1<!T>::m_buckets
  IL_001d:  ldlen
  IL_001e:  conv.i4
  IL_001f:  rem
  IL_0020:  ldelem.i4
  IL_0021:  ldc.i4.1
  IL_0022:  sub
  IL_0023:  stloc.1

This is computing the bucket location for the hash code. The result is saved at local memory location 1.

AddIfNotPresent does something similar, but it also saves the computed value at location 2, so that it can insert the item into the hash table at that position if the item doesn't exist. It does that save because one of the locations is modified later in the loop that goes looking for the item. Anyway, here's the relevant code for AddIfNotPresent:

IL_0011:  call       instance int32 class System.Collections.Generic.HashSet`1<!T>::InternalGetHashCode(!0)
  IL_0016:  stloc.0
  IL_0017:  ldloc.0
  IL_0018:  ldarg.0
  IL_0019:  ldfld      int32[] class System.Collections.Generic.HashSet`1<!T>::m_buckets
  IL_001e:  ldlen
  IL_001f:  conv.i4
  IL_0020:  rem
  IL_0021:  stloc.1
  IL_0022:  ldarg.0
  IL_0023:  ldfld      int32[] class System.Collections.Generic.HashSet`1<!T>::m_buckets
  IL_0028:  ldloc.0
  IL_0029:  ldarg.0
  IL_002a:  ldfld      int32[] class System.Collections.Generic.HashSet`1<!T>::m_buckets
  IL_002f:  ldlen
  IL_0030:  conv.i4
  IL_0031:  rem
  IL_0032:  ldelem.i4
  IL_0033:  ldc.i4.1
  IL_0034:  sub
  IL_0035:  stloc.2

Anyway, I think the extra divide is what's causing Add to take more time than Contains. At first glance, it looks like that extra divide could be factored out, but I can't say for sure without spending a little more time deciphering the IL.

Jim Mischel 2009-03-09 23:12:51

An interesting question is why is this so much slower on x64?

Sam Saffron 2009-03-09 23:29:40

ansaurus

tags:

views:

answers:

HashSet performance Add vs Contains for existing elements

related questions