ansaurus

Question

C# Micro-Optimization Query: IEnumerable Replacement

Answer 1

+1 A:

I'd suggest making an array of Squares (capacity four) and returning that instead. I would be very suspicious about using iterators in a performance-sensitive context. For example:

// could return IEnumerable<Square> still instead if you preferred.
public static Square[] GetNeighbors(Model m, Square s)
{
    int x = s.X, y = s.Y, i = 0;
    var result = new Square[4];

    if (x > 0) result[i++] = m[x - 1, y];
    if (y > 0) result[i++] = m[x, y - 1];
    if (x < m.Width  - 1) result[i++] = m[x + 1, y];
    if (y < m.Height - 1) result[i++] = m[x, y + 1];

    return result;
}

I wouldn't be surprised if that's much faster.

mquander 2009-03-27 16:40:55

I was in the process of suggesting the exact same thing. The use of IEnumerable introduces some hidden costs under the cover, and for only returning 4 elements ever, you don't need the flexibility and ease of use IEnumerable provides.

Michael 2009-03-27 16:42:44

I'm discovering that even when I have many elements, ienumerable is just way too expensive. This wasn't the only function paying this price.

Brian 2009-03-27 16:50:47

You might be interested in reading this blog entry describing the specific code generation that takes place for iterators and yield return. http://blogs.msdn.com/wesdyer/archive/2007/03/23/all-about-iterators.aspx

mquander 2009-03-27 16:57:11

Answer 2

+1 A:

I'm on a slippery slope, so insert disclaimer here.

I'd go with option 3. Fill in the neighbor references lazily and you've got a kind of memoization.

ANother kind of memoization would be to return an array instead of a lazy IEnumerable, and GetNeighbors becomes a pure function that is trivial to memoize. This amounts roughly to option 3 though.

In any case, but you know this, profile and re-evaluate every step of the way. I am for example unsure about the tradeoff between the lazy IEnumerable or returning an array of results directly. (you avoid some indirections but need an allocation).

Kurt Schelfthout 2009-03-27 16:42:15

Answer 3

+4 A:

Brian,

I've run into similar things in my code.

The two things I've found with C# that helped me the most:

First, don't be afraid necessarily of allocations. C# memory allocations are very, very fast, so allocating an array on the fly can often be faster than making an enumerator. However, whether this will help depends a lot on how you're using the results. The only pitfall I see is that, if you return a fixed size array (4), you're going to have to check for edge cases in the routine that's using your results.

Depending on how large your matrix of Squares is in your model, you may be better off doing 1 check up front to see if you're on the edge, and if not, precomputing the full array and returning it. If you're on an edge, you can handle those special cases separately (make a 1 or 2 element array as appropriate). This would put one larger statement in there, but that is often faster in my experience. If the model is large, I would avoid precomputing all of the neighbors. The overhead in the Squares may outweigh the benefits.

In my experience, as well, preallocating and returning vs. using yield makes the JIT more likely to inline your function, which can make a big difference in speed. If you can take advantage of the IEnumerable results and you are not always using every returned element, that is better, but otherwise, precomputing may be faster.

The other thing to consider - I don't know what information is saved in Square in your case, but if hte object is relatively small, and being used in a large matrix and iterated over many, many times, consider making it a struct. I had a routine similar to this (called hundreds of thousands or millions of times in a loop), and changing the class to a struct, in my case, sped up the routine by over 40%. This is assuming you're using .net 3.5sp1, though, as the JIT does many more optimizations on structs in the latest release.

There are other potential pitfalls to switching to struct vs. class, of course, but it can have huge performance impacts.

Reed Copsey 2009-03-27 16:57:26

Answer 4

+1 A:

Why not make the Square class responsible of returning it's neighbours? Then you have an excellent place to do lazy initialisation without the extra overhead of memoization.

public class Square {

 private Model _model;
 private int _x;
 private int _y;
 private Square[] _neightbours;

 public Square(Model model, int x, int y) {
  _model = model;
  _x = x;
  _y = y;
  _neightbours = null;
 }

 public Square[] Neighbours {
  get {
   if (_neightbours == null) {
    _neighbours = GetNeighbours();
   }
   return _neighbours;
  }
 }

 private Square[] GetNeightbours() {
  int len = 4;
  if (_x == 0) len--;
  if (_x == _model.Width - 1) len--;
  if (_y == 0) len--;
  if (-y == _model.Height -1) len--;
  Square [] result = new Square(len);
  int i = 0;
  if (_x > 0) {
   result[i++] = _model[_x - 1,_y];
  }
  if (_x < _model.Width - 1) {
   result[i++] = _model[_x + 1,_y];
  }
  if (_y > 0) {
   result[i++] = _model[_x,_y - 1];
  }
  if (_y < _model.Height - 1) {
   result[i++] = _model[_x,_y + 1];
  }
  return result;
 }

}

Guffa 2009-03-27 17:26:29

If there aren't a lot of squares in the Model, this is great, but you're adding a lot of pointers if you have many squares. The "100,000/second" loops suggests a lot of squares, though, so I'd be watchful of memory, too.

Reed Copsey 2009-03-27 17:58:18

Answer 5

A:

Depending on the use of GetNeighbors, maybe some inversion of control could help:

public static void DoOnNeighbors(Model m, Square s, Action<s> action) {
  int x = s.X;
  int y = s.Y;        
  if (x > 0) action(m[x - 1, y]);
  if (y > 0) action(m[x, y - 1]);
  if (x < m.Width - 1) action(m[x + 1, y]);
  if (y < m.Height - 1) action(m[x, y + 1]);
}

But I'm not sure, if this has better performance.

MartinStettner 2009-03-27 17:45:21

DoOnNeighbors sounds kinda pervy

StingyJack 2009-03-27 17:52:00

Might be 4 times better since it uses one delegate instead of 4 enumerates.

Brian 2009-03-27 21:11:57

ansaurus

tags:

views:

answers:

C# Micro-Optimization Query: IEnumerable Replacement

related questions