ansaurus

Question

Strange slowness while iterating over Linq result

Answer 1

+5 A:

The reason the linq query appears to execute quickly is because nothing is actually been calculated at the point of definition since linq uses deferred execution i.e. no 'real' work is done until you start to enumerate the results.

Lee 2009-10-17 13:33:56

Answer 2

+3 A:

The reason is that the query isn't actually run until you enumerate it. In LINQ to objects it just sets up a bunch of delegates that get called when you iterate over the enumerator. If you were to add a ToList() to the query to materialize it you'd see the time taken would shift to the set up and away from the display.

tvanfosson 2009-10-17 13:34:06

Answer 3

+3 A:

with many linq providers, the "alg start" to "alt end" is just parsed - the actual expressions aren't evaluated until you actually start enumerating the result. So the actual creation of the "qry" variable is fast (just setting up a enumerable that will actually perform the logic in the query), but enumerating through it is slower.

Philip Rieck 2009-10-17 13:34:44

Answer 4

+3 A:

The LINQ code only creates a query object out of the query expression, which doesn't take a lot of time. Only in the foreach is the query actually executed.

By the way, you shouldn't use DateTime.Now for performance timing, but the Stopwatch class, as it's far more accurate.

Joren 2009-10-17 13:34:48

Answer 5

+3 A:

The query doesn't actually calculate until you iterate over it. Until then it is just like a SQL statement, waiting to be executed.

Yuriy Faktorovich 2009-10-17 13:35:23

Answer 6

+2 A:

That question is doing brute forcing; LINQ is actually quite handy in such cases - I discussed this here: Brute force (but lazily)

Just to expand on some of the previous answers:

LINQ is typically designed around deferred execution, meaning nothing happens until start iterating the result. This is typically done via an iterator block; consider the difference between these:

static IEnumerable<T> Where(this IEnumerable<T> data, Func<T,bool> predicate) {
    foreach(T item in data) {
        if(predicate(item)) yield return item;
    }
}

and:

static IEnumerable<T> Where(this IEnumerable<T> data, Func<T,bool> predicate) {
    var list = new List<T>();
    foreach(T item in data) {
        if(predicate(item)) list.Add(item);
    }
    return list;
}

The difference is that the second version does all the work when you call Where, returning a single result, where-as the second (via the magic of iterator blocks) only does work when the enumerator calls MoveNext(). Iterator blocks are discussed more in the free sample chapter 6 of C# in Depth.

Generally, the advantage of this is that it makes queries composable - especially important for database-based queries, but just as valid for regular work.

Note that even with iterator blocks, there is a second consideration; buffering. Consider Reverse() - no matter how you do it, to reverse a sequence, first you need to find the end of the sequence. Now consider that not all sequences end! Contrast this to Where, Skip, Take etc - which can filter rows without buffering (simply by dropping items).

A good example of using this in an infinite sequence is this Fibonacci question, where we can use the non-buffered, deferred approach:

    foreach (long i in Fibonacci().Take(10)) {
        Console.WriteLine(i);
    }

Without deferred execution, this would never complete.

Marc Gravell 2009-10-18 06:40:29

ansaurus

tags:

views:

answers:

Strange slowness while iterating over Linq result

related questions