ansaurus

Question

Why can Linq operations be faster than a normal loop?

Answer 1

+1 A:

I would suggest that LINQ is only faster than a 'normal loop' when your algorithm is less than perfect (or you have some problem in your code). So LINQ will be faster at sorting than you are if you don't write an efficient sorting algorithm, etc.

LINQ is usually 'as fast as' or 'close enough to' the speed of a normal loop, and can be faster (and simpler) to code / debug / read. That's its benefit - not execution speed.

If it's performing faster than an empty loop, you are doing something wrong. Most likely, as suggested in comments, you aren't considering deferred execution and the LINQ statement is not actually executing.

Kirk Broadhurst 2010-09-21 01:10:00

As I said in a comment above, we printed the dataset it was producing and all seemed to be in order.

Pedery 2010-09-21 01:20:47

You would have to print it before the stopwatch stopped. Printing is too slow, however - best is to dump it into a List.

Kirk Broadhurst 2010-09-21 01:27:17

Hehe, comparing anything I write to anything found in the BCL, yes, all of my code is less then perfect. :O

Christopher Painter 2010-09-21 01:44:13

Answer 2

+8 A:

Undoubtedly you haven't actually performed the query, you've merely defined it. LINQ constructs an expression tree that isn't actually evaluated until you perform an operation that requires that the enumeration be iterated. Try adding a ToList() or Count() operation to the LINQ query to force the query to be evaluated.

Based on your comment I expect this is similar to what you've done. Note: I haven't spent any time figuring out if the query is as efficient as possible; I just want some query to illustrate how the code may be structured.

var dataset = ...

var watch = Stopwatch.StartNew();

var query = dataset.Where( d => dataset.Count( i => i == d ) > 1 );

watch.Stop();  // timer stops here

foreach (var item in query) // query is actually evaluated here
{
   ... print out the item...
}

tvanfosson 2010-09-21 01:11:43

Negatory: We printed the dataset to verify that it was actually doing what it was doing.

Pedery 2010-09-21 01:18:51

@Pedery -- but did you print it out from within the timed code section or outside it. Iterating over it to print it would cause it to be evaluated, but if you didn't time the loop where it was printed the evaluation would have been done outside the timing loop.

tvanfosson 2010-09-21 01:25:29

We assigned the result of the linq to a var, then printed it outside the timed area.

Pedery 2010-09-21 01:33:31

Yep, I see your edit. Regarding performance and the timer, your example is analoguous to our code. I think this explains the issue.

Pedery 2010-09-21 01:40:04

Please see the plethora of topics on "deferred execution" and Linq. These are exposed in both IEnumerable<T> (DataSet.Where, et. Al) and IQueryable<T> (for entities and Linq-to-Sql). The advantage is building a large Linq query without touching the objects that you are querying until absolutely needed for retrieval (the foreach var item portion of the code). At that point an enumerator is built on the resulting collection from the query.

Michael 2010-09-21 05:19:23

Answer 3

+1 A:

If you did not compile with "Optimize Code" enabled, you would probably see this behaviour. (It would certainly explain why the empty loop was not removed.)

The code underlying Linq, however, is part of already compiled code, which will certainly have been optimised (by the JIT, NGEN or similar).

Zooba 2010-09-21 01:24:58

+1 Good point! I forgot to check the computer's VS default settings regarding code optimizing.

Pedery 2010-09-21 01:35:52

ansaurus

tags:

views:

answers:

Why can Linq operations be faster than a normal loop?

related questions