ansaurus

Question

LINQ method for adding items to a dictionary

Answer 1

+1 A:

When I build your second example and then open it in Reflector's disassembly view, I get the following:

Dictionary<string, int> wordCountLINQ = allWordsLINQ.GroupBy<string, string>(delegate (string word) {
    return word;
}).Select<IGrouping<string, string>, IGrouping<string, string>>(delegate (IGrouping<string, string> groups) {
    return groups;
}).ToDictionary<IGrouping<string, string>, string, int>(delegate (IGrouping<string, string> g) {
    return g.Key;
}, delegate (IGrouping<string, string> g) {
    return g.Count<string>();
});

}

Probably it takes longer just because there are more function calls happening, and over the course of a million iterations that adds up.

Dathan 2010-01-22 16:29:13

That makes sense, is there a more "direct" way to do it in LINQ?

Matt Warren 2010-01-22 16:33:42

Not really, that I know of. Maybe by a different select expression? I'm out of my realm of experience as soon as group by is involved in the expression.

Dathan 2010-01-22 17:37:40

Answer 2

+1 A:

One of the reasons the LINQ version is slower, is because instead of one dictionary, two dictionaries are created:

(internally) from the group by operator; the group by also stores each individual word. You can verify this by looking at a ToArray() rather than a Count(). This is a lot of overhead you don't actually need in your case.
The ToDictionary method is basically a foreach over the actual LINQ query, where the results from the query are added to a new dictionary. Depending on the number of unique words, this can also take some time.

Another reason that the LINQ query is a little slower, is because LINQ relies on lambda expressions (the delegate in Dathan's answer), and calling a delegate adds a tiny amount of overhead compared to inline code.

Edit: Note that for some LINQ scenarios (such as LINQ to SQL, but not in-memory LINQ such as here), rewriting the query produces a more optimized plan:

from word in allWordsLINQ 
group word by word into groups 
select new { Word = groups.Key, Count = groups.Count() }

Note however, that this doesn't give you a Dictionary, but rather a sequence of words and their counts. You can transform this into a Dictionary with

(from word in allWordsLINQ 
 group word by word into groups 
 select new { Word = groups.Key, Count = groups.Count() })
.ToDictionary(g => g.Key, g => g.Count);

Ruben 2010-01-22 17:01:56

Can I modify the LINQ query to overcome these issues and still get the same result?

Matt Warren 2010-01-22 17:19:44

As far as I know, not in 3.5 or 4.0, no. For this to work, the ToDictionary and GroupBy operators would need to co-operate when you're only aggregating data. For in-memory LINQ that does not happen.

Ruben 2010-01-22 17:35:00

ansaurus

tags:

views:

answers:

LINQ method for adding items to a dictionary

related questions