Hello

I have two string arrays, and I would like to return whether any element of _userRoles exists in the _authRoles array. How is that done?

string[] _userRoles = userdata.Split(',');
string[] _authRoles = AuthRoles.Split(',');

bool isAuthorized = _authRoles.Any(_userRoles ??);

/M

+4  A: 

Try this:

Boolean isAuthorized =
    _userRoles.Any(user => _authRoles.Contains(user));
Andrew Hare
+5  A: 

If what you want is to determine if _authRoles and _userRoles have at least one common item, then use:

bool isAuthorized = _authRoles.Intersect(_userRoles).Any();

You can also query the result of Intersect in any other way you choose.
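For instance, a couple of hypothetical follow-up queries on the intersection (variable names are illustrative):

var commonRoles = _authRoles.Intersect(_userRoles).ToList(); // materialise the shared roles
int matchCount = commonRoles.Count;                          // how many roles overlap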

Jon
+1. I haven't used `Any` in that fashion before; that's better than the solution that I proposed.
Adam Robinson
Thanks. It's one of those things that are immediately obvious... after they cross your mind for the first time. :-)
Jon
+2  A: 

Suppose the lists are of size N and M and that the likely scenario is no match. Andrew's solution is O(NM) in time and O(1) in extra memory. Adam's solution is O(N+M) in time and memory, but could be written more clearly as Jon's solution.

Another solution, basically the same as Adam's and Jon's, would be to join the two lists:

var joined = from user in userRoles 
             join auth in authRoles 
             on user equals auth 
             select user;
return joined.Any();

That's a bit heavier weight than necessary, but it reads nicely. :-)

Eric Lippert
I can't resist saying that having Eric Lippert reply without a better solution than yours definitely feels good. :-) Cheers Eric, from a longtime reader of your blog.
Jon
Are they definitely O(NM) and O(N+M), or does LINQ optimise it? With the `Any()` on the end of Jon's solution, it only needs to find one result before it can return.
ck
@ck: Big-O notation as O(f(x)) is specified as the limit as x goes to infinity. As such any early-out is irrelevant to the time complexity (unless you are always considering best-case O(1)).
Ron Warholic
@ck: Ron is a bit imprecise in his characterization of asymptotic order but his point is well taken. Note that I said "and that the likely scenario is no match". If the likely scenario is no match then you are always going to have to build the complete data structure and iterate the whole thing to discover that there is no match. Building efficient data structures that can be queried quickly *when the query is likely to be "junk"* is quite tricky.
Eric Lippert
About the O(N+M) speed, out of curiosity: how is it derived? I've been trying to figure out how you can do it in O(N+M) when you do not know (a) if the items can be less-than compared, or (b) if it is feasible and reasonable to use the items as array indexes, and I'm stumped. Maybe O(M+N) takes into account that strings are less-than comparable?
Jon
@Jon: Take the first list. Build a hash table HT out of that list. Cost to build HT is O(N) in time. For each item in the second list, ask HT if the item was in the first list. That's O(M) tests, each one costs O(1) in time if HT is well written. Total: O(N) + O(M) in time. Your idea about ordering is good though; if for some reason we didn't have an O(1) hash table we could at least build a binary searchable array at cost O(N lg N). Each search would cost O(lg N), so total cost would be O(N lg N) + O(M lg N).
Eric Lippert
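In C#, the two approaches Eric describes might look like this (a minimal sketch; `_authRoles` and `_userRoles` are the arrays from the question, and the sorted-array variant is the O(N lg N) fallback he mentions):

// Hash-set approach: O(N) to build, O(1) expected per probe, O(N+M) total.
var authSet = new HashSet<string>(_authRoles);
bool isAuthorized = _userRoles.Any(authSet.Contains);

// Fallback without hashing: sort once (O(N lg N)), then binary-search
// each of the M items at O(lg N) apiece.
var sortedAuth = (string[])_authRoles.Clone();
Array.Sort(sortedAuth, StringComparer.Ordinal);
bool isAuthorized2 = _userRoles.Any(
    u => Array.BinarySearch(sortedAuth, u, StringComparer.Ordinal) >= 0);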
@Eric: Hash tables crossed my mind (what can you do with `IEquatable` but not with `IComparable`?) but I dismissed it because surely the BCL can't be building HTs behind your back unless it has a good reason to believe that the overhead of setting up a HT will pay off? But on second thought, if you are writing the BCL then you will need conditional algorithm selection based on problem size to cater to all customers. Right?
Jon
Also about HTs: since the O in O(N) (building) is different than the O in O(M) (hashing/testing), can you really assert that it adds up to O(N+M)? Of course this is rather an academic question, building a HT is not going to be 10K times slower than looking items up in practice.
Jon
Hmm, just to correct my comment which mentions `IEquatable` and `IComparable`: of course I should have written "with `IEquatable` which is not also `IComparable`". Guilty of bad syntax and slow error detection. :-)
Jon
@Jon: I think you'll find that LINQ-to-objects builds hash tables on your behalf all the time. If you don't like its behaviour then feel free to write your own implementation of LINQ-to-objects; nothing is stopping you. And the implementation often doesn't *know* the size of the problem; if the input is not an IList then calculating the size is itself O(n).
Eric Lippert
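For illustration, `Intersect` in LINQ-to-objects is typically implemented along these lines (a sketch of the usual pattern, not the actual BCL source):

static IEnumerable<T> IntersectSketch<T>(IEnumerable<T> first, IEnumerable<T> second)
{
    // Buffer the second sequence into a hash set up front...
    var set = new HashSet<T>(second);
    // ...then stream the first; Remove both tests membership and
    // guarantees each common element is yielded only once.
    foreach (var item in first)
        if (set.Remove(item))
            yield return item;
}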
@Jon: Re: "The O in O(N) is different than the O in O(M)". I have not got the *faintest* idea what on earth you are talking about. Yes, I can and do assert that O(N) + O(M) = O(N+M). Can you explain what you mean by "The O in O(N) is different than the O in O(M)"? They sure look like the same O to me.
Eric Lippert
@Eric I like what works, so until the day comes when those hash tables are in my inner loop I probably won't bother. :-) But I would still find it hard to believe if someone told me "LINQ will build a hash table even if your collections have 3 items each and it knows that", that's all.
Jon
About the Os: for each of M items, build a proper binary tree with 20K nodes, and for each of N items, compute their hashes. As both M and N grow, the time will be, say, 1000 * M + N, which is less than 1000 * (M + N), which is still O(M + N). That's how I started thinking, but obviously I stopped too early. Thanks!
Jon
As a final aside: this exchange has also taught me that a non-naive implementation of `GetHashCode` can be very important even where this would not be immediately obvious. Thanks again.
Jon
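As an illustration of Jon's last point (a contrived example, not from the thread): a degenerate `GetHashCode` puts every key in the same bucket, so the O(1) expected probes of a hash set silently degrade to O(N) linear scans.

sealed class BadKey
{
    public string Value;

    public override bool Equals(object o)
    {
        var other = o as BadKey;
        return other != null && other.Value == Value;
    }

    // Naive hash: every instance collides, so a HashSet<BadKey>
    // degenerates into one long bucket chain per lookup.
    public override int GetHashCode() { return 0; }
}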