ansaurus

Question

Check if all items in a Collection have the same value.

Answer 1

+3 A:

Edit: to address Timwi's concerns about 3 enumerators:

bool same = <your default> ;
var first = items.FirstOrDefault();
if (first != null)  // assuming it's a class
{
   same = items.Skip(1).All(i => i.Template.Frequency == first.Template.Frequency); 
}

Which still uses 2 enumerators. Not an issue for the average List<>, but for a DB query it might pay to use the less readable:

bool same = <your default> ;
Item first = null;

foreach(var item in items)
{
    if (first == null)
    {
        first = item;
        same = true;
    }
    else
    {
        if (item.Template.Frequency != first.Template.Frequency)
        {
           same = false;
           break;
        }
    }
}

Henk Holterman 2010-09-10 19:27:19

you're code does not do the same comparison as OPs. He's comparing a property of a property. Yours is doing a ref comparison

Rune FS 2010-09-10 19:32:39

@Rune: I'll add that for completeness but it's not a fundamental difference. Unless some properties are null which would make it a mess.

Henk Holterman 2010-09-10 19:34:11

This is the same as the `Trick #2` in [my second answer](http://stackoverflow.com/questions/3687597/check-if-all-items-in-a-collection-have-the-same-value/3687724#3687724). It instantiates two enumerators and evaluates the first item twice — and if you don’t want it to throw, you’d need a third to check for `Any()` first. This can be a performance problem if the collection is lazy and the first element takes a while to compute.

Timwi 2010-09-10 19:34:54

@Henk I agree that it's not fundamental to the _way_ of solving the task your proposing but I disagree with it not being fundamental to the _solution_ since the code as is would yield an incorrect result quite often (as I'm sure you are aware of)

Rune FS 2010-09-10 19:40:28

@Timwi but on the upside this approach is a lot more readable since it can be written so that it almost reads as the requirements

Rune FS 2010-09-10 19:41:30

@Timwi: The name Collection suggests an enumerator isn't expensive, but we don't really know.

Henk Holterman 2010-09-10 19:41:38

@Rune: No, I don't see the big problem.

Henk Holterman 2010-09-10 19:44:24

@Henk: I think his problem is that, in your option1, you're comparing items of the collection directly (by ref), but in option 2, you're comparing items by item.Template.Frequency... If they're classes, your first solution will always say every item is unique (since they're different references)

Reed Copsey 2010-09-10 21:26:02

@Reed: correct, I was _thinking_ (but not writing) something like `x.Value`. I'll delete the first part.

Henk Holterman 2010-09-10 21:33:23

Henk, your new version now has a serious bug. It will say all the items in the collection are the same if the first item is `null`.

Timwi 2010-09-11 00:48:10

@Timwi: I don't think the collection contains `null` values.

Henk Holterman 2010-09-11 08:53:21

@Henk: How do you know? Your code doesn’t check for that. When you write a method that takes a collection of items, your method needs to be prepared to be given *any collection* and not have some unexpected, hard-to-debug side-effect when given something that you didn’t happen to think anyone would give it.

Timwi 2010-09-11 22:49:49

@timWi: if you look carefully you might see that I didn't write a method at all. And I minimize errorhandling in answers.

Henk Holterman 2010-09-12 07:10:42

Answer 2

+2 A:

It would be faster like this:

public static bool IsQuantized(this MeasurementCollection items)
{
    if(items == null || items.Count == 0)
       return true;

    var valueToCompare = items.First().Template.Frequency;

    return items.All(i => i.Template.Frequency == valueToCompare);
}

It will return false on the first item's template frequency that is different, while in your code, the algorithm passes the whole collection.

Ivan Ferić 2010-09-10 19:27:32

This assumes that the collection is indexable.

Timwi 2010-09-10 19:33:11

There, fixed it!

Ivan Ferić 2010-09-10 19:45:31

The OP put an #arrays tag on this question, which may indicate that MeasurementCollection is implemented as an array or similar structure.

StriplingWarrior 2010-09-10 19:46:09

@Timwi: there is an [arrays] tag

Henk Holterman 2010-09-10 19:47:00

① The edited version returns `true` when the input is `null`, thus masking a likely bug. It should throw `ArgumentNullException` instead. ② This still assumes that the collection has a `Count`. Admittedly, the name suggests that it does, but the question doesn’t state that, and the question asked about a solution *in LINQ*, which works with pure enumerables.

Timwi 2010-09-11 00:50:35

Answer 3

A:

First of a general linq advise. If you just what to know if there's exactly one in a collection use Single() or SingleOrDefault(). Count will potentially iterate the entire collection which is more than you need since you can bail out if there's two.

public static bool IsQuantized(this MeasurementCollection items)
        {
            var first = items.FirstOrDefault();
            return first != null && items.Skip(1).All(i => first.Template.Frequency == i.Template.Frequency));
        }

Rune FS 2010-09-10 19:28:14

Why the downvote?

Rune FS 2010-09-10 19:37:13

Not my -1 but your code seems like it wouldn't deal with an empty collection very well. And you were the perfectionist.

Henk Holterman 2010-09-10 20:10:33

@Henk you're right so I've updated my answer. I would however call it perfectionistic to comment on an implementation that in _most_ case would yield an incorrect answer. Neither do I see how praising your general approach for it's readability is perfectionistic :)

Rune FS 2010-09-10 20:45:16

Answer 4

+1 A:

You could just find the first value and check if ANY others are different, this will avoid having to eval the whole collection (unless the single different value is the last one)

public static bool IsQuantized(this MeasurementCollection items)
{
    if(!items.Any())
        return false; //or true depending on your use case

    //might want to check that Template is not null, a bit a violation of level of demeter, but just an example
    var firstProp = item.First().Template.Frequency;

    return !items.Any(x=> x..Template.Frequency != firstProp);

}

Christopherous 5000 2010-09-10 19:40:14

Answer 5

A:

I got a bit of inspiration and thought about a solution with only speed in mind. This is really not that readable (which I usually preferre) but the characteristics when it comes to speed should be pretty good.

The once case is the same for most of the other implementations O(n) but it's highly unlikely since it would require all the first half of the elements to be the equal and the second half to all be equal but not equal to the value in the first half. and would require the same number of comparisons as a linear search. In most other cases with the first odd one in a random place this will require half as many comparisons as the linear. In the case where the values are in pairs. So that item[0] == item[1] and item[2] == item[3] and item[0] != item[2] (and similar) then the linear search will be faster. In general with either random data or few odd once this should be faster than a linear search

public static bool AllSame<T>(this IEnumerable<T> source,
                              IEqualityComparer<T> comparer = null)
        {
            if (source == null)
                throw new ArgumentNullException("source cannot be null.", "source");

            if (comparer == null)
                comparer = EqualityComparer<T>.Default;
            var enumerator = source.GetEnumerator();

            return source.Zip(comparer);
        }

        private static bool Zip<T>(this IEnumerable<T> sequence, IEqualityComparer<T> comparer)
        {
            var result = new List<T>();
            var enumerator = sequence.GetEnumerator();
            while (enumerator.MoveNext())
            {
                var first = enumerator.Current;
                result.Add(enumerator.Current);
                if (enumerator.MoveNext())
                {
                    if (!comparer.Equals(first, enumerator.Current))
                    {
                       return false;
                    }
                }
                else
                {
                    break;
                }
            }
            return result.Count == 1 ? true : result.Zip(comparer);
        }

with out tail call optimization this uses extra memory (with a worst case of an amount of memory close to the amount of memory used for the original source). The call stack shouldn't get to deep though since no IEnumerable concrete implementations (at least that I'm aware of) can hold more than int.MaxValue elements. which would require a max of 31 recursions.

Rune FS 2010-09-10 20:36:13

ansaurus

tags:

views:

answers:

Check if all items in a Collection have the same value.

related questions