ansaurus

Question

"Join" of time series

Answer 1

A:

Union sounds like the right way to go - no query expression support, but I think it expresses what you mean.

You might be interested in looking at the Range-based classes in MiscUtil which can be nicely used for times. Combined with a bit of extension method fun, you can do:

foreach (DateTime day in 19.June(1976).To(DateTime.Today).Step(1.Day()))
{
    Console.WriteLine("I'm alive!");
}

I'm not suggesting this should replace whatever you're doing, just that you might be able to take some ideas to make it even neater. Feel free to contribute back, too :)

Jon Skeet 2008-10-27 21:38:20

I really don't like this example, since it extends a basic type (int) and that's just not good practice.

Omer van Kloeten 2008-10-27 21:46:58

The "building a DateTime" part is a little bit specious, but being able to build a range by doing 1.To(10) makes for very readable code IMO.

Jon Skeet 2008-10-27 21:55:33

Answer 2

A:

If I'm understanding the question correctly, you want to join multiple sequences based on their position within the sequence?

There isn't anything in the System.Linq.Enumerable class to do this as both the Join and GroupJoin methods are based on join keys. However, by coincidence I wrote a PositionalJoin method for just this purpose a few days back, used as in your example:

sequenceA.PositionalJoin(sequenceB, (a, b) => new { a, b });

The semantics of the method shown below is that it does not require the sequences to be of equal length, but it would be trivial to modify it to require this. I also commented out where the argument checking should be as it was using our internal helper classes.

public static IEnumerable<TResult> PositionalJoin<T1, T2, TResult>(
    this IEnumerable<T1> source1, 
    IEnumerable<T2> source2, 
    Func<T1, T2, int, TResult> selector)
{
    // argument checking here
    return PositionalJoinIterator(source1, source2, selector);
}

private static IEnumerable<TResult> PositionalJoinIterator<T1, T2, TResult>(
    IEnumerable<T1> source1, 
    IEnumerable<T2> source2, 
    Func<T1, T2, TResult> selector)
{
    using (var enumerator1 = source1.GetEnumerator())
    using (var enumerator2 = source2.GetEnumerator())
    {
        bool gotItem;
        do
        {
            gotItem = false;

            T1 item1;
            if (enumerator1.MoveNext())
            {
                item1 = enumerator1.Current;
                gotItem = true;
            }
            else
            {
                item1 = default(T1);
            }

            T2 item2;
            if (enumerator2.MoveNext())
            {
                item2 = enumerator2.Current;
                gotItem = true;
            }
            else
            {
                item2 = default(T2);
            }

            if (gotItem)
            {
                yield return selector(item1, item2);
            }
        }
        while (gotItem);
    }
}

Not sure if this is exactly what you're looking for, but hopefully of some help.

Greg Beech 2008-10-27 22:15:09

My details are more complicated, since two time series might not have the exact same time stamps, but the idea is the same as you describe.I was suspecting that the LINQ query syntax wouldn't quite suffice.Thanks for detailing why this is the case.

Bjarke Ebert 2008-10-27 23:50:48

Answer 3

+1 A:

From my NExtension project:

public static IEnumerable<TResult> Zip<T1, T2, TResult>(
    this IEnumerable<T1> source1, 
    IEnumerable<T2> source2, 
    Func<T1, T2, TResult> combine)
{
    if (source1 == null)
        throw new ArgumentNullException("source1");
    if (source2 == null)
        throw new ArgumentNullException("source2");
    if (combine == null)
        throw new ArgumentNullException("combine");

    IEnumerator<T1> data1 = source1.GetEnumerator();
    IEnumerator<T2> data2 = source2.GetEnumerator();
    while (data1.MoveNext() && data2.MoveNext())
    {
        yield return combine(data1.Current, data2.Current);
    }
}

Syntax is:

Stock1.Zip(Stock2, (a,b)=>a-b)

Cameron MacFarland 2008-10-27 22:27:35

Note that you should separate your argument checking and iteration into different methods - see http://blogs.msdn.com/ericlippert/archive/2008/09/08/high-maintenance.aspx for details.

Greg Beech 2008-10-27 22:30:05

Answer 4

+1 A:

Bjarke, take a look at NEsper, it's an open source Complex Event Processing app that amongst other things does SQL-like time series queries. You can either learn how they've done it, or perhaps even leverage their code to achieve your goal. link here http://esper.codehaus.org/about/nesper/nesper.html

endian 2008-10-27 22:31:29

Thanks! I'll have a look, at least to get some inspiration.

Bjarke Ebert 2008-10-27 23:51:59

ansaurus

tags:

views:

answers:

"Join" of time series

related questions