Hi stackoverflow friends, I'm trying to write a program to automate one of my more boring and repetitive work tasks. I have some programming experience, but none with processing or interpreting large volumes of data, so I am seeking your advice (both suggestions of techniques to try and things to read to learn more about doing this stuff).

I have a piece of equipment that monitors an experiment by taking repeated samples and displays the readings on its screen as a graph. The input to the experiment can be altered, and one of these alterations should produce a change in a section of the graph. I currently identify that change by eye, and it is what I'm looking for in the experiment. I want to automate this so that a computer looks at a set of results and spots the experiment input that caused the change.

I can already extract the results from the machine. Currently the results for a run are in the form of an integer array, with the index being the sample number and the corresponding value being the measurement.

The overall shape of the graph will be similar for each experiment run. The change I'm looking for will be roughly the same and will occur in approximately the same place every time for the correct experiment input. Unfortunately there are a few gotchas that make this problem more difficult.

  1. There is some noise in the measuring process, which means there is some random variation in the measured values between different runs, although the overall shape of the graph remains the same.

  2. The time the experiment takes varies slightly each run, causing two effects. First, a whole graph may be shifted slightly on the x axis relative to another run's graph. Second, individual features may appear slightly wider or narrower in different runs.

In both these cases the variation isn't particularly large, and you can assume that the only non-random variation is caused by the correct input being found.

Thank you for your time,

Pinky

+2  A: 

I think you're looking for information on Digital Signal Processing. It can range from very simple to very hard to understand. If, say, your pre-event signal was 0, and every signal after the relevant event was 1, you could just look for the first 1, figure out the time at which it occurred, and you'd be done. That's basically the limiting case of simplicity, and it might be a good place to start. Implement that, and you've got the beginnings of a sense of how to answer your question.

Now, then, you've got noise. So, say, pre-event might range from -10 to 10, and post-event might range from 90 to 110. Still simple: watch for the first value greater than 10. But of course it's never that simple. You might have to average a window of readings, might look for some threshold of change from the previous measurement, etc. In advanced cases, you could find yourself using transformations into other spaces, applying filters, pattern matching, and the like.

But from your description, it sounds like reasonably simple methods should do the job for you. Don't get intimidated by concepts like FFT - you probably don't need them, yet. For now, at least, assume that it can be solved simply. Start with a trivially simple (but insufficient) solution, and work your way towards the solution that works.
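To make the windowed-average idea concrete, here's a minimal sketch (the threshold of 10 and window size of 5 are placeholders taken from the example numbers above, not values tuned to your data):

static int firstCrossing ( int[] readings, double threshold, int window )
{
    // Slide a small averaging window along the readings to smooth out
    // noise, and report the first position whose mean exceeds the threshold.
    for ( int t = 0; t + window <= readings.length; ++t )
    {
        double sum = 0;
        for ( int i = 0; i < window; ++i )
        {
            sum += readings[t + i];
        }

        if ( sum / window > threshold )
        {
            return t;   // first window whose average crosses the threshold
        }
    }

    return -1;   // no crossing found
}

With the example ranges above, firstCrossing(results, 10, 5) would pick out roughly where the 90-110 region begins.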

Carl Manaster
Hi Carl, thanks for the advice. Digital signal processing is definitely the field, and I've amended the tags to include it. I'm currently working through an online book on DSP but as yet haven't come across any techniques that sound relevant to this problem; I assume it's just a matter of time.
+1  A: 

One technique worth looking at, if the sort of filter-and-threshold approach Carl suggests won't suffice, is Cross Correlation. The essence of this is pretty simple: if two data sets are reasonably similar, their dot product will be maximised when they align (because the highest values will be multiplied together). So you can get a good estimate of how to line them up by calculating this product at each offset and choosing the one that gives the highest result.

In a case like yours, the idea would be to have an "ideal" version of the curve shape you're looking for -- either generated from theory/simulation or by averaging the results of a number of good experimental curves identified and aligned by eye -- and compare it against the experimental data.

For simplicity, let's assume that the data set is longer than the ideal and has enough empty space at either end that we can ignore any boundary issues. Since you are looking for one specific event, it should be trivial to cut down your ideal to comply with this assumption. Crudely coded in Java, then, the process might go something like this:

int offset ( double[] data, double[] ideal )
{
    double cMax = -Double.MAX_VALUE;
    int tMax = 0;

    // Slide the ideal curve along the data, computing the dot product
    // at each offset; <= so the final valid alignment is included.
    for ( int t = 0; t <= data.length - ideal.length; ++t )
    {
        double c = 0;
        for ( int i = 0; i < ideal.length; ++i )
        {
            c += data[t + i] * ideal[i];
        }

        // Keep the offset that gives the strongest correlation.
        if ( c > cMax )
        {
            cMax = c;
            tMax = t;
        }
    }

    return tMax;
}
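As a quick sanity check of how this would be called (the arrays here are invented purely for illustration), something like:

double[] runData  = { 0, 1, 0, 2, 9, 10, 8, 2, 1, 0 };   // one run's measurements
double[] template = { 8, 10, 9 };                        // the "ideal" event shape

int t = offset( runData, template );
System.out.println( "Best alignment at sample " + t );   // prints 4 for this data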

Obviously, there are plenty of situations in which this approach can fail, particularly if there is a significant amount of non-independent noise or if there are periodicities in the signal that give rise to aliasing. Also, this example throws away a lot of information to focus just on an absolute maximum, which may be error-prone if there isn't a large, narrow peak in the cross correlation. But from your description it seems like your problem could be fairly amenable to something along these lines.
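If amplitude variation between runs turns out to be a problem, one standard refinement (my suggestion, not part of the approach above) is normalised cross correlation: subtract each window's mean and divide by the magnitudes, so the score reflects shape similarity rather than raw signal level. A sketch, with the same boundary assumptions as offset():

static int offsetNormalised ( double[] data, double[] ideal )
{
    int n = ideal.length;
    double cMax = -Double.MAX_VALUE;
    int tMax = 0;

    for ( int t = 0; t + n <= data.length; ++t )
    {
        // Means of the current data window and of the ideal curve.
        double sumD = 0, sumI = 0;
        for ( int i = 0; i < n; ++i )
        {
            sumD += data[t + i];
            sumI += ideal[i];
        }
        double meanD = sumD / n, meanI = sumI / n;

        // Correlation coefficient for this offset, roughly in -1..1.
        double num = 0, magD = 0, magI = 0;
        for ( int i = 0; i < n; ++i )
        {
            double d = data[t + i] - meanD;
            double e = ideal[i] - meanI;
            num  += d * e;
            magD += d * d;
            magI += e * e;
        }

        double denom = Math.sqrt( magD * magI );
        double c = ( denom == 0 ) ? 0 : num / denom;

        if ( c > cMax )
        {
            cMax = c;
            tMax = t;
        }
    }

    return tMax;
}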

walkytalky