ansaurus

Question

Simple algorithm for online outlier detection of a generic time series

Answer 1

+2 A:

This is a big and complex subject, and the answer will depend on (a) how much effort you want to invest in this and (b) how effective you want your outlier detection to be. One possible approach is adaptive filtering, which is typically used for applications like noise cancelling headphones, etc. You have a filter which constantly adapts to the input signal, effectively matching its filter coefficients to a hypothetical short term model of the signal source, thereby reducing mean square error output. This then gives you a low level output signal (the residual error) except for when you get an outlier, which will result in a spike, which will be easy to detect (threshold). Read up on adaptive filtering, LMS filters, etc, if you're serious about this kind of technique.

Paul R 2010-08-02 18:13:45

Thank you for your response. This seems like a good approach, but what if the signal exhibits a high seasonality (i.e. a lot of network measurements are characterized by a daily and weekly pattern at the same time, for example night vs day or weekend vs working days)? Will an adaptive filter be able to model this aspect? In my ideal world, I would like to detect a peak of traffic happening during the sunday morning, while the same value might be completely normal on Monday.

gighi 2010-08-02 19:20:27

So long as you have enough terms in your filter to model the various periodicities then it should just work - the adaptive filter will remove these frequencies leaving just residual noise.

Paul R 2010-08-02 20:43:12

Thank you again, I would like to try an algorithm based on this stuff. Do you know if there are some general purpose libraries to do a preliminary simulation of this method, before doing a real implementation (which probably takes some time)?

gighi 2010-08-02 20:51:22

You can probably prototype this quite quickly and easily in MATLAB (or a free clone like Octave). See e.g. http://www.mathworks.com/matlabcentral/fileexchange/3649-lms-algorithm-demo

Paul R 2010-08-02 21:08:27

ansaurus

tags:

views:

answers:

Simple algorithm for online outlier detection of a generic time series

related questions