views:

354

answers:

1

I'm trying to implement the following method in Matlab: Minimum Error Thresholding - By J. Kittler and J. Illingworth

You may have a look at the PDF:

My code is:

function [ Level ] = MET( IMG )
%Maximum Error Thresholding By Kittler
%   Finding the Min of a cost function J in any possible thresholding. The
%   function output is the Optimal Thresholding.

for t = 0:255 % Assuming 8 bit image
    I1 = IMG;
    I1 = I1(I1 <= t);
    q1 = sum(hist(I1, 256));

    I2 = IMG;
    I2 = I2(I2 > t);
    q2 = sum(hist(I2, 256));

    % J is proportional to the Overlapping Area of the 2 assumed Gaussians
    J(t + 1) = 1 + 2 * (q1 * log(std(I1, 1)) + q2 * log(std(I2, 1)))...
        -2 * (q1 * log(q1) + q2 * log(q2));
end

[~, Level] = min(J);

%Level = (IMG <= Level);

end

I've tried it on the following image: Letters

Original size image.

The target is to extract a binary image of the letters (Hebrew Letters). I applied the code on sub blocks of the image (40 x 40). Yet I got results which are inferior to K-Means Clusters method.

Did I miss something? Anyone has a better idea?

Thanks.

P.S. Would anyone add "Adaptive-Thresholding" to the subject tags (I can't as I'm new).

+2  A: 

Thresholding is a rather tricky business. In the many years I've been thresholding images I have not found one single technique that always performs well, and I have come to distrust the claims of universally excellent performance in CS journals.

The maximum error thresholding method only works on nicely bimodal histogram (but it works well on those). Your image looks like signal and background may not be clearly separated enough for this thresholding method to work.

If you want to make sure that the code works fine, you could create a test program like this and check both whether you get good initial segmentation, as well as at what level of 'bimodality' the code breaks down.

Jonas
Which method would you suggest for this kind of image?Thanks.
Drazick
Honestly? I don't know. I have never worked with this kind of images, and thus I have very little intuition for this problem.I have looked at the histogram for this, and I noticed that even though the background is horribly inhomogeneous, the foreground is mostly <0.12. In other words, your letters are fairly uniform in blackness, which means that the peak in the histogram corresponding to them is very pronounced, so you can simply cut the histogram at the first minimum after the highest peak and get a reasonable first approximation.
Jonas
This first approximation gives you a pretty good idea of which pixels belong to the background. You can then segment the background into the brighter and darker portions. Within each portion, you can then do hysteresis thresholding to improve your initial guess, i.e. determine a local threshold, so that pixels that are below that threshold and that are connected to any pixel of the initial guess (you may have to clean up the initial guess a bit) are kept, whereas pixels that make the threshold but are unconnected are discarded.
Jonas
Obviously, there are many, many other ways to segment this image. Just find one that robustly gives you a good starting point, and come up with some pre-processing and post-processing routines that allow you to apply your prior knowledge of the images you're analyzing. If you want to solve problems, as opposed to write a CS paper, you should not get caught up too much hunting for the perfect core algorithm. You want to write a program that does the job.
Jonas