ansaurus

Question

Answer 1

+3 A:

OpenCV has a Python interface that you might find useful.

kwatford 2009-09-19 18:23:26

Found this: http://opencv.willowgarage.com/documentation/python/pattern_recognition.htmlHowever somewhat confusing to me, and example isn't very helpful.

Gökhan Sever 2009-09-19 18:45:37

Answer 2

+3 A:

Scan every square (e.g. from the top-left, left-to-right, top-to-bottom)
When you hit a blue square then:

a. Record this square as a location of a new object

b. Find all the other contiguous blue squares (e.g. by looking at the neighbours of this square, and the neighbours of those neighbours, etc.) and mark them as being part of the same object
Continue to scan
When you find another blue square, test to see whether it's part of a known object before going to step 2; alternatively in step 2b, erase any square after you've associated it with an object

ChrisW 2009-09-19 18:29:19

I was thinking to implement something similar what you have mentioned here. Thanks for laying out clearly than I am :)

Gökhan Sever 2009-09-19 18:43:53

With some help I wrote a simple flood_fill implementation and can count the number of objects (ice particle shadows) on the given image. And, yes it includes recursion as I expected. Now I want to be able construct simple numpy arrays representing each object. (as additionally demonstrated on my question) In the end I am thinking of appending all these arrays in to an array, where later I can simply loop over and get their shapes. Shapes are important because I will use them for height and widths of ice particle objects.

Gökhan Sever 2009-09-20 19:27:29

Answer 3

+2 A:

Looking at the image you provided, all you need to do next is to apply a simple region growing algorithm. If I were using MATLAB, I would use bwlabel/bwboundaries functions. I believe there's an equivalent function somewhere in numpy, or use OpenCV with python wrappers as suggested by @kwatford

Amro 2009-09-19 18:31:15

SciPy package has a similar labelling function (http://docs.scipy.org/doc/scipy/reference/generated/scipy.ndimage.measurements.label.html#scipy.ndimage.measurements.label) which I could label all the pixels correctly by using it. But that only solves the first part of the problem. I should be able distinguish each entity separately. Unfortunately, there is no function for this. Needs to be written :)

Gökhan Sever 2009-09-19 18:48:36

The way I understand it, is that each blue object in the image is an entity, right?Then by labeling all pixels into regions, you just select all pixels labeled as one, then the ones labeled as two, and so on to get each object...Next step would be to extract some features for each object (boundaries, shape, area, size, ...), which would be used in a supervised classification task.

Amro 2009-09-19 19:58:23

I clarified my usage of entities in my question under NOTE section. For now, all I have to do is get the boundary rectangle or square for some cases width and height for each object.

Gökhan Sever 2009-09-20 19:40:41

To make this even similar [outside of matlab] all of the bw commands use the connected components algorithm.

monksy 2009-10-08 06:14:46

Answer 4

+2 A:

I used to do this kind of analysis on micrographs and eventually put everything I needed into an image processing and analysis package written in C, driven via Tcl. (It worked with 512 x 512 images only, which explains why 512 crops up so often. There were images with pixels of various sizes allocated, but most of the work was done with 8-bit pixels, which explains why there is that business of 0xff and maximum meaningful count of 254 on an image.)

Briefly, the 'zz' at the begining of the Tcl commands sends the remainder of the line to the package's parser which calls the appropriate C routine with the given arguments. Right after the 'zz' is an argument that indicates the input and output of the command. (There can be multiple inputs but only a single output.) 'r' indicates a 512 x 512 x 8-bit image. The third word is the name of the command to be invoked; 'graphs' marks up an image as described in the text below. So, 'zz rr graphs' means 'Call the ZZ parser; input an r image to the graphs command and get back an r image.' The rest of the Tcl command line specifies which of the pre-allocated images to use. (The 'g' image is an ROI, i.e., region-of-interest, image; almost all ZZ ops are done under ROI control.) So, 'r1 r1 g8' means 'Use r1 as input, use r1 as output (that is, mark up the input image itself), and do the operation wherever the corresponding pixel on image g8 --- that is, r8, used as an ROI --- is >0.

I don't think it is available online anywhere, but if you want to pick through the source code or even compile the whole shebang, I'll be happy to send it to you. Here's an excerpt from the manual (but I think I see some errors in the manual at this late date --- that's embarrassing ...):

Example 6. Counting features.

Problem

Counting is a common task. The items counted are called “features”, and it is usually necessary to prepare images carefully so that features correspond in a one-to-one way with things that are the real objects to be counted. Here, however, we ignore image preparation and consider, instead, the mechanics of counting. The first counting exercise is to find out how many features are on the images in the directory ./cells?

Approach

First, let us define “feature”. A feature is the largest group of “set” (non-zero) pixels all of which can be reached by travelling from one set pixel to another along north-south-east-west (up-down-right-left) routes, starting from a given set pixel. The zz command that detects and marks such features on an image is “zz rr graphs R:src R:dest G:ROI”, so called because the mathematical term for such a feature is a “graph”. If all the pixels on an image are set, then there is only a single graph on the image, but it contains 262144 pixels (512 * 512). If pixels are set and clear (equal to zero) in a checkerboard pattern, then there will be 131072 (512 * 512 / 2) graphs, but each will containing only one pixel. Briefly explained, “zz rr graphs” starts in the upper-left corner of an image and scans each succeeding row left to right until it finds a set pixel, then finds all the set pixels attached to that through north, south, east, or west borders (“4-connected”). It then sets all pixels in that graph to 1 (0x01). After finding and marking graph 1, it starts scanning again at the pixel after the one where it first discovered graph 1, this time ignoring any pixels that already belong to a graph. The first 254 graphs that it finds will be marked uniquely; all graphs found after that, however, will be marked with the value 255 (0xff) and so cannot be distinguished from each other. The key to being able to count any number of graphs accurately is to process each image in stages, that is, find the number of graphs on an image and, if the number is greater than 254, erase the 254 graphs just found, repeating the process until 254 or fewer graphs are found. The Tcl language provides the means to set up control of this operation.

Let us begin to build the commands needed by reading a ZZ image file into an R image and detecting and marking the graphs. Before the processing loop, we declare and zero a variable to hold the total number of features in the image series. Within the processing loop, we begin by reading the image file into an R image and detecting and marking the graphs.

zz ur to $inDir/$img r1
zz rr graphs r1 r1 g8

Next, we zero some variables to keep track of the counts, then use the “ra max” command to find out whether more than 254 graphs were detected.

set nGraphs [ zz ra max r1 a1 g1 ]

If nGraphs does equal 255, then the 254 accurately counted graphs should be added to the total, the graphs from 1 through 254 should be erased, and the count repeated for as many times as it takes to reduce the number of graphs below 255.

while {$nGraphs == 255} {
  incr sumGraphs 254
  zz rbr lt r1 155 r1 g1 0 255 
  set sumGraphs 0
  zz rr graphs r1 r1 g8
  set nGraphs [ zz ra max r1 a1 g8 ]
}

When the “while” loop exits, the variable nGraphs must hold a number less than 255, that is, a number of accurately counted graphs; this is added to the rising total of the number of features in the image series.

incr sumGraphs $nGraphs

After the processing loop, print out the total number of features found in the series.

puts “Total number of features in $inDir \
images $beginImg through $endImg is $sumGraphs.”

After the processing loop, print out the total number of features found in the series.

behindthefall 2009-09-20 14:27:17

This is a very lengthy answer. It will take some time to grasp your explanations :)

Gökhan Sever 2009-09-20 19:45:10

Note that I added para 2 as a minature intro to the package's Tcl commandline syntax. That might help your 'grasping'.BTW, there are other examples in the manual that show how to prepare images using morphological operations and then, after finding and counting the features, do things like making size distributions of them.I've never seriously tried to adapt it to Python, although it has been in my mind to do to for some time.

behindthefall 2009-09-21 00:18:15

Also BTW: 'graphs' basically uses the algorithm from Graphics Gems, Vol.I, p.721, by Paul Heckbert.

behindthefall 2009-09-21 00:25:14

Thanks for your lengthy explanations again. I have solved the problem, and it does what I need.

Gökhan Sever 2009-09-21 17:49:00

Answer 5

+2 A:

Connected component analysis may be what you are looking for.

Dima 2009-09-20 14:32:23

I knew that there must be a new for this technique :) Thanks for the pointers. Both pygraph (http://code.google.com/p/python-graph/) and opencv with Python wrappings have some support for these algorithm, however their usage isn't very clear to me. Have you used any of those functions?

Gökhan Sever 2009-09-20 19:43:30

ansaurus

tags:

views:

answers:

Simple object recognition

related questions