I need to detect the bounding box(es) around portions of text in an image, and while there are quite a number of scholarly articles describing algorithms, I haven't found any implementations.
The specific problem I'm trying to solve is this:
Given an image that may or may not contain text, determine if the image does contain text, and if so, output the bounding rectangle around each area of text. (where "area" is defined by the algorithm, and hopefully it will err on the side of smaller areas vs. larger ones.)
Eventually, I'd like to turn the text into actual asci/unicode characters, but I think that is only tangentially related to this problem.
There are a number of tools out there that do OCR, (tesseract, Gocr, etc..) but they seem to only work on text that more or less fills the image with no real "image" content. (Eg: tesseract generates garbage when run on an image with subtitles.)
An implementation in java would be ideal, but I'm open to any cross-platform libraries/applications at this point.
Edit: I'm particularly interested in detecting artificial text, such as subtitles, or a HUD, which seems to be a simpler problem than detecting scene text, such as street signs. (Although scene text detection is even better.)