I am wondering is there a way to get scanned image with text to be converted to readable text by writing code for it? Is that possible?
+2
A:
OCRTools is what I use for .net
For Java, i've used Aspire in the past, it's very good, a little scary though. I've heard a lot about tesseract, you might as well check that out.
If you are getting confused by the answer: What you are looking for is an Optical Character Recognition software's API/SDK. What you've asked in the question points to building an OCR from scratch. That requires understanding Image Processing (mainly object recognition).
lalli
2010-10-11 03:48:01
Curious about what's a little scary about Aspire. Are you able to share?
Kevin Day
2010-10-11 06:37:25
Actually I had Tiff images from a batch scan. Reading them itself was very confusing, I had to convert them using another library (ImageMagick or something) and then had to, by hit and trial, convert small parts of the image and remove the noise and all.... But that was version 1.something, now it's version 4...
lalli
2010-10-12 03:41:37