ocr

.NET OCRing an Image

I'm trying to use MODI to OCR a program's window. It works fine for screenshots I grab programmatically using win32 interop like this: public string SaveScreenShotToFile() { RECT rc; GetWindowRect(_hWnd, out rc); int width = rc.right - rc.left; int height = rc.bottom - rc.top; Bitmap bmp = new Bitmap(width, height)...

Free OCR library

Can anyone suggest a free OCR library for .NET or Java? I need to convert images containing text to Word documents. The input images will be of type .tiff, .gif, .bmp, .jpg. ...

Image processing / super light OCR

I have 55 000 image files (in both JPG and TIFF format) which are pictures from a book. The structure of each page is this: some text --- (horizontal line) --- a number some text --- (horizontal line) --- another number some text There can be from zero to 4 horizontal lines on any given page. I need ...
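The excerpt is cut off before it says what should be done with the pages, so the following is only a sketch of one possible way to locate the horizontal lines it describes. It assumes each page has already been binarized into rows of 0 (light) and 1 (dark) pixels; the function name `count_horizontal_lines` and the `min_fill` threshold are hypothetical and not taken from the question.

```python
from typing import List


def count_horizontal_lines(page: List[List[int]], min_fill: float = 0.8) -> int:
    """Count horizontal rules in a binarized page (1 = dark pixel, 0 = light).

    A pixel row counts as part of a rule when at least `min_fill` of its
    pixels are dark. Consecutive qualifying rows are merged into one rule,
    so a line that is several pixels thick is counted once.
    """
    lines = 0
    in_line = False
    for row in page:
        # An empty row can never be part of a rule.
        filled = bool(row) and sum(row) / len(row) >= min_fill
        if filled and not in_line:
            lines += 1  # first row of a new rule
        in_line = filled
    return lines


# Tiny hand-made page: text rows are sparse, rules are fully dark.
sample_page = [
    [0, 1, 0, 1, 0],  # text
    [1, 1, 1, 1, 1],  # rule (2 px thick)
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],  # a number
    [1, 1, 1, 1, 1],  # second rule
    [0, 1, 1, 0, 0],  # text
]
line_count = count_horizontal_lines(sample_page)
```

Text rows rarely fill most of the page width, which is why a simple fill-ratio threshold can separate them from rules. Real scans would likely need deskewing and a tuned threshold first.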

How to recognize an image in the iPhone SDK?

Hi, I want to ask: is there any API to recognize an image? I need to read data from an image... Can I read numbers that are printed on an image? ...

OCR Playing Cards

I decided to do a project for fun where I want to take as input the image of a playing card and return its rank and suit. I figure that I only need to look at the upper-left corner, since that has all the information. It should be robust - if I have a large image of an Ace of Diamonds, I should be able to scale it anywhere from 20 to 200% a...

Simple OCR programming tutorials/articles

I'm interested in simple OCR methods and algorithms. And by simple I mean simple! Best would be a tutorial/article/documentation without dependencies on third-party libraries, if that's even possible. I would really like to build up my knowledge from the ground up. The programming language doesn't matter. Thanks in advance! Edit: An ...

OCR'ing a graph, table, or chart for 508 compliance.

I'm using PRIME RECOGNITION's OCR program and I was wondering if there was a program out there that can "make sense" of graphs, charts, and forms. I need them to have a reading order and be text-searchable. So instead of using lots of figure and text zones to describe them, is there a way to enter in the data and have a program recreate it in a ...

How to give OCR software the best chance of success?

I am using Tesseract OCR (via pytesser) and PIL (Python Imaging Library) for automated testing of an application. I am checking that the displayed text is OK by taking a screenshot and extracting the text with Tesseract. I had some issues in the beginning and it seems to work better since I have increased the size of the screenshot thanks...

How do I use MODI in an ASP.NET Web Application?

I've written an OCR wrapper library around the Microsoft Office Document Imaging COM API, and in a Console App running locally, it works flawlessly, with every test. Sadly, things start going badly when we attempt to integrate it with a WCF service running as an ASP.NET Web Application, under IIS6. We had issues around trying to free u...

How to compute similarity of images with small color differences for font-recognition?

I'm trying to do some simple and fast comparison of image similarity. All of the images are the same size. The color differences are quite subtle. Currently, I calculate a difference measure by summing the differences in the R, G, and B values between the images, sort of a Hamming distance. That is, Abs(P1.R - P2.R) + Ab...
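The measure described in this excerpt can be sketched as follows. This is a minimal illustration, not the asker's code: it assumes each image is a flat sequence of `(R, G, B)` tuples of equal length, and the function name `rgb_abs_difference` is hypothetical. Strictly speaking, summing absolute channel differences is a Manhattan (L1) distance rather than a Hamming distance.

```python
from typing import Sequence, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each channel 0..255


def rgb_abs_difference(image_a: Sequence[Pixel], image_b: Sequence[Pixel]) -> int:
    """Sum of |R1-R2| + |G1-G2| + |B1-B2| over all corresponding pixels.

    Both images must contain the same number of pixels, matching the
    excerpt's statement that all images are the same size.
    """
    if len(image_a) != len(image_b):
        raise ValueError("images must have the same number of pixels")
    return sum(
        abs(pa[0] - pb[0]) + abs(pa[1] - pb[1]) + abs(pa[2] - pb[2])
        for pa, pb in zip(image_a, image_b)
    )


# Two 2-pixel images that differ slightly in one pixel.
first = [(0, 0, 0), (10, 20, 30)]
second = [(0, 0, 0), (12, 18, 30)]
difference = rgb_abs_difference(first, second)  # |10-12| + |20-18| + |30-30| = 4
```

A result of 0 means the images are identical. Larger values mean greater difference, so ranking candidates by this score in ascending order puts the closest match first.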

OCR - most "different" or "recognizable" ASCII characters?

I am looking for a way to determine the most "different" or "recognizable" N ASCII characters... For example, if N = 10, what would be the most different N characters in the ASCII set from 0x21 to 0x7E? Obviously, the character "X" is very different than "O" (the letter), but "O" (the letter) is very similar to "0" (zero). Assuming a ...

Java: read text from image file

What is the best open-source Java library that I can use to extract text from image (JPG, PNG) files? ...

Best OCR for extracting text from a plot?

Does anyone know of a good OCR tool that is able to convert this image into text? I tried Tesseract but it didn't work out the way I expected. ...

How to embed external OCR into existing PDF?

I have a set of images over which I run an OCR application. This process results in an XML file with character offsets. Then I convert the images to PDF using Acrobat 9. Now, I would like to add the XML file information as an invisible text layer into the PDF in order to achieve a searchable PDF. Is there an easy and free way? Some detai...

How to know if a PDF contains only images or has been OCR scanned for searching?

I have a bunch of PDF files that came from scanned documents. The files contain a mix of images and text. Some were scanned as images with no OCR, so each PDF page is one large image, even where the whole page is entirely text. Others were scanned with OCR and contain images and searchable text where text is present. In many cases ev...

Scanned PDF (Please Help Me)

Dear friends, I have scanned PDF files. I need VB.NET source code that converts a scanned PDF to text format. Please help me... Thanks in advance ...

Screen scraper application (not HTML)

Hello. I need a screen scraper application that recognizes text from the screen (without using WinAPI to do so, so the source could be an image file). I found a lot of commercial solutions, but I need something open source or free. I plan to include it in my C# project, so there should be some SDK available. Thanks. ...

C# .NET OCR library/components

What are the best paid OCR libraries for .NET/C#? We are looking for something reasonable that is backed by good support. ...

Any free OCR Library for C#?

Is there any free OCR library for C#? Thanks. ...

Class not registered Exception - Microsoft Office 2003 OCR with MODI-C# Windows Application

I am trying to load TIFF images in my C# Windows application. For this to work I have added a reference to MDIVWCTL - MODI. I also have Microsoft Office 2003 installed on my machine. What could be the possible cause of this exception? ...