ocr

Invoking via command line versus JNI

I need to invoke tesseract OCR (its an open source library in C++ that does Optical Character Recognition) from a Java Application Server. Right now its easy enough to run the executable using Runtime.exec(). The basic logic would be Save image that is currently held in memory to file (a .tif) pass in the image file name to the tessera...

Performing Optical Character Recognition on PDF's from Coldfusion using a Java or .NET Library?

I am looking to take a PDF and extract any text from it. I then want to make it available using Coldfusion's available Verity search to search the contents. Are there any libraries out there that do this quite well already? I am including Java or .NET (Java prefered) libraries in the scope since they can be called from CF. Any insigh...

Image Processing Library for C++

I need a library that can detect objects in an image (uses edge detection). This is NOT related to captchas. I am working on an MTGO bot that uses OCR and that works in any screen resolution. In order for it to port to any screen resolution my idea is to scan down narrow range on a results page (the cards that a player has can be listed ...

How to get text from the screen

There is some Win OS API call or so that would let one obtain text from the screen not via obtaining a snapshot and then doing OCR on it, but via API the idea is to get the text that is under the mouse that the user points to and clicks on. This is how tools like Babylon ( www.babylon.com ) and 1-Click Answers ( http://www.answers.com...

How to find blank field on scanned document image

I want my application to fill in a single field in a form that exists as an black-and-white image file. The form always starts as the same paper version, but by the time my application gets it from my users, it may have been scanned or faxed more than once. Because of that, the field I need is not in the same place in every file. My u...

Can a perceptron be used to detect hand-written digits?

Let's say I have a small bitmap which contains a single digit (0..9) in hand writing. Is it possible to detect the digit using a (two-layered) perceptron? Are there other possibilities to detect single digits from bitmaps besides using neural nets? ...

OCR: How to find the right ColorMatrix to define new colors?

Hi, I'm stuck right now with defining the dimension of each line. The list I want to scrape has various colors in it, and what disturbs me the most a selection: http://img18.imageshack.us/img18/210/lobbymd9.jpg As you can see the picture I try to analyze got a white background with green text. The selection background is grey with bl...

Using Lockbits to search for black pixels, but does this randomly?

Hi, I got a problem with Lockbits. I'm searching black pixels, and because it does not seem right, I'm using a WindowPopup to print the colors of every pixel, but it's like my program is using another picture. I marked the first found black pixel red. As you can see there are several other pixels that should match too. What am I doin...

OCR: How to compare images, sort unmatching out and do this fast?

Hi, I managed to have each character stored in a bitmap and am looking for a way to quickly determine which character it is. Therefore I'm about to store every possible character into an array of 1 and 0, and compare them to an array of the bitmap I just grabbed. I could do simple checks like compare how many black pixels I got, compa...

OCR image in .NET Compact Framework

Is there any library or code samples to convert an image to text in the .net compact framework? I have seen an app on a mobile phone that you can take a picutre of a business card and it will auto fill in a contact record so I know it is possible but I was wondering how to do it. I searched google and can't seem to find anything specifi...

Basic Pixel/Cell Counting Algorithm

Good night :) I am currently playing with the DevIL library that allows me to load in image and check RGB values per pixel. Just as a personal learning project, I'm trying to write a very basic OCR system for a couple of images I made myself in Photoshop. I am successfully able to remove all the distortions in the image and I'm left wi...

OCR in .NET

Hi everyone I need to use an OCR component within a .NET application that can recognise handwriting. We got the MODI component from MS office up and running, and it recognises printed text, but not handwriting. I don't mind paying. Does anyone have any recommendations? Many thanks Jim ...

Exact placement for XSL-FO

I have an item in an XML-FO page ( running through FOP ) when needs exact placement for an OCR. Whenever the page is edited (or an included page) its very difficult to get the OCR line back into place. Is it possible with FOP to specify exact placement. rather than having the item being pushed around by the previous items on the page? ...

OCR: limits of small character bitmaps and normalization

Hi, I want to use a neural network to determine which character is shown in a bitmap. I take these bitmaps from digital screenshots and I got them in b/w. I wonder, what dimensions these character bitmaps should have. Right now I'm using the top and bottom of the line, and the left+1 and right+1 bound of each charakter. I use these boun...

how to train a network using single layer perceptron for ocr?

I'm developing a program to recognize a character from a image using OCR techniques. Up till now I have used a method that scanned the image, but now I have to use neural networks. Please explain what is a single layer perceptron and how to use it to train the network. ...

OCR for sheet music

Im considering doing a small project as a part of my masters for doing ocr just for sheetmusic instead of text. I think PIL and Python would be fine for simple proof of concept O"notes"R. My question is: Has anyone got any "dont do it with PIL use xyz instead" or something in that alley? EDIT: My delicius links regarding the subject i...

Multi layer perceptron for OCR

Hi, I intend to use a multi layer perceptron network trained with backpropagation (one hidden layer, inputs served as 8x8 bit matrices containing the B/W pixels from the image). The following questions arise: which type of learning should I use: batch or on-line? how could I estimate the right number of nodes in the hidden layer? I in...

Image processing/enhancement algorithms for document OCR / readability?

I'm looking for algorithms, papers, or software to enhance faxes, images from cell phone cameras, and other similar source for readability and OCR. I'm mainly interested in simple enhancements (eg. things you could do using ImageMagick), but I'm also interested in more sophisticated techniques. I'm already talking to vendors, so for th...

OCR .NET Recommended

hi i have just tried a component for OCR in .NET and the results were pretty inacurrate. has anybody else been down this route before? can you please recommend a path for me to save me the time of evaluating lots of components that produce not satisfactory results. any recommendations much appreciated. i dont mind buying in or coding my...

FOSS Intelligent Character Recognition (ICR)

Does anyone know of an open source Intelligent Character Recognition (ICR) library? There seems to be plenty out there for OCR but I am focusing on recognizing handwriting and would love it if there was an open source solution out there. Thanks in advance, Jason ...