ocr

solution to OCR / search through 4 million pieces of paper and 10,000 added daily

I work for a medical lab company. They need to be able to search through all their client data. So far they have a few years in storage about 4 million pieces of paper, and they are adding 10,000 pages per day. For data that is 6 months old, they need to access it about 10-20 times per day. They are deciding whether to spend 80k on a sca...

Software to Improve OCR Results Based on Output from Multiple OCR Software Packages

Is there an already-existing piece of commercial or academic software that can overlay results from multiple OCR packages (Abbyy FineReader, Adobe Acrobat Professional, ReadIris, etc.) provide fully automated improvements based on accumulated knowledge from multiple sources allow for use of additional external tools setup at runtime (d...

Looking for Opensource OCR in Java

Am looking for an ocr library in java for scraping details from an image (IELTS Certificate image) http://pbrusilovskij.net/wp-content/uploads/2008/09/ielts-sprachzertifikatt.jpg Need to take out the details like Family Name,First Name etc from the image and put to database ...

OCR engines designed for screen-reading

Are there any OCR engines designed for identifying text in screen-captured images rather than scanned text? I have a project where I need to retrieve and identify text in an application, and none of the OCR engines I've tried so far have faired well with screenshots. Ideally the engine should work well with color and with background noi...

Best FREE OCR for JAVA

Is there any working OCR library for JAVA. It can call widows .dll - I don't mind it. I'm trying to overcome one easy captcha. Asprise OCR works sometimes, but mostly it misses one or two characters ...

OCR for mobile photos

Hi everybody, I'm thinking of developping a mobile OCR app to detect words from mobile pictures. The purpose if only to detect what words are in the picture, the layout is not important. Also it would be use on very short texts. I'm currently thinking of adapting tesseract to iphone and android. I wonder if anyone has had any related ...

How to perform: Upload Image > Recognize Text > Make Image Searchable > Store into DB?

Hey there, I need to know how to perform the procedure, you already have read in the title. You'll upload an image (e.g. a piece of text, an article) and on server-side the text will be recognized via OCR and stored into a database. Which would be the best programming language for it? It should be a browser application. I found the oc...

OCR recognition - gocr

Hi, I have a small problem while trying to do ocr using the tool gocr. It sometimes recognizes an o as zero and vice versa. To solve this, i tried to make it use a user specified database path. But doing that would require me to create a map for all possible characters. Is there any way in which i could tell gocr to just use the manual d...

extract text from image using leadtools

Has anyone done this before? I have leadtools on my machine but the examples they have provided is difficult to understand since i got not background from them, mfc and vb. All i want is to create a console application that accepts input and output paths. From the input paths, the tool will look for any tiff files and then load them in t...

What is best suited for license plate character recognition? Is it OCR or ANN?

Im a final year university student who is doing a license plate recognition system as my final year project. I want to know when recognizing the characters what suits the best, is it artificial nurel networks(ANN) or optical character recognition(OCR) using pattern matching? Or is there any easy method I can use? All the answered welcome...

TIFF IFilter not properly reading text in VARBINARY column.

link textI want to use the TIFF IFilter built in to Windows 2008 Server R2 with Full-Text search in SQL Server 2008... also R2. I have installed the filter through server manager and updated the "Force TIFF IFilter to perform OCR for every page in a TIFF document" Local Group Policy setting in Computer Configuration -> Administrative Te...

Looking for C++ OCR library

I am looking for an OCR library (commercial or free) that I can use of my C++ project. My project is mobile-based, so the library cannot depend on .Net or any framework like that. Can you recommend any? ...

OCR lib for math formulas

I need an open OCR library which is able to scan complex printed math formulas (for example some formulas which were generated via LaTeX). I want to get some LaTeX-like output (or just some AST-like data). Is there something like this already? Or are current OCR technics just able to parse line-oriented text? (Note that I also posted t...

OCR rotated image

Hi I have a couple of images including labels that have angles between 0 and 180 degrees. Any way to detect them using existing OCR tools or any algorithm for that? ...

From image to numbers

Hi everybody, I have some images that contain numbers written perfectly. These numbers can go from one to 4 characters. Is there a way to recognize and convert these numbers to text with PHP or Javascript? Thank you, Regards. ...

Are there any handwriting recognition libraries available for the iPhone?

I am looking for a simple (preferably open source) handwriting recognition library for the iPhone/iPad. In my case, I'm only interested in recognizing numbers. Does anyone know of a good library for this? ...

LEAD Tools Example

Can somebody show me an example of OCRing using LEADTools version 13 in C#? LEADTools doesn't provide documentations for .NET for version 13. What they have currently is version 16. Thanks. ...

Parsing a faxed form

Looking at a scenario where a form (consisting of, for simplicity sake, checkboxes only) is faxed to a fax server capable of OCR. Now, with typographic text, I've see various OCR implementations doing a decent job, but I'm not sure how it would handle checkboxes, especially handwritten "x" or checks, not to mention the coordinates. Bac...

OCR for known font

Hi, im searching for an OCR lib, that can be parameterized with a font, because I always know it and I believe the recognition results will be lots better this way. Does anyone know ? ...

OCR error correction: How to combine three erroneous results to reduce errors.

The problem I am trying to improve the result of an OCR process by combining the output from three different OCR systems (tesseract, cuneinform, ocrad). I already do image preprocessing (deskewing, despeckling, threholding and some more). I don't think that this part can be improved much more. Usually the text to recognize is between on...