tesseract

OCR with the Tesseract interface

How do you OCR a TIFF file using Tesseract's interface in C#? Currently I only know how to do it using the executable. ...

OCR Web Service

I am searching for an OCR web service (possibly open source, preferably free) that simply receives an image and returns the text it contains. I've looked at tesseract, OCRopus and GOCR, but the only open server I could find is WeOCR. Unfortunately the detection rates (at least during my tests) are sub-par and the speed is n...

What is the ideal font for OCR?

Does anybody have any experience with different fonts for OCR? I am generating an ID and then trying to scan it with tesseract. At the moment I am just trying different fonts by trial and error, but this seems pretty inefficient. I've tried the OCR* family of fonts, and various others such as Arial and Georgia. Tesseract tends to get confused with the OCR* ...

Invoking via command line versus JNI

I need to invoke Tesseract OCR (it's an open source library in C++ that does Optical Character Recognition) from a Java application server. Right now it's easy enough to run the executable using Runtime.exec(). The basic logic would be: save the image that is currently held in memory to a file (a .tif), pass in the image file name to the tessera...
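The command-line route described in this question can be sketched roughly as follows. This is an illustration in Python rather than Java, and it rests on assumptions: a `tesseract` executable on the PATH that is called as `tesseract <image> <outputbase>` and writes its result to `<outputbase>.txt`. The final step of reading the text file back is also an assumption, since the excerpt is cut off before it. The names `ocr_bytes` and `runner` are hypothetical.

```python
import os
import shutil
import subprocess
import tempfile


def ocr_bytes(image_bytes, runner=subprocess.check_call):
    """OCR an in-memory TIFF by shelling out to the tesseract executable.

    `runner` is a hypothetical hook so the external call can be swapped
    out (for example in tests); by default the real command is run.
    """
    workdir = tempfile.mkdtemp()
    image_path = os.path.join(workdir, "input.tif")
    output_base = os.path.join(workdir, "output")
    try:
        # Step 1: save the image that is held in memory to a file.
        with open(image_path, "wb") as handle:
            handle.write(image_bytes)
        # Step 2: pass the image file name to the executable.
        # Assumption: tesseract appends ".txt" to the output base name.
        runner(["tesseract", image_path, output_base])
        # Step 3 (assumed): read the recognized text back from disk.
        with open(output_base + ".txt", "r", encoding="utf-8") as handle:
            return handle.read()
    finally:
        # Remove the temporary files whether or not the call succeeded.
        shutil.rmtree(workdir, ignore_errors=True)
```

The trade-off against a native binding is one process launch and two file round trips per image, in exchange for not linking the C++ library into the server process.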

How do I enlarge a picture so that it is 300 DPI?

The accepted answer to the question "C++ Library for image recognition: images containing words to string" recommended that you: "Upsize/Downsize your input image to 300 DPI." How would I do this... I was under the impression that DPI was for monitors, not image formats. ...
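One way to read the "300 DPI" advice: in an image file, DPI is a piece of metadata that ties pixel counts to a physical size, so resizing "to 300 DPI" amounts to scaling the pixel dimensions by the ratio of the target DPI to the current one. A minimal Python 3 sketch of that arithmetic, assuming the current DPI of the scan or screenshot is known (the function name `scale_to_dpi` is hypothetical):

```python
def scale_to_dpi(width_px, height_px, current_dpi, target_dpi=300):
    """Return the pixel size an image needs to reach target_dpi.

    The physical size (inches) is kept constant; only the number of
    pixels per inch changes.
    """
    if current_dpi <= 0:
        raise ValueError("current_dpi must be positive")
    factor = target_dpi / current_dpi
    return round(width_px * factor), round(height_px * factor)


# An 800x600 screenshot captured at 96 DPI:
print(scale_to_dpi(800, 600, 96))  # (2500, 1875)
```

The resulting size could then be handed to an image library's resize routine, for example `Image.resize` in PIL.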

Is there any open source OCR project for the iPhone out there?

I'm looking for an open source or free OCR project for the iPhone. Maybe there is some port of tesseract-ocr for the iPhone? What I've found so far: "How to compile Unix Apps for iPhone". Thanks for any help. ...

How to give best chance of success to an OCR software?

I am using Tesseract OCR (via pytesser) and PIL (Python Imaging Library) for automated testing of an application. I check that the displayed text is correct by taking a screenshot and extracting the text with tesseract. I had some issues in the beginning and it seems to work better since I have increased the size of the screenshot thanks...

Using Tesseract on Android

Hello. I am working on an Android project that uses the Tesseract OCR engine. I have been searching the internet for the past few days for any Tesseract support on the Android platform, but I didn't find any help. Please help me with how to use Tesseract on the Android platform, or how to install it on Android. Any kind of help is highl...

How can I get a libtiff TIFF object from a MagickWand object (in C)?

How can I get a libtiff TIFF object from a MagickWand object (in C)? I want to open any given image type with ImageMagick and run tesseract on it. Tesseract seems to use libtiff for its I/O, ImageMagick seems to use libtiff for its TIFF handling, so I figured I should somehow be able to use ImageMagick with tesseract without meddling i...

How can I install and launch tesseract-ocr using PHP

I am looking for an OCR component that converts images of text into characters using PHP. I got a script, tesseract-ocr, from Google Code. How can I install and launch tesseract-ocr through PHP? As I am a beginner in PHP, I can't make sense of the documentation they provided. I need some simple steps to install and launch it. thanks in ad...

Compiling tesseract-ocr on ARM/Gumstix?

Is it possible to compile tesseract-ocr for the Intel PXA270 found in certain Gumstix boards? Has anyone done this successfully, and if so, how did you do so? ...

.NET Application that uses Tessnet2 fails

Hi, I have an app that uses Tessnet2. The application fails when I try to call its Init method. The failure is just awful: no exception of any kind, no trace of an error, not even a "Send report" window. The app's process is just dropped. The problematic code is: Recogniser = new Tesseract(); Recogniser.SetVariable("tessedit_char_whitelist...

Python Tesseract OCR question

I have this image: I want to read it to a string using Python, which I didn't think would be that hard. I came upon tesseract, and then a wrapper for Python scripts using tesseract. So I started reading images, and it did great until I tried to read this one. Am I going to have to train it to read that specific font? Any ideas on ...

Tesseract OCR in C#

Just wondering if anyone has got a sample project or compiled DLL of the tesseract OCR engine running in C#? I have tried going through the tessnet2 demo (here) but for some reason, I can't install the C++ stuff in my current VS2008 installation so can't build it. Thanks! ...

Fraktur recognition with OCRopus/Tesseract on Linux

I am trying to perform recognition of a German text in a Fraktur typeface with ocropus, but it doesn't seem to be using the deu-f package. Here are the steps I performed. Compiled and installed tesseract and ocropus. Downloaded http://tesseract-ocr.googlecode.com/files/tesseract-2.01.deu-f.tar.gz, unpacked it to tessdata/. But when I cal...

Open source OCR for Chinese

I've searched around for open source OCR for Chinese, but without any luck: there rarely seems to be any usable open source OCR for Chinese. So I am here wondering: Is there any open source OCR for Chinese that could be used in a production environment? What are the main differences when implementing an OCR for Latin-languages...

Having a threading issue with tesseract/tessnet2, works with one thread but not two?

Howdy, I'm attempting to use OCR to automate some of the QA process we have in place at work. A QA worker gets one terminal open per document scanner, and as work orders are scanned, they appear in the terminal, and information from this terminal is entered into a different tool to be added to a database, then the QA worker signals they ...

Tesseract Example for iPhone

I have compiled tesseract for use in Xcode following: http://iphone.olipion.com/cross-compilation/tesseract-ocr I obtained the .a file, imported it as a framework in the iPhone project, and updated the header search path for the tesseract header files. So now I need a simple example in order to get started. I can't find anything in http :// co...

Limit characters tesseract is looking for

Is it possible to limit the set of characters that tesseract is looking for (e.g. search only for letters a-z)? That would improve my results greatly. ...
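As a rough illustration of the idea, the `tessedit_char_whitelist` variable that appears in another question on this page can be passed on the command line. The sketch below only builds the argument list. It assumes a tesseract build that accepts `-c VAR=VALUE` (older releases relied on config files instead), and whether the whitelist is honored may depend on the version and engine in use. The function name `whitelist_command` is hypothetical.

```python
import string


def whitelist_command(image_path, output_base, allowed=string.ascii_lowercase):
    """Build a tesseract command that restricts recognition to `allowed`.

    Assumption: the installed tesseract supports the -c VAR=VALUE option.
    """
    return [
        "tesseract",
        image_path,
        output_base,
        "-c",
        "tessedit_char_whitelist=" + allowed,
    ]


# Restrict recognition to the letters a-z:
command = whitelist_command("scan.tif", "scan_out")
```

The list can be passed to `subprocess.check_call` as-is. Keeping the command as a list rather than a single string avoids shell quoting issues when the whitelist contains punctuation.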

Extracting code from photograph of T-shirt via OCR

I recently saw someone with a T-shirt with some Perl code on the back. I took a photograph of it and cropped out the code: Next I tried to extract the code from the image via OCR, so I installed Tesseract OCR and the Python bindings for it, pytesser. Pytesser only works on TIFF images, so I converted the image in GIMP and entered the...