ocr

.NET programmable graphics matching?

Does anyone know of a .NET programmable/usable API for reading an image file, and comparing it to an existing set of images? e.g. I have three pictures of the letters A, B, and C. I then copy the picture of A, and modify it so that it is flipped 180 degrees. I'd like to be able to have a piece of software that detects that it is a mat...

Any open source C# OCR library?

Hi, I couldn't get anything out of google, so I ask: Do you know some free open source C# OCR library? edit I need C# option, no coding of wrappers from C++ or similar stuff. the google library's website doesn't work - not sure if it ceased to exist or it's just unavailable which is very uncommon for Google websites. ...

What application does google use to show PDF attachments in gmail

I watched the traffic when google displays PDF attachments in gmail in a new window. The content is served as PNG images for each PDF page. And its text can be selected. What does google use on server side to generate a PNG file for a particular page in a pdf file? How does the selection of text on a png file work? Any ideas? ...

OCR: Convert edge into a vector path

Hello, I'm trying to build a simple OCR program as a project for an image processing class. So far I've been able to "read" an image containing just ones and zeros I differentiate between ones and zeros by first identifying edge contours, then counting the number of contours per character, such that 1=one contour, 0=two contours. ...

How to read text written on a image?

I need to parse some scanned documents to textual data. Is it possible to parse text written on a image using some software. If yes , please recommend any such online utility or software. ...

What are some popular OCR algorithms?

I've been interested in machine learning and computer vision for a while, so I've decided to attempt to build a simple Optical Character Recognition demo in C#. I'm looking for a description of some common OCR algorithms and how I would go about implementing them in C#. It's a learning exercise so I'm not looking for an OCR library. ...

OCR for .NET

Hello, I need an OCR component for .NET.I read all questions about it at SO,but I couldn't find what I needed. The MODI function from MS Office 2007 OCR makes the deployment of my program impossible,because all of my users must have MS Office 2007 on their computers. Is there a way to use the OCR from MS Office without the requirement...

Text detection / location libraries

I need to detect the bounding box(es) around portions of text in an image, and while there are quite a number of scholarly articles describing algorithms, I haven't found any implementations. The specific problem I'm trying to solve is this: Given an image that may or may not contain text, determine if the image does contain text, an...

high volume scanning and ocr automatic solution?

We need a high volume scanning and ocr solution we are talkin about digitalizing about 4000 documents a day, and saving them as pdf file with ocr (with hidden text)... the solution should let the operators scan a document and automatically save the files to a specific network resource, to be taken by an app that uploads it to a DB... ...

OCR (Optical Character Recognition) for on-screen text

I'm trying to create a piece of software that automate the PC by capturing the screenshot, then OCR (Optical Character Recognition) it looking for a particular button to click (for example). I've got the mouse and keyboard control part, but now, I needed an OCR to process the screenshot. What I discovered is that Tesseract OCR does not s...

Is there any open source OCR project for the iPhone out there?

I'm looking for an open source or free OCR project for the iPhone. Maybe there is some port of tesseract-ocr for the iPhone? What I've found so far: How to compile Unix Apps for iPhone Thanks for any help. ...

Java based OCR SDK/API

Are there any good OCR (optical character recognition) SDK or APIs in Java which will be able to convert TIFF files to txt files (or even html is good enough) with some sort of format retention? The challenge is to read a typical news magazine article and know that it has a header and certain number of paragraphs and pictures. I am Ok w...

How to recognize vehicle license / number plate (ANPR) from an image?

Hi all, I have a web site that allows users to upload images of cars and I would like to put a privacy filter in place to detect registration plates on the vehicle and blur them. The blurring is not a problem but is there a library or component (open source preferred) that will help with finding a licence within a photo? Caveats; I ...

OCR graph paper

I would like to take a pdf of a scanned graph paper notebook (with handwriting) and turn it into a text file. How can I do this? Thanks ...

Programmatically reading text from an image

hey guys My question is similar to this one 842986/how-to-read-text-written-on-a-image, except I'm looking for a way to do it programmatically. Essentially, does anyone know of a good, and simple, .NET compatible library, that is able to take in, say, a bitmap object or image object, and give me back any text on that image? cheers ...

What kind of OCR java library should i use in Android?

I would like to build an app that OCR a pictures and get text from them. What java library should i use? ...

WPF InkCanvas - how to determine if it has been "signed"

I'm using a WPF InkCanvas control to capture signatures in a Tablet PC application. One of my requirements is to validate whether or not the application has really been "signed". Right now I'm doing this by checking the Strokes collection of the InkCanvas - if there are 0 strokes, then I know the user has not "signed". However, if th...

Fast character detection

I don't want to know what it says, and it will not be dealing with any distortion like a CAPTCHA, I just want to know if a bunch of images contain any text. This is something that will be running on a couple of idle Linux servers, and a cron job will process a large batch of images multiple times a day. One of the things I want to do i...

optical character recognition of PDFs of parliamentary debates

Hi, For a contract work, I need to digitalize a lot of old, scanned-graphic-only plenary debate protocol PDFs from the Federal Parliament of Germany. The problem is that most of these files have a two-column format: I would love to read your answer to my following questions: How I can split the two columns before feeding them into...

android OCR?

I'm looking for a Java OCR that runs on Android, however Asprise doesn't seem to be a platform independent OCR. is there any opensource/free Java OCR I can use for android application development? ...