How might I index PDF files using Lucene.Net?

I'm looking for some sample code demonstrating how to index PDF documents using Lucene.Net and C#. Google turned up a few examples, but none that I found helpful.

Well, I found it easy enough. Did you Google for it?

http://www.google.com.br/search?q=index+pdf+lucene

Cleiton 2009-08-14 02:33:52

I did, but couldn't find a good one.

ldsenow 2009-08-14 03:04:14

Funnily enough, this question comes up in Google when "C#" is added to the search. It would be good to get some actual links in this question to help others who find this page in future.

John_ 2009-11-15 23:04:44

+2 A:

From my understanding, Lucene is limited to creating an index and searching that index. It's up to the application to handle opening files and extracting their contents for the index. So if you're looking to search PDF documents, you'll want to use something like iTextSharp to open each file, extract its text, and pass that text to Lucene for indexing. There are some good starting examples of using Lucene on the Dimecasts.net website.

JC Grubbs 2010-01-08 17:10:08
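
For anyone landing here looking for actual code, the approach described in the answer above can be sketched roughly as follows. This is only a minimal sketch, not a tested reference implementation. It assumes iTextSharp 5.x (for `PdfTextExtractor`) and Lucene.Net 3.0.3; other versions name some of these calls differently. The class name `PdfIndexer`, its method names, and the field names `path` and `contents` are made up for illustration.

```csharp
using System.IO;
using System.Text;
using iTextSharp.text.pdf;
using iTextSharp.text.pdf.parser;
using Lucene.Net.Analysis.Standard;
using Lucene.Net.Documents;
using Lucene.Net.Index;
using Lucene.Net.Store;

// Hypothetical helper class; assumes iTextSharp 5.x and Lucene.Net 3.0.3.
public static class PdfIndexer
{
    // Pull the plain text out of a PDF, one page at a time (pages are 1-based).
    public static string ExtractText(string pdfPath)
    {
        var text = new StringBuilder();
        var reader = new PdfReader(pdfPath);
        try
        {
            for (int page = 1; page <= reader.NumberOfPages; page++)
            {
                text.AppendLine(PdfTextExtractor.GetTextFromPage(reader, page));
            }
        }
        finally
        {
            reader.Close();
        }
        return text.ToString();
    }

    // Add one Lucene document: the path is stored so search hits can point
    // back to the file; the contents are analyzed for searching but not stored.
    public static void AddToIndex(IndexWriter writer, string path, string contents)
    {
        var doc = new Document();
        doc.Add(new Field("path", path, Field.Store.YES, Field.Index.NOT_ANALYZED));
        doc.Add(new Field("contents", contents, Field.Store.NO, Field.Index.ANALYZED));
        writer.AddDocument(doc);
    }

    // Index every *.pdf directly inside pdfFolder into a fresh index in indexFolder.
    public static void IndexFolder(string pdfFolder, string indexFolder)
    {
        var analyzer = new StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_30);
        using (var directory = FSDirectory.Open(new DirectoryInfo(indexFolder)))
        using (var writer = new IndexWriter(directory, analyzer, true,
                                            IndexWriter.MaxFieldLength.UNLIMITED))
        {
            foreach (FileInfo file in new DirectoryInfo(pdfFolder).GetFiles("*.pdf"))
            {
                AddToIndex(writer, file.FullName, ExtractText(file.FullName));
            }
        }
    }
}
```

Usage would be something like `PdfIndexer.IndexFolder(@"C:\docs\pdfs", @"C:\docs\index");` (both paths are placeholders). Passing `true` to the `IndexWriter` constructor rebuilds the index from scratch on every run. Keep in mind that scanned PDFs containing only images will yield little or no text this way, so they would need OCR before indexing.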
