Reading PDF content with itextsharp dll in VB.NET or C# | ansaurus

+2 Q:

Reading PDF content with the iTextSharp DLL in VB.NET or C#

How can I read PDF content with iTextSharp using the PdfReader class? My PDF may contain plain text or images of text.

+1 A:

Hi user221185,

Check these links:

http://www.dotnetspider.com/forum/156957-read-pdf-content-vb-net.aspx

http://jadn.co.uk/w/ReadPdfUsingCsharp.htm

http://forums.asp.net/p/1408202/3097463.aspx#3097463

The link below contains the iTextSharp tutorials:

http://itextsharp.sourceforge.net/tutorial/ch01.html

If my answer solved your problem, please vote it up. Thanks.

Emaad Ali 2010-03-31 06:13:14

That's not really an answer; it's more of a "here is some info, work it out yourself".

Gordon Carpenter-Thompson 2010-06-28 15:32:40

+4 A:

You can't read and parse the contents of a PDF with iTextSharp the way you'd like to.

From iTextSharp's SourceForge tutorial:

You can't 'parse' an existing PDF file using iText, you can only 'read' it page per page.

What does this mean?

The pdf format is just a canvas where text and graphics are placed without any structure information. As such there aren't any 'iText-objects' in a PDF file. In each page there will probably be a number of 'Strings', but you can't reconstruct a phrase or a paragraph using these strings. There are probably a number of lines drawn, but you can't retrieve a Table-object based on these lines. In short: parsing the content of a PDF-file is NOT POSSIBLE with iText. Post your question on the newsgroup news://comp.text.pdf and maybe you will get some answers from people that have built tools that can parse PDF and extract some of its contents, but don't expect tools that will perform a bullet-proof conversion to structured text.

Jay Riggs 2010-03-31 15:27:27
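
As a follow-up to the answer above, here is a minimal sketch of what "reading page per page" can look like with the PdfReader class. It is not taken from either answer. It assumes an iTextSharp 5.x release that ships the iTextSharp.text.pdf.parser namespace with a static PdfTextExtractor.GetTextFromPage method; older releases may only offer the raw page content. The file name sample.pdf is a placeholder. The extracted text is best-effort, in line with the tutorial's warning: it is a flat string with no paragraph or table structure, and pages that contain only images of text will most likely yield nothing, since that case needs OCR.

```csharp
using System;
using iTextSharp.text.pdf;
using iTextSharp.text.pdf.parser;

class ReadPdfPages
{
    static void Main(string[] args)
    {
        // Placeholder path (an assumption); pass your own file as the first argument.
        string path = args.Length > 0 ? args[0] : "sample.pdf";

        PdfReader reader = new PdfReader(path);
        try
        {
            // iTextSharp numbers pages starting at 1.
            for (int page = 1; page <= reader.NumberOfPages; page++)
            {
                // The page's content stream: drawing and text operators,
                // with no paragraph or table structure attached.
                byte[] raw = reader.GetPageContent(page);

                // Best-effort text extraction (assumes iTextSharp 5.x with
                // the parser namespace). Returns a flat string.
                string text = PdfTextExtractor.GetTextFromPage(
                    reader, page, new SimpleTextExtractionStrategy());

                Console.WriteLine("Page {0}: {1} content bytes, {2} characters of text",
                    page, raw.Length, text.Length);
                Console.WriteLine(text);
            }
        }
        finally
        {
            // Release the underlying file handle.
            reader.Close();
        }
    }
}
```

If a page reports content bytes but no text, it is probably a scanned image, and iTextSharp alone will not recover the words from it.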
