ansaurus

Question

Pattern matching text in the body of a PDF and adding hyperlinks with PHP

Answer 1

A:

<?
$s="
http://something.com?code=3000 asdf text
http://something.com?code=5000 asdf
";
echo preg_replace('/(http:\/\/something\.com\?code=(\d+))/s', '<a href="$1">$2</a>',$s);
?>

output 3000 asdf text

5000 asdf

JapanPro 2010-09-01 15:23:19

This is a truly irrelevant answer.

hristo 2010-09-05 13:44:18

Sorry, but regex won't work with PDF content streams.

Dwight Kelly 2010-10-18 13:41:37

Answer 2

A:

I'm pretty sure that you are saying you have ten digit numbers throughout your input text, and you want all ten digit numbers converted to links. Japan's answer does not do that - it converts URLs to links.

This should work for converting numbers:

<?
$s="some text with 1234567890 and then more text 
and then 1234512345 and then 
more text";
echo preg_replace('/(\d{10})/s', '<a href="http://something.com?code=$1"&gt;$1&lt;/a&gt;',$s);
?>

Output:

some text with <a href="http://something.com?code=1234567890"&gt;1234567890&lt;/a&gt; and then more text 
and then <a href="http://something.com?code=1234512345"&gt;1234512345&lt;/a&gt; and then 
more text

JGB146 2010-09-01 23:52:47

Guys I know regex for gods sake.

hristo 2010-09-05 13:44:41

Answer 3

+1 A:

Replacing text in a PDF is difficult and none of the open source PDF solutions support this capability.

Apago (www.apago.com) has a developed commercial solution for replacing text in PDF files. It's used by greeting card manufacturer to modify pricing, "MADE IN" text, product numbers, etc.

Dwight Kelly 2010-09-13 15:36:27

Ok - but what about finding the text and getting its bounding box, in order to draw an active transparent rectangle directly above it?

hristo 2010-09-14 14:26:21

Xpdf could be used to calculate the bbox of text. See the example TextOutput output device class. If you need something ready-to-go, contact [email protected] for more information about the tool I mentioned above.

Dwight Kelly 2010-10-18 13:40:33

ansaurus

tags:

views:

answers:

Pattern matching text in the body of a PDF and adding hyperlinks with PHP

related questions