I have an apparently simple regex question for Yahoo Pipes: I need to truncate each item from its (<img>) tag onwards. I thought a Loop with a String Regex replacing <img[.]* with a blank field would take care of it, but to no avail.

Obviously I'm missing something basic here - can someone point it out?

The item as it stands looks something like this:

sample text title 
<a rel="nofollow" target="_blank" href="http://example.com"><img border="0" src="http://example.com/image.png" alt="Yes" width="20" height="23"/></a> 
<a.... (a bunch of irrelevant hyperlinks I don't need)...

Essentially I only want the title text and the hyperlink; that's why I'm chopping the rest off.

Going one better: all I'm really doing here is cutting the item string down to make it more manageable before further manipulation. Does anyone know if it's possible to extract the href from a particular link in the page (in this case the first one) using Regex in Yahoo Pipes? I've seen the regex answer to this on Stack Overflow, but I'm not sure how to use it to map a URL to an item attribute in a Pipes module.

A: 

You need to remove the line breaks first. Use a Regex module to replace the pattern [\r\n] with an empty string on the content or description field, so that the item becomes a single line of text. Then you can use the .* wildcard, which runs to the end of the line, so a pattern such as <img.* will match everything from the tag onwards.

http://www.yemkay.com/2008/06/30/common-problems-faced-in-yahoo-pipes/
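
The two replacement steps can be tried outside Pipes. Below is a minimal sketch using Python's re module, which is only a stand-in here and is not guaranteed to behave identically to the Pipes Regex module. The sample item is adapted from the question, and the helper names truncate_at_img and first_href are made up for illustration. Note that inside a character class the dot is a literal dot, so the original <img[.]* only matches "<img" followed by literal dots; the sketch uses <img.* instead. The second helper shows one possible way to pull out the first href, assuming the attribute is double-quoted.

```python
import re

# Sample item adapted from the question (the second link is illustrative).
item = (
    'sample text title \n'
    '<a rel="nofollow" target="_blank" href="http://example.com">'
    '<img border="0" src="http://example.com/image.png" alt="Yes" '
    'width="20" height="23"/></a> \n'
    '<a href="http://example.com/other">irrelevant link</a>\n'
)


def truncate_at_img(text):
    """Hypothetical helper: drop everything from the first <img onwards."""
    # Step 1: remove line breaks so the item is a single line.
    single_line = re.sub(r'[\r\n]', '', text)
    # Step 2: '.' now reaches the end of the item, so '<img.*' removes the rest.
    return re.sub(r'<img.*', '', single_line)


def first_href(text):
    """Hypothetical helper: return the first double-quoted href value, or None."""
    match = re.search(r'href="([^"]*)"', text)
    return match.group(1) if match else None


truncated = truncate_at_img(item)
print(truncated)
print(first_href(truncated))
```

In Pipes itself, the same two patterns would go into consecutive rules of a Regex module; how a captured group is mapped to an item attribute depends on the module and is not shown here.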