ansaurus

Question

In Yahoo-Pipes, how to use regex when you can't see non-printable characters and html tags?

Answer 1

A:

What I do is use a regex tester (whichever uses the same regex engine that you are using) and I test my pattern on it. I've tried using text editors that display invisible characters but to me they only add to the confusion.

So I just go by trial and error. For instance, if a line ends in:

</a>

Then I'll try the following patterns on the regex tester until I find one that works:

</a>.
</a>..
</a>\s
</a>\s*
</a>\n
</a>\r
</a>\r\n

Etc.

Pessimist 2010-02-17 15:31:40

The question is not how to test generic regular expressions, but how to see what is actually the source string in yahoo-pipes, since it just shows you the the printable string and not the html.

CptanPanic 2010-02-17 17:42:51

I understood the question. My answer still applies. When I want to find out what kinds of non-printable characters some text is using, I paste that text onto a regex tester and run the above regexes against it until I get a match.

Pessimist 2010-02-18 08:15:33

Answer 2

A:

If the content of the string is HTML then debugger gives you a choice of viewing "HTML" or "Source". Source should show you any HTML tags that are there.

However if your concern is white space, this may not be enough. Your only option is to "view source" on the original page.

The best course of action is to explicitly handle these possibilities in your regex. For example, if you think you might be getting white space in your target string, use the \s* pattern in the critical positions. That will match zero or more spaces, tabs, and new lines (you must also have the "s" option checked in the regex panel for new lines).

However, without specific examples of source text and the regex you are using - advice can only be generic.

Gavin Brock 2010-02-18 06:42:57

ansaurus

tags:

views:

answers:

In Yahoo-Pipes, how to use regex when you can't see non-printable characters and html tags?

related questions