I'm trying to fetch some HTML from various blogs and I've noticed that different providers use the same tag in different ways.
For example, here are two major providers that use the Generator differently:
Blogger: <meta content='blogger' name='generator'/>
(content first, name later and, yes, single quotes!)
Wordpress: <meta name="generator" content="WordPress.com" />
(name first, content later)
Is there a way to extract the value of content for all cases? (single/double quotes, first/last in the row)
Thank you.
P.S. Although I'm using Java, the answer would probably help more people if it where for Regular Expressions generally