ansaurus

Question

Two charset tags on a page, which to take?

Answer 1

A:

The HTML spec doesn't define the behavior here. You can't have two separate content-type tags in the same document. Since presumably you have to parse this document anyway, your best bet is to make an educated guess about the developer's intent.

Ryan Brunner 2009-08-05 14:53:31

Answer 2

+3 A:

I would do it heuristically:

Is everything actually ASCII? If so, it doesn't matter which you use.
Does it conform to valid UTF-8? If so, I'd use that.
Otherwise, use ISO-8859-1.
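
As a rough illustration, the three checks above could be sketched in Python like this. The function name `guess_charset` is made up for the example, and it assumes you already have the raw page bytes in hand (and Python 3.7+ for `bytes.isascii`):

```python
def guess_charset(data: bytes) -> str:
    """Best-guess charset for raw page bytes (hypothetical helper)."""
    # Check 1: pure ASCII decodes identically under UTF-8 and ISO-8859-1,
    # so the choice between the two declared charsets doesn't matter.
    if data.isascii():
        return "ascii"
    # Check 2: strict UTF-8 decoding raises on any invalid byte sequence.
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # Check 3: ISO-8859-1 maps every byte value to a character,
        # so decoding with it can never fail.
        return "iso-8859-1"
```

Note that the fallback always "succeeds", so a result of ISO-8859-1 only means the bytes weren't valid UTF-8, not that the page was definitely written in ISO-8859-1.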

You might want to look at the content-type header coming back from the web server, too...
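
If you do have the server's Content-Type header value as a string, one way to pull the charset out of it is with the standard library's `email.message.Message`. This is only a sketch; `charset_from_header` is a name invented for the example:

```python
from email.message import Message
from typing import Optional


def charset_from_header(content_type: str) -> Optional[str]:
    """Return the lowercased charset parameter, or None if absent."""
    msg = Message()
    msg["Content-Type"] = content_type
    # get_content_charset() reads the charset parameter of Content-Type
    # and returns it lowercased, or None when there isn't one.
    return msg.get_content_charset()
```

For a header such as `text/html; charset=UTF-8` this gives `"utf-8"`; for a bare `text/html` it gives `None`, in which case you are back to guessing from the bytes.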

Fundamentally the page is broken, but the above should give a reasonable "best guess."

Jon Skeet 2009-08-05 14:53:36
