views:

1004

answers:

4

Does window.location.hash contain the encoded or decoded representation of the url part?

When I open the same url (http://localhost/something/#%C3%BC where %C3%BCtranslates to ü) in Firefox 3.5 and Internet Explorer 8, I get different values for document.location.hash:

  • IE8: #%C3%BC
  • FF3.5:

Is there a way to get one variant in both browsers?

A: 

Actually in my version of Firefox (3.5 on Linux), if I type "#%C3%BC" as a hash in the URL, the URL itself actually transforms to unicode with "#ü". But you have appeared to answered your own question -- in Firefox, the browser transforms entity escape codes in the URL, while in IE, it does not.

My advice is actually this: Instead of putting "#%C3%BC" in the URL at all, just use full unicode in your hashes and URLs. Is that an option? It should work fine in any modern browser.

Ken
No, it's not :(. Your Firefox (and mine too) is just pretending to use an `ü` character. In HTTP it always uses the percent-encoding. Move your mouse over that link: http://test/%C3%BC. The Firefox status bar shows an `ü` for some reason. But if you use an HTTP sniffer, you will find out, that it's submitting `%C3%BC`.And basically, because I'm using that one in a HTTP redirect, I can not directly use unicode characters anyway.
Michael
Are you sure that doesn't depend on the encoding being ASCII v a unicode encoding?
Ken
AFAIK there is no way to transfer unicode characters in HTTP without special preparation like the percent-encoding (because HTTP does not allow characters outside the ASCII range).
Michael
A: 

You can use decodeURIComponent, it will return in all cases:

decodeURIComponent('#%C3%BC'); // #ü
decodeURIComponent('#ü'); // #ü

Try it out here.

CMS
Not a solution because:`decodeURIComponent('%2540'); // %40 (IE)`but`decodeURIComponent('%40'); // @ (FF)`
Michael
Not really sure about what you mean, %2540 is the `%` character encoded (`%25`) and the non encoded `40` string, `decodeURIComponent('%40');` is @ in IE or Firefox... http://jsbin.com/esafe
CMS
Let's assume I wanted to use the hash for a search function and someone wants to search for `%40` (but not for `@`). Depending on his browser, I will get `#%2540` (IE) or `#%40` (FF) as `location.hash`. If I decode it then, I get different results in the different browsers.
Michael
+1  A: 

Answering to my own question, my current solution is to parse window.location.href instead of using window.location.hash, because the former is always (i.e. in every browser) url-encoded. Therefore the decodeURIComponent function CMS proposed can always be used safely. YUI does the same, therefore it can't be that wrong...

Michael
+4  A: 

Unfortunately, this is a bug in Firefox as it decodes location.hash an extra time when it is accessed. For example, try this in Firefox:

location.hash = "#%30";
location.hash === "#0"; // This is wrong, it should be "#%30"

The only cross-browser solution is to just use (location.href.split("#")[1] || "") instead for getting the hash. Setting the hash using location.hash seems to work correctly for all browsers that support location.hash though.

Eli Grey
Yep, that seems to be the most reasonable solution.
Michael