I am seeing strange redirect behavior with URLs that contain percent-encoded characters. For example, the following two URLs differ ONLY in the case of the "e/E" in the first percent-encoded byte (i.e. "%e2" versus "%E2").
URL 1: http://youlookfab.com/welookfab/topic/your-favourite-80%e2%80%99s-music-bands
- "200 OK" HTTP status, page loads fine
URL 2: http://youlookfab.com/welookfab/topic/your-favourite-80%E2%80%99s-music-bands
- causes a "302 Found" redirect
- in a browser, the page redirects to the correct URL above (lowercase "e")
- using web-sniffer.net, the content length is zero
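For anyone who wants to reproduce this without a browser, here is a minimal sketch in Python. It relies on the fact that the standard library's http.client does not follow redirects, so the raw status, Location header and body length are visible. The helper name probe is made up for illustration, and it only handles plain http:// URLs.

```python
import http.client
from urllib.parse import urlsplit

def probe(url):
    """Request url once, without following redirects.

    Returns (status code, Location header or None, body length in bytes).
    Hypothetical helper; plain HTTP only.
    """
    parts = urlsplit(url)
    # port is None when the URL has no explicit port; http.client then uses 80
    conn = http.client.HTTPConnection(parts.hostname, parts.port, timeout=10)
    try:
        target = parts.path or "/"
        if parts.query:
            target += "?" + parts.query
        # The path is sent exactly as written, so "%e2" and "%E2" stay distinct
        conn.request("GET", target)
        resp = conn.getresponse()
        body = resp.read()
        return resp.status, resp.getheader("Location"), len(body)
    finally:
        conn.close()
```

Calling probe() on URL 1 and then on URL 2 should show whether the server itself answers with the 302 and what Location it sends back, independent of any browser behavior.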
I originally started looking into this because Google Webmaster Tools was showing crawl errors ("Redirect error", to be specific) on a bunch of pages that had URL-encoded characters. Although my sitemap file specifies these characters in lowercase, GWT is showing them in uppercase.
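One possible explanation for the uppercase form: RFC 3986 treats the hex digits in a percent-escape as case-insensitive and recommends uppercase when normalizing, so a crawler that normalizes URLs could plausibly turn "%e2" into "%E2". Whether Google does this is an assumption on my part. A small Python sketch of that normalization (the function name normalize_escapes is made up for illustration):

```python
import re
from urllib.parse import unquote

# Matches one percent-escape: "%" followed by two hex digits
_PCT = re.compile(r"%[0-9A-Fa-f]{2}")

def normalize_escapes(url):
    """Uppercase the hex digits of every percent-escape; leave the rest as-is."""
    return _PCT.sub(lambda m: m.group(0).upper(), url)

lower = "http://youlookfab.com/welookfab/topic/your-favourite-80%e2%80%99s-music-bands"
upper = "http://youlookfab.com/welookfab/topic/your-favourite-80%E2%80%99s-music-bands"

# Both spellings normalize to the same string and decode to the same text
print(normalize_escapes(lower) == upper)
print(unquote(lower) == unquote(upper))
```

Both checks print True, which is why I would expect the server to treat the two URLs as the same resource rather than redirecting one to the other.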
I can't see any reason in .htaccess for uppercase URL-encoded characters to redirect to lowercase. The site is based on bbPress, and I don't see any reason in the bbPress code for this to happen either.
Could mod_rewrite be doing something strange? I know there was a bug in the past where URL encoded characters were handled incorrectly.
Any insight you have would be much appreciated.
[This is an integrated bbPress/WPMU installation running LAMP, hosted on a MediaTemple (dv) server]