[Note to the wise: jump to last EDIT]
I have a very simple txt sitemap (named sitemap.txt) that looks like this:
http://myDomain.com
http://myDomain.com/about.html
http://myDomain.com/faq.html
http://myDomain.com/careers.html
When I submit it to Google Webmaster Tools I get:
Sitemap is HTML - Your Sitemap appears to be an HTML page. Please use a supported sitemap format instead
I tried a few alternatives (such as with or without www) but no luck.
Does anyone have a clue? Any help appreciated!
EDIT:
I tried with an XML sitemap and I get the same error, so it looks like the server is serving everything as HTML (as ceejayoz correctly suggests). Now the question is ... how do I get the appspot server to serve text as plain text?
EDIT:
Ok - I got fed up and implemented a servlet to serve my sitemaps (I am now trying with both XML and TXT) explicitly as text/plain. Everything works fine if I invoke the servlet manually, but I still get "Sitemap is HTML". I don't know where to bang my head!
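For reference, here is a minimal sketch of what "serving the sitemap explicitly as text/plain" amounts to. It is not my actual servlet: to keep it self-contained it uses the JDK's built-in com.sun.net.httpserver instead of the servlet API, and the class name SitemapServer, the helper buildSitemap and port 8080 are all made up for illustration.

```java
import com.sun.net.httpserver.HttpServer;
import java.io.IOException;
import java.io.OutputStream;
import java.net.InetSocketAddress;
import java.nio.charset.StandardCharsets;
import java.util.List;

// Illustrative stand-in for the servlet: serves a text sitemap with an
// explicit Content-Type header. All names here are hypothetical.
class SitemapServer {

    // A text sitemap is just one absolute URL per line.
    static String buildSitemap(List<String> urls) {
        return String.join("\n", urls) + "\n";
    }

    // Starts a server on the given port (0 lets the OS pick a free one).
    static HttpServer start(int port, List<String> urls) throws IOException {
        byte[] body = buildSitemap(urls).getBytes(StandardCharsets.UTF_8);
        HttpServer server = HttpServer.create(new InetSocketAddress(port), 0);
        server.createContext("/sitemaps/sitemap.txt", exchange -> {
            // Set the type explicitly instead of relying on a server default.
            exchange.getResponseHeaders().set("Content-Type", "text/plain; charset=utf-8");
            if ("HEAD".equals(exchange.getRequestMethod())) {
                exchange.sendResponseHeaders(200, -1); // headers only, no body
                exchange.close();
                return;
            }
            exchange.sendResponseHeaders(200, body.length);
            try (OutputStream out = exchange.getResponseBody()) {
                out.write(body);
            }
        });
        server.start();
        return server;
    }

    public static void main(String[] args) throws IOException {
        HttpServer server = start(8080, List.of(
                "http://myDomain.com",
                "http://myDomain.com/about.html",
                "http://myDomain.com/faq.html",
                "http://myDomain.com/careers.html"));
        System.out.println("Serving on port " + server.getAddress().getPort());
    }
}
```

In a real servlet the equivalent step would be setting the content type on the response before writing the body.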
EDIT: I tried to verify content-type with a firefox plugin - everything seems to be coming up as expected (I am putting the actual URL so that people can have a look):
http://wokheisandbox.appspot.com/sitemaps/sitemap.txt --> Content-type: text/plain
http://wokheisandbox.appspot.com/sitemaps/sitemap.xml --> Content-type: application/xml
With my servlet (setting text/plain explicitly):
http://wokheisandbox.appspot.com/wokhei/serveSitemap?fileType=TXT --> Content-type: text/plain
http://wokheisandbox.appspot.com/wokhei/serveSitemap?fileType=XML --> Content-type: text/plain
All I get from Webmaster Tools is still --> Sitemap is HTML.
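The same header check can be scripted instead of done with a browser plugin. Below is a rough sketch, assuming Java 11+ for java.net.http; the class name SitemapHeaderCheck and its helpers are invented for this post. It deliberately does not follow redirects, so a redirect to an HTML page shows up as a status code and Location header rather than being hidden.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.Locale;

// Hypothetical helper: prints status, Content-Type and Location for each URL.
class SitemapHeaderCheck {

    // True when a Content-Type value denotes HTML; parameters such as
    // "; charset=utf-8" are ignored.
    static boolean isHtml(String contentType) {
        if (contentType == null) {
            return false;
        }
        String mediaType = contentType.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
        return mediaType.equals("text/html") || mediaType.equals("application/xhtml+xml");
    }

    // Fetches the URL without following redirects and summarizes the response.
    static String describe(String url) throws Exception {
        HttpClient client = HttpClient.newBuilder()
                .followRedirects(HttpClient.Redirect.NEVER)
                .build();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url)).GET().build();
        HttpResponse<Void> response =
                client.send(request, HttpResponse.BodyHandlers.discarding());
        String type = response.headers().firstValue("Content-Type").orElse("(none)");
        String location = response.headers().firstValue("Location").orElse("(none)");
        return url + " --> status " + response.statusCode()
                + ", Content-Type: " + type
                + ", Location: " + location
                + (isHtml(type) ? " [HTML]" : "");
    }

    public static void main(String[] args) throws Exception {
        for (String url : args) {
            System.out.println(describe(url));
        }
    }
}
```

It can be run as `java SitemapHeaderCheck.java <url> [<url> ...]`, passing every sitemap URL that was submitted, under each hostname the site answers on.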
EDIT:
I think I found the reason --> I registered my site in Google Webmaster Tools as http://mydomain.com, but the app is hosted on appspot at http://myapp.appspot.com, which is mapped to mydomain.com. If I register http://myapp.appspot.com instead, everything works fine (sitemap validated).
This is good news, but it's not ideal because I want mydomain.com to be indexed ... any idea how to work around this?