ansaurus

Question

What is the correct way to determine the type of a file returned by a web server?

Answer 1

+1 A:

Is the only accurate method to examine the contents of the file?

Its the method browsers use to determine the file type, but is by no means accurate. The fact that it isn't accurate is a security concern.

The only method available to the server to indicate the file type is via the Content-Type HTTP header. Unfortunately, in the past, not many servers set the correct value for this header. So browsers decided to play smart and tried to figure out the file type using their own proprietary algorithms.

The "guess work" done by browsers is called content-sniffing. The best resource to understand content-sniffing is the browser security handbook. Another great resource is this paper, whose suggestions have now been incorporated into Google Chrome and IE8.

How do I determine the correct file type?

If you are just dealing with a known/small list of servers, simply ask them to set the right content-type header and use it. But if you are dealing with websites in the wild that you have no control of, you will likely have to develop some kind of content-sniffing algorithm.

sri 2010-09-07 05:27:29

Answer 2

+1 A:

The browser knows it's JavaScript because it reached it via a <script src="..."> tag.

If you typed the URL to a .js file into your URL's address bar, then even if the server did return the correct Content-Type, your browser wouldn't treat the file as JavaScript to be executed. (Instead, you would probably either see the .js source code in your browser window, or be prompted to save it as a file, depending on your browser.)

Browsers don't do anything with JavaScript unless it's referenced by a <script> tag, plain and simple. No content-sniffing is required.

Joe White 2010-09-07 05:36:12

ansaurus

tags:

views:

answers:

What is the correct way to determine the type of a file returned by a web server?

related questions