Hi, I am working on a scraping app and wanted to try to get it working, but ran into a problem. I have replaced the original scraping destination in the code below with Google's web page, just for testing. It seems that my download doesn't get everything: I noticed that the <body> and <html> tags are missing their closing tags. How do I get it to download everything? What's wrong with my sample code:

// Requires: System.IO, System.Net, System.Text, System.Web
string filename = "test.html";

using (WebClient client = new WebClient())
{
    // Build the query string: ?q=<search term>&hl=en
    string searchTerm = HttpUtility.UrlEncode(textBox2.Text);
    client.QueryString.Add("q", searchTerm);
    client.QueryString.Add("hl", "en");
    string data = client.DownloadString("http://www.google.com/search");

    // Save the response to disk
    using (StreamWriter writer = new StreamWriter(filename, false, Encoding.Unicode))
    {
        writer.Write(data);
    }
}
A: 

...Google's page doesn't have the closing tags for <body> and <html>. Talk about crazy optimization...

Matti Virkkunen
A: 

http://www.google.com/search doesn't have closing tags.

Marcelo Cantos
+3  A: 

Google's web pages are now HTML5, meaning the closing BODY and HTML tags can be omitted, which is why Google leaves them out (believe it or not, it saves them bandwidth.)

See this article.
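
In fact, you can verify that the download itself is complete; the closing tags just never appear in the markup. A minimal console sketch (using the same WebClient call as in the question):

// Sketch: confirm the response is complete even though the closing
// tags are absent from Google's markup itself.
using System;
using System.Net;

class Check
{
    static void Main()
    {
        using (WebClient client = new WebClient())
        {
            string html = client.DownloadString("http://www.google.com/");
            Console.WriteLine("Downloaded {0} characters", html.Length);
            Console.WriteLine("Has </body>: {0}", html.Contains("</body>")); // expected: False
            Console.WriteLine("Has </html>: {0}", html.Contains("</html>")); // expected: False
        }
    }
}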

You can write HTML5 in either "HTML/SGML" mode (which allows omitting closing tags, as HTML did prior to XHTML) or in "XHTML" mode, which follows the rules of XML and requires all tags to be closed.
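
If you need the file you save to contain the closing tags anyway, one option is to re-serialize the download through a tolerant parser. A minimal sketch, assuming the third-party HtmlAgilityPack library is referenced:

// Sketch: HtmlAgilityPack parses the forgiving HTML/SGML syntax and can
// close any tags still open at the end of the document when saving.
HtmlAgilityPack.HtmlDocument doc = new HtmlAgilityPack.HtmlDocument();
doc.OptionAutoCloseOnEnd = true; // close tags left open at end-of-document
doc.LoadHtml(data);              // "data" is the string from DownloadString above
doc.Save("test.html");           // the saved markup should now be fully closed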

Which mode the browser uses to parse the page depends on the Content-Type header you send: text/html for HTML/SGML syntax, or application/xhtml+xml for XHTML syntax. (Source: http://stackoverflow.com/questions/1076897/html5-syntax-html-vs-xhtml)
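
To illustrate, here is a minimal self-hosted sketch (using HttpListener; the address and markup are placeholders) showing that the header, not the markup, selects the parsing mode:

// Sketch: the Content-Type header decides which parser the browser
// applies to the response body.
using System;
using System.Net;
using System.Text;

class ContentTypeDemo
{
    static void Main()
    {
        HttpListener listener = new HttpListener();
        listener.Prefixes.Add("http://localhost:8080/"); // placeholder address
        listener.Start();

        HttpListenerContext context = listener.GetContext();
        // text/html             -> forgiving HTML/SGML parsing (closing tags optional)
        // application/xhtml+xml -> strict XML parsing (every tag must be closed)
        context.Response.ContentType = "text/html";
        byte[] body = Encoding.UTF8.GetBytes(
            "<!DOCTYPE html><title>demo</title><p>No closing tags required here");
        context.Response.OutputStream.Write(body, 0, body.Length);
        context.Response.Close();
        listener.Stop();
    }
}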

Andy Shellam