DTP


 
Lively discussions on the graphic arts and publishing — in print or on the web


Go Back   Desktop Publishing Forum > General Discussions > Web Design

Reply
 
Thread Tools Display Modes
Old 07-31-2007, 09:19 AM   #1
dthomsen8
Member
 
dthomsen8's Avatar
 
Join Date: Aug 2005
Location: Philadelphia, PA 19130
Posts: 2,158
Default Legacy Pages Out There

Quote:
Originally Posted by CarlSeiler View Post
Does anyone here have any idea how many legacy pages are out there in general? How many HTML 2.0 and 3.2 pages are still out there on the web? Surely someone on the web has attempted to collect statistics.
Google has a big study on web page code, with a great deal of detail. I have not read much of it yet, but I will.

I found this study from here. This smaller study does have interesting statistics.

My own perception is that there is a great deal of HTML 4.01 out there which has never been upgraded. Also, many web sites do not specify a DOCTYPE. Of those that do, only a few validate with the www.w3c.org validation service. The only large web site that I have found that validates as XHTML is the W3C site itself. Doubtless there are others.

Last edited by dthomsen8; 07-31-2007 at 09:40 AM.
dthomsen8 is offline   Reply With Quote
Old 08-02-2007, 08:45 AM   #2
iamback
Member
 
iamback's Avatar
 
Join Date: Oct 2005
Location: Amsterdam, NL
Posts: 4,894
Default

Quote:
Originally Posted by dthomsen8 View Post
Google has a big study on web page code, with a great deal of detail. I have not read much of it yet, but I will.

I found this study from here. This smaller study does have interesting statistics.
Interesting stuff in both. Google drops a few balls (not surprising since apparently they've only recently become aware there are such things as standards!). The biggest blooper I found (but I didn't read all pages):
Quote:
...revisit-after, supposedly used to tell search engines how often to recrawl the page. To our knowledge only one search engine has ever supported it, and that search engine was never widely used — at this point, it is nothing more than a good luck charm. A remarkably widely used one. More pages use the completely worthless <meta name="revisit-after"> than use the <em> element!
Google seem to be unaware that there are such things as local search engines which would be perfectly happy to be told not to reconsider a page for a while! If your search engine indexer evaluates this directive, it's far from "completely worthless", and if public search engines ignore it, so what? It serves its function regardless.

Quote:
Originally Posted by dthomsen8 View Post
My own perception is that there is a great deal of HTML 4.01 out there which has never been upgraded. Also, many web sites do not specify a DOCTYPE.
My bet is that a lot of those that don't specify a DOCTYPE are actually (sort of) HTML 3.2 - from when that was effectively the only game in town - but no one cared much about standards.

Quote:
Originally Posted by dthomsen8 View Post
Of those that do, only a few validate with the www.w3c.org validation service.
And for those that don't specify a DOCTYPE, the validator assumes HTML 4.01 - so they likely won't validate anyway if it was actually HTML 3.2 (or something intended to be sort-of that)...

   
__________________
Marjolein Katsma
Look through my eyes on Cultural Surfaces (soon!), My ArtFlakes shop and Flickr.
Occasionally I am also connecting online dots... and sometimes you can follow me on Marjolein's Travel Blog
iamback is offline   Reply With Quote
Old 08-02-2007, 06:28 PM   #3
dthomsen8
Member
 
dthomsen8's Avatar
 
Join Date: Aug 2005
Location: Philadelphia, PA 19130
Posts: 2,158
Default

What do you mean by "local search engines"? Do you mean those for a limited geographical area, or those run by local ISPs, or what?
dthomsen8 is offline   Reply With Quote
Old 08-03-2007, 11:40 AM   #4
iamback
Member
 
iamback's Avatar
 
Join Date: Oct 2005
Location: Amsterdam, NL
Posts: 4,894
Default

Quote:
Originally Posted by dthomsen8 View Post
What do you mean by "local search engines"? Do you mean those for a limited geographical area, or those run by local ISPs, or what?
Those running on a site (server) itself! Or a server, for a collection of sites run on that server. Or within a company, for all that company's websites, internal or external. Etc.... In general: search engines operated by the owners of websites, usually (but not necessarily) running on the servers where their websites are hosted

ht://Dig (an OSS engine I once made some contributions to) is one example, but there are many more!

   
__________________
Marjolein Katsma
Look through my eyes on Cultural Surfaces (soon!), My ArtFlakes shop and Flickr.
Occasionally I am also connecting online dots... and sometimes you can follow me on Marjolein's Travel Blog
iamback is offline   Reply With Quote
Old 08-18-2007, 07:01 AM   #5
CarlSeiler
Member
 
CarlSeiler's Avatar
 
Join Date: Oct 2005
Location: Denton, TX
Posts: 271
Default

Quote:
Originally Posted by dthomsen8 View Post
My own perception is that there is a great deal of HTML 4.01 out there which has never been upgraded. Also, many web sites do not specify a DOCTYPE. Of those that do, only a few validate with the www.w3c.org validation service. The only large web site that I have found that validates as XHTML is the W3C site itself. Doubtless there are others.
IBM's web site has for a number of years validated. It is currently using XHTML 1.0 strict.
CarlSeiler is offline   Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
What's the easiest way to create web pages? marlene Web Design 9 07-04-2007 02:44 PM
Counters for your site's pages Linda Baldwin Web Site Building & Maintenance 7 01-08-2007 07:42 AM
Quark 6.5 - import pages bmann Print Design 1 10-30-2006 08:08 AM
No Good Home Pages? ElyseC Web Design 45 12-10-2005 05:55 AM
Master pages in PagePlus Michael Rowley Print Design 6 10-25-2005 12:28 PM


All times are GMT -8. The time now is 05:26 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2017, vBulletin Solutions, Inc.
Contents copyright 2004–2014 Desktop Publishing Forum and its members.