DTP


 
Lively discussions on the graphic arts and publishing — in print or on the web


02-09-2007, 05:13 AM   #1
dthomsen8
Member
 
Join Date: Aug 2005
Location: Philadelphia, PA 19130
Posts: 2,158
Google Does PDF to HTML Automatically

While surfing the web doing research for a client, I found a page that Google offered as an Adobe Acrobat PDF file, but also as HTML. There was a notice by Google that they automatically convert PDF files to HTML as they crawl the web.

This is a very interesting statement, but other than stumbling on the HTML while surfing from Google, how would a user find the HTML version of a PDF?
02-09-2007, 06:05 AM   #2
ktinkel
Founding Sysop
 
 
Join Date: Oct 2004
Location: In Connecticut, on the Housatonic River near its mouth at Long Island Sound.
Posts: 11,189

Quote:
Originally Posted by dthomsen8 View Post
While surfing the web doing research for a client, I found a page that Google offered as an Adobe Acrobat PDF file, but also as HTML. There was a notice by Google that they automatically convert PDF files to HTML as they crawl the web.

This is a very interesting statement, but other than stumbling on the HTML while surfing from Google, how would a user find the HTML version of a PDF?
Dunno, but why would they want to? It loses all the formatting, which is one of the main strengths of PDF, and the Google HTML-for-PDF pages are very weird-looking.

   
02-09-2007, 07:15 AM   #3
LoisWakeman
Staff
 
 
Join Date: Jan 2005
Location: Uplyme, Devon, England
Posts: 1,402

The same way as you did, I guess! They are pretty awful to read and can be very slow, but at least the keywords hidden inside them are exposed to the world and his dog, including anyone who doesn't have Acrobat Reader installed.
02-09-2007, 07:56 AM   #4
terrie
Staff
 
Join Date: Oct 2004
Posts: 8,944

Quote:
kt: Dunno, but why would they want to?
Sometimes for very large PDFs, the HTML can load quicker, and if you just need to see whether the PDF has the info you need, then checking out the HTML is easier...if it turns out that it does, then you can download/save the PDF to peruse at your leisure...

Terrie
02-09-2007, 08:57 AM   #5
Michael Rowley
Member
 
Join Date: Jan 2005
Location: Ipswich (the one in England)
Posts: 5,105

Terrie:

Quote:
Sometimes for very large PDFs, the HTML can load quicker
Yes, the 'view as html' is often very useful, especially as waiting for the PDF reader to load competes with downloading the PDF—competing, that is, to see which is the slower.

   
__________________
Michael
02-09-2007, 09:39 AM   #6
ktinkel
Founding Sysop
 
 
Join Date: Oct 2004
Location: In Connecticut, on the Housatonic River near its mouth at Long Island Sound.
Posts: 11,189

Quote:
Originally Posted by terrie View Post
Sometimes for very large PDFs, the HTML can load quicker, and if you just need to see whether the PDF has the info you need, then checking out the HTML is easier...if it turns out that it does, then you can download/save the PDF to peruse at your leisure...
Oh, yes — I often use the translation myself just to see if there is anything useful. Otherwise I end up with many obliquely titled PDFs of unknown usefulness in my files.

That’s how I know how icky it looks!

   
02-09-2007, 09:57 AM   #7
terrie
Staff
 
Join Date: Oct 2004
Posts: 8,944

Quote:
kt: That’s how I know how icky it looks!
LOL!!! True...'-}}

Terrie
02-09-2007, 04:17 PM   #8
Steve Rindsberg
Staff
 
Join Date: Nov 2004
Posts: 6,742

Every time Google shows me a PDF amid the results, it also has a "View as HTML" link.

   
__________________
Steve Rindsberg
====================
www.pptfaq.com
www.pptools.com
and stuff
02-12-2007, 06:30 AM   #9
dthomsen8
Member
 
 
Join Date: Aug 2005
Location: Philadelphia, PA 19130
Posts: 2,158
PDF and View as HTML

Quote:
Originally Posted by Steve Rindsberg View Post
Every time Google shows me a PDF amid the results, it also has a "View as HTML" link.
Quite right, Google does it that way. The trick is to figure out how to get Google to return the desired PDF in the results of a search. My objective is to be able to convert an online PDF to an MS Word document, via HTML, and for free.
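
One way to steer a search toward a particular PDF is to combine the search terms with Google's `filetype:pdf` operator and, if the hosting site is known, the `site:` operator. A minimal sketch, assuming the standard `https://www.google.com/search?q=` URL form; the function name `build_pdf_search_url` and the domain `example.com` are hypothetical, chosen only for illustration:

```python
from urllib.parse import urlencode


def build_pdf_search_url(terms, site=None):
    """Build a Google search URL whose results are limited to PDF files.

    `terms` is the free-text part of the query; `site` optionally
    restricts results to one domain. Both names are illustrative.
    """
    query_parts = [terms, "filetype:pdf"]  # filetype: limits results to PDFs
    if site:
        query_parts.append(f"site:{site}")  # site: limits results to one domain
    # urlencode percent-encodes the query so it is safe to paste into a browser
    return "https://www.google.com/search?" + urlencode({"q": " ".join(query_parts)})


if __name__ == "__main__":
    # Hypothetical search: a report hosted somewhere on example.com
    print(build_pdf_search_url("annual report", site="example.com"))
```

Opening the resulting URL in a browser shows the matching PDFs; whether each result carries a "View as HTML" link is up to Google and is not something this sketch controls.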
02-12-2007, 04:47 PM   #10
Steve Rindsberg
Staff
 
Join Date: Nov 2004
Posts: 6,742

Do you want it to look anything like the original PDF? If so, that's not very likely.

   
__________________
Steve Rindsberg
====================
www.pptfaq.com
www.pptools.com
and stuff

All times are GMT -8.


Contents copyright 2004–2014 Desktop Publishing Forum and its members.