PDF to Text OCR with Google

From Free Knowledge Base- The DUCK Project

Revision as of 16:47, 4 May 2010

Convert Scanned PDFs to Text

If you have a bunch of scanned PDF files on your hard drive and no OCR software, here’s what you can do to convert them into searchable text.

Create a folder on your website (say abc.com/pdf) and upload all the scanned PDFs to that folder. Now create a public web page that links to every one of the PDF files. Then wait for Googlebot to crawl and index them.
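The public page of links can be generated rather than written by hand. The following is a minimal sketch in Python, assuming the PDFs sit in a local folder that mirrors the one uploaded to the site. The function names build_index and write_index and the output name index.html are illustrative choices, not part of the original instructions.

```python
import html
from pathlib import Path
from urllib.parse import quote


def build_index(pdf_names, title="Scanned PDFs"):
    """Return an HTML page containing one link per PDF file name."""
    items = "\n".join(
        # quote() makes the name safe in a URL; html.escape() makes it safe as text.
        f'    <li><a href="{quote(name)}">{html.escape(name)}</a></li>'
        for name in sorted(pdf_names)
    )
    return (
        "<!DOCTYPE html>\n"
        f"<html>\n<head><title>{html.escape(title)}</title></head>\n"
        f"<body>\n  <ul>\n{items}\n  </ul>\n</body>\n</html>\n"
    )


def write_index(folder):
    """Write index.html (a hypothetical name) into `folder`, linking its PDFs."""
    folder = Path(folder)
    names = [p.name for p in folder.iterdir() if p.suffix.lower() == ".pdf"]
    out = folder / "index.html"
    out.write_text(build_index(names), encoding="utf-8")
    return out
```

Calling write_index("pdf") on the local copy of the folder and uploading the resulting index.html alongside the PDFs gives the crawler a single page from which every file is reachable.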

Once the files have been indexed, search Google for "site:abc.com/pdf filetype:pdf" to see the PDF documents as HTML, with the scanned pages converted to text.
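For anyone who wants to bookmark the search, the query can be turned into a URL. This is a small sketch, assuming the usual https://www.google.com/search?q=... URL form. The helper names are hypothetical, and abc.com/pdf is the placeholder folder from the example above.

```python
from urllib.parse import urlencode


def google_site_query(site_path, filetype="pdf"):
    """Build the 'site:... filetype:...' query string described above."""
    return f"site:{site_path} filetype:{filetype}"


def google_search_url(query):
    """Return a Google search URL with the query percent-encoded."""
    return "https://www.google.com/search?" + urlencode({"q": query})


query = google_site_query("abc.com/pdf")
print(query)
print(google_search_url(query))
```

The first line printed is the query exactly as it would be typed into the search box. The second is the same query encoded for use in a link.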