Google Converting PDF to text

Converting your scanned PDF files into clean text using the Google crawler is possible. But there is no privacy: nothing prevents the files from being displayed in search engine results. Google crawls all the files it finds, such as .doc, .txt, .pdf and more, unless you have set a restriction on the Google crawler. Many times I have found employees' salary slips crawled by Google and displayed in search engine results. But for those who want to convert PDF files into text without any cost, it is an excellent idea.
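The usual way to set such a restriction is a robots.txt file at the root of your site. Here is a rough sketch in Python, using the standard library's robots.txt parser, to check whether a file would be open to the crawler. The rules, the /private/ folder and the example.com URLs are made up for illustration, and keep in mind that robots.txt only asks crawlers to stay away; it does not protect the files.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: asks every crawler to stay out of /private/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_crawlable(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt rules allow `agent` to fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # example.com paths are placeholders, not real files.
    for url in (
        "http://example.com/private/salary-slip.pdf",
        "http://example.com/public/scanned-book.pdf",
    ):
        print(url, "crawlable:", is_crawlable(ROBOTS_TXT, url))
```

So salary slips belong in a blocked (or better, password-protected) folder, and only the PDFs you want converted should sit in a crawlable one.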

The crawling is not real time: just upload your files to a public domain and wait until the crawler comes and crawls all of them. Google has OCR software installed on its servers to extract the scanned information in PDFs into text form. I don’t know the exact number, but Google must be crawling millions of PDF files a day (including updates). Once all the files have been crawled, how will you find the text versions of your PDF files? Just go to Google search and type:

site:http://yourdomainpath/username .pdf

Here, the path is the location where you uploaded your files.
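If you have many folders to check, the search query can be built with a few lines of Python. This is only a sketch: the function names and the example.com path are my own placeholders, and it uses Google's filetype: operator in place of the plain ".pdf" keyword, which is an assumption on my part about what gives cleaner results.

```python
from urllib.parse import quote_plus


def build_site_query(path: str, extension: str = "pdf") -> str:
    """Build a Google query that lists files of one type under an uploaded path."""
    # Drop the scheme (http://, https://) and any trailing slash.
    host_and_path = path.split("://", 1)[-1].rstrip("/")
    return f"site:{host_and_path} filetype:{extension}"


def build_search_url(path: str, extension: str = "pdf") -> str:
    """Return a Google search URL for the query above."""
    return "https://www.google.com/search?q=" + quote_plus(
        build_site_query(path, extension)
    )


if __name__ == "__main__":
    # example.com/username is a placeholder for your own upload path.
    print(build_site_query("http://example.com/username"))
    print(build_search_url("http://example.com/username"))
```

Paste the printed URL into your browser to see which of your PDF files have been picked up so far.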

Note: The path must already have been crawled by Google. To get it crawled, go to http://www.google.com/addurl

It will list all the PDF files under your hosting account. :)

If you want free, real-time online PDF conversion instead, I prefer http://ocrterminal.com/, an excellent OCR service for real-time conversion.
