Docs let you extract the text from images with Optical Character Recognition (OCR) and computer algorithms automatically. Images can be processed in multiple groups using the PDF format. Images can be scanned using a flatbed scanner or captured using a digital camera or cell phones.
For the best extraction of images or PDF the following are needed:
Resolution: needs to be at least ten pixels high for each line of text; “high-resolution files work best.”
Orientation: only recognizes horizontal (left to right) text. If you mistakenly use the wrong image you can manipulation the programs to rotate before uploading.
Languages, fonts, and character sets: in OCR engine supports only Latin character right now. The fonts we commonly use such as Arial and Times New Roman can produce, but they do not produce very good results.
Image quality: will work better with even lighting, sharp images and clear contrast. The maximum image size is 2 MB. “For PDF files, we only look at the first ten pages when searching for text to extract.”
Found this to be most useful. Thanks for sharing and great job on your blog! It looks great.
ReplyDelete