Howdy folks,
In this article I’m going show you how to convert an image to text using Microsoft Office Document Imaging Tool. This method is also known as OCR (Optical Character Recognition).
Suppose we have the following image, for image to text conversion (OCR).
Now, follow step-by-step procedure below to convert this image to text.
Step 1:
Download the image to your hard drive and open the file with MS Paint.
Step 2:
Press F12 and save the image as TIFF(*.tif; *.tiff) file.
Step 3:
Open Microsoft Office Document Imaging from Start menu -> All programs -> Microsoft Office -> Microsoft Office Tools and open the .tif file with it. You can drag and drop the file in it or press CTRL+O and select the .tif file.
Step 4:
Go to Tools and click on Recognize Text Using OCR. Now press CTRL+A to select all as text or you can select it as regular text using mouse.
Press CTRL+C or Right click -> copy to copy the text.
You can use this method for scanned documents. If the image is unclear, MS Office Document Imaging might not detect text from image properly. Its good for high resolution scanned documents. Hope OCR is useful for you. If you need any help, please comment below.
Thank you!
Really simple method – without any additional tool, I did not know that MS Office can extract text from an image. Thanks for sharing.
trying to convert tamil image document to text but not able to do so can you guide me please. Thanks and Regards, sambasivam
Hello Sambasivam,
Try this online OCR tool http://www.newocr.com/
Iam using Microsoft office document imaging. While copying the text from image and pasting the text is not rendered properly. It is displaying ascii characters while pasting. What should i do to overcome this? Thanks..
Its not possible if the image is not clear enough, sorry about that Keerthi.
i did the above steps.but i didn’t get exact result.what can i do?
It depends on the quality of the image, nothing to do further.
Can i edit the text on certificate ……
I am Trying to convert Gujarati image document to text but not able to do so can you guide me please.
Convert image to text using CMD Command Prompt ,Tesseract Optical Character Recoginition(OCR)
https://www.youtube.com/watch?v=Mjg4yyuqr5E
May i know which algorithm is used in microsoft OCR for extracting the text from image ?