There are many ways to use the OCR (Optical Character Recognition) tool within PhantomPDF but the most common ways include OCR-ing a scanned document or image.
To perform the OCR feature on a scanned document:
Step 1: Go to File > Create > From Scanner > Select Your Scanner > Make Searchable (run OCR)> Scan.
Please note: ***This option appears greyed out because my scanner is not connected to my personal laptop. Once you’ve connected, your scanner, these options will be available to select.
Step 2: Because Make Searchable (run OCR) was selected, you can now search your scanned PDF for keywords using Ctrl + F.
Alternatively, you can perform this same function by going to Convert > From Scanner within PhantomPDF. Here you will be presented with the same window as above and follow the same steps to run the OCR function on your document.
Another common way to use the OCR function within PhantomPDF is on images.
Step 1: Drag and drop a image file (for e.g., .jpg, .tiff, .png, etc ) onto PhantomPDF while opened. PhantomPDF will convert this image file into a PDF document.
Please note**: It is not necessary to open the image file that you would like to convert to PDF. All that is necessary is to go to the folder in which the image file is stored and then drag it over to PhantomPDF.
Step 2: PhantomPDF will then advise “Some pages may contain unrecognized text. You can run text recognition to make it searchable or editable”.
Step 3: Select Recognize Text (circled above) and then you will be presented with this window:
Here you can indicate whether you’d like the OCR engine to run on the Current page/All Pages or a Page range as well as the language you’d like the OCR engine to support and whether you’d like to be able to search the text image or edit the text within the image. After you’ve made your choices, select OK.
By checking Find All suspects tool this enables you to manually go through each text within the image (that has been converted to PDF) to identify whether the characters highlighted is “Not text” or “Accept” . If you'd prefer, PhantomPDF to recognize all text within the image without manually reviewing, leave the Find All Suspects tool unchecked.