Laserfiche Does Not Generate Text When you Choose to Extract Text from Certain PDF files.

August 7, 2014 | KB: 1011825
Laserfiche 7.2, Laserfiche 7.2.1, Web Access 7.2, Web Access 7.2.1

Summary

When you attempt to extract text from certain PDF files, Laserfiche will not create any text.

Cause

Using the PDF IFilter to extract text requires that the selected PDF file contains embedded text. If the PDF was created as an image-only PDF with no embedded text, there will not be any text for the IFilter to extract.

One way to check whether a PDF file contains embedded text is to open the file in Adobe Reader and attempt to use the selection tool. If you are unable to select the text, then the file does not contain any embedded text.

Workaround

You can use Laserfiche Snapshot to generate TIFF images for that PDF file and then OCR the resulting images.