OmniPage OCR Does Not Correctly Process Multi-Byte Western Languages.

August 7, 2014 | KB: 1011841
Laserfiche 7.2.1, Quick Fields 7.1.2, Import Agent 7.0.2, Snapshot 7.0.3, Scanning 7.2.1, Web Access 7.2.1

Summary

When you OCR a document written in a multi-byte western language, the OCR process correctly recognizes Latin characters but does not correctly recognize non-Latin characters (e.g., Cyrillic characters).

Resolution

There is a hotfix available for Laserfiche products that include the OmniPage OCR engine. The fix includes updated versions of the following files:

  • BPOmniOCR.exe (version 7.2.1.3)
  • OmniOCRWrapper.dll (version 7.2.1.3)
  • OmniPage32.lfo (version 1.0.0.3)

To update the OmniPage OCR components

  1. Close any applications that may be using the OmniPage OCR engine.
  2. Click the following link to download a zip file containing the hotfix files.
    Hotfix_SCR29919.zip
  3. Replace your existing versions of BPOmniOCR.exe and OmniOCRWrapper.dll with the updated versions contained in the zip file. The existing files are located in the following folder: "C:\Program Files\Common Files\Laserfiche\Batch Processor\BPOmniOCR"
  4. Register the new version of OmniOCRWrapper.dll.
    1. Click Start and then click Run.
    2. Type the following and then click OK:

      regsvr32 "C:\Program Files\Common Files\Laserfiche\Batch Processor\BPOmniOCR\OmniOCRWrapper.dll"

  5. Replace your existing version of OmniPage32.lfo, located in your Laserfiche Client installation folder, with the updated version. By default, the Laserfiche Client is installed in the following folder: "C:\Program Files\Laserfiche\Client"
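
For administrators who need to apply the hotfix on several machines, the file replacement in steps 3 and 5 could be scripted. The following Python sketch is illustrative only and is not part of the hotfix. It assumes the zip file has already been extracted to a folder, and the function names (replace_with_backup, regsvr32_command) are hypothetical. Keeping a .bak copy of each replaced file is an added precaution that this article does not require. The registration command from step 4 is only built, not run.

```python
import shutil
from pathlib import Path

# Which hotfix file belongs in which folder (file names from this article).
# "ocr" is the BPOmniOCR folder; "client" is the Laserfiche Client folder.
TARGETS = {
    "BPOmniOCR.exe": "ocr",
    "OmniOCRWrapper.dll": "ocr",
    "OmniPage32.lfo": "client",
}


def replace_with_backup(hotfix_dir, ocr_dir, client_dir):
    """Copy each hotfix file over the installed copy.

    An existing file is first copied to <name>.bak (an assumption of this
    sketch, so the change can be undone). Returns the replaced paths.
    """
    dirs = {"ocr": Path(ocr_dir), "client": Path(client_dir)}
    replaced = []
    for name, key in TARGETS.items():
        src = Path(hotfix_dir) / name
        dst = dirs[key] / name
        if not src.is_file():
            raise FileNotFoundError(f"hotfix file missing: {src}")
        if dst.exists():
            shutil.copy2(dst, dst.with_name(dst.name + ".bak"))
        shutil.copy2(src, dst)
        replaced.append(dst)
    return replaced


def regsvr32_command(ocr_dir):
    """Build the step 4 registration command as an argument list."""
    return ["regsvr32", str(Path(ocr_dir) / "OmniOCRWrapper.dll")]
```

On a Windows machine, the folders passed in would be the default locations quoted in steps 3 and 5, and the command list could be handed to subprocess.run(..., check=True). Writing to these folders and registering the DLL will likely require administrator rights, and applications using the OmniPage OCR engine should still be closed first, as in step 1.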

Related Links

BPOmniOCR.exe (version 7.2.1.3) also contains additional hotfixes not related to multi-byte western language support. Please see the following Knowledge Base articles for more information.

1011725 OmniPage OCR Does Not Correctly Use the Image Orientation Displayed in the Laserfiche Client When OCRing.

1011839 OmniPage OCR Cannot Process E-sized Images.

1011840 Temporary OCR Files Are Not Automatically Removed.