DT 3.5 update - DTOCRHelper eats up the memory!

nano5 · June 16, 2020, 1:06am

After I updated to DT v3.5.1, it initiated a reinstall of the ABBYY OCR module (761MB). But the memory issue remains: ocrhelper quickly eats up memory after the OCR job starts, and then it freezes.

For now I use standalone FineReader with a smart rule created by @Silverstone Script to OCR PDFs with the latest FineReader, set to “high resolution”, which works well.

By the way, standalone FineReader seems to work a bit differently in terms of resource usage: on average it takes about 25% of CPU capacity during “import” and “saving”, and 75-80% when “recognizing”. My iMac runs a quad-core Intel i5.

The PDF I tested is an 86-page BBC Science magazine with a lot of images:

during Import, FineReader consumes up to 800MB memory;
during Recognizing, up to 2.8GB memory;
during Saving, around 2.6GB

The original PDF is about 78MB, and the OCRed PDF is double that at 156MB.