I seem to recall discussions in the distant past that part of the DT3 upgrade and the new ABBYY framework would allow us to do multithreaded OCR. Is that still the case? My tests seem to be doing one thread only.
Multiple documents are not yet processed concurrently but multiple pages of a document are.
OK, I’ll keep an eye out. I’ve tried OCR on a 98-page PDF and it was running on one core the entire time.
Based on my recent conversations with support, the ABBYY sdk is licensed to “use” up to four cores but multithreading is only supported on Windows. I interpret that as meaning that the ABBYY sdk is single threaded and does nothing concurrently. Specifically, it does not process multiple pages of a document concurrently.
This matches the testing that I have done on my machine where DTOCRHelper seems to be using a single thread and only up to 1 CPU. That usage is spread across any cores just as any single core app is, and it takes the same amount of time to process as DTPO 2 did.
If I am wrong, and ABBYY should be processing things concurrently and taking advantage of multiple cores, please let me know. As far as I can get with support, things seem to be behaving as intended.
That is incorrect. The multithreaded option I referred to which is available on Windows is I believe for processing documents multiple documents. The ABBYY SDK on macOS processes a single document on multi-cores using multiple threads.
Oh, that makes a lots more sense.
I’m still not convinced that there’s any threading there, but I will remain hopeful.