Support multiple cores when OCRing

alanshutko · December 28, 2010, 2:29am

I would really, really like it if DTPO could support as many cores as I have when OCRing. I just got a 27" iMac with a quad-core i7 (with hyperthreading) and it’s a shame that most of the CPU is idle when OCRing. I’ve found myself doing a LOT of OCRing and have managed to back the queue up so far it’s taken 36 hrs to clear it.

I seem to recall a past conversation where it was mentioned that the OCR engine was limited to a single thread for licensing reasons. I would be interested in paying extra to add a multicore OCR license. If you could find a way to keep the base DTPO single-threaded while letting me purchase an additional license key to enable more threads, I’d buy it in a heartbeat.

I recently tried the PDFpen 5 demo, and its OCR engine ran four threads and ripped through a document. I can’t speak to the quality, but I’d prefer to keep things in DTPO (with its built-in queuing and ScanSnap support).

cgrunenberg · January 13, 2011, 2:12pm

That’s indeed the case, but we’ll consider this for future releases/upgrades if possible.

neilm · May 23, 2011, 5:36am

This would be a great feature.
Another variation would be for it to complete the queue and then wait for me to enter the details, or to auto-save each document with a sequential name so that I could return later and rename it.

hydrostaticparadoxon · October 21, 2011, 6:38pm

Hello development team,

this would really be a great feature to add. We have powerful machines with quad cores and it would be really good to make them work! Make DevonThink support multiple cores!

Best regards!

leeatmg · November 20, 2011, 3:31pm

I could not agree more. I’d be willing to pay something more to enable multiple cores for the OCR function.

ybai011 · November 23, 2011, 2:10pm

I’d love to see this feature in the future.

andrevs · May 22, 2013, 2:37pm

So it’s almost a year and a half later and this is still pending. Could you provide an update, please? This is becoming more of an issue, and we currently have to rely on external software to do the OCR before importing the PDFs into DevonThink.

Thanks and regards,
André

BLUEFROG · May 22, 2013, 3:47pm

Not much to report other than that we are working with what we have. We understand the desire for this, but short of writing our own multi-core OCR engine (and no, that’s neither a trivial task nor on our radar), we have to work with the APIs and allowances of the licensor.

alanshutko · June 5, 2013, 1:49am

Let us know if we need to stage a sit-in with Abbyy.