I’ve recently gathered a large number of scanned copies of old journals (PDF only, no text layer) used in my research and am gradually adding them to my DT3 databases (DT3 3.9.6, macOS 14.4.1). As I copy them over to DT3, I manually execute the “OCR to searchable text” command for each file.
My workflow in this situation is to add maybe a half-dozen or so journal issues at a time, queue up the OCR jobs, verify the results when completed, then delete the old PDF-only files and move the new PDF+text files to their proper location in the database. As I’m short on time these days, I’m doing this only perhaps once or twice a day. Thus I’m running the OCR conversions between six and twelve times daily, on files that are about 60-90 MB without a text layer and 120-180 MB after the conversion.
I’m once again running into an issue I’ve encountered before: after repeated successes, DT3’s built-in OCR (ABBYY FineReader) just stops working. I select the “OCR to searchable text” command and nothing happens. Error messages are rarely generated in these situations, and nothing appears in the Activity window to indicate that the menu command was even registered. Repeated attempts to execute it don’t work. Usually restarting DT3 fixes the problem; sometimes (this happened this AM) I have to reboot my computer to get this feature of DT3 back. It’s as if ABBYY reaches some threshold for OCR, whether from a bug or some kind of counter, and just won’t budge after that. After restarting/rebooting, the documents in question OCR correctly, as if nothing had happened.
The issue is difficult to reproduce – I do wish I could be more detailed in this report – and I can’t identify the conditions under which it occurs. But it does happen, nearly predictably, almost every time I try to run a lot of OCR jobs. And it’s the only significant glitch I’ve encountered with DT3.
So… I’m registering an observation that something is amiss…?