Script to OCR PDFs with the latest FineReader

moytura · April 28, 2020, 7:37pm

Thanks for the help …

enGeo · April 28, 2020, 8:24pm

I have one more question: the filename now looks like “theName.pdf.txt”. Is there a simple way to get rid of the old extension?

Silverstone · April 28, 2020, 8:45pm

Use the name property, not filename. It’ll return the base name without the extension.
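
A minimal sketch of the difference, assuming DEVONthink 3 (application id "DNtp") and at least one selected record. The variable names and the ".txt" target name are only illustrative and are not taken from the script in this thread:

```applescript
tell application id "DNtp"
	repeat with theRecord in (selection as list)
		-- filename carries the extension, e.g. "theName.pdf"
		set fullName to filename of theRecord
		-- name is the record's name as DEVONthink reports it, e.g. "theName"
		set baseName to name of theRecord
		-- Build the text file's name from the base name instead of the filename
		set textFileName to baseName & ".txt"
		log fullName & " -> " & textFileName
	end repeat
end tell
```

If a record was named with the extension typed into its name, the name property will contain it too, so it is worth checking the logged values once before relying on them.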

enGeo · April 28, 2020, 9:12pm

Thank You!

ambleaKE · May 3, 2020, 7:54am

Excellent work!

Maybe it would be useful for new users if the original post at the top linked to the most important posts for version 1.1? It took me a while to find the code for version 1.1 and your instructions.

Silverstone · May 4, 2020, 2:24pm

I don’t know how to do it…
The forum doesn’t allow me to edit the first post.

nano5 · May 24, 2020, 7:47am

@Silverstone @roads
Thank you for the smart rule. It works smoothly without any further changes, and the resulting document is smaller and clearer. However, I am having the same issue as others: text cannot be selected in the OCRed PDF document. I followed the steps of copying the “missing fonts” from …/Supplemental to Library/Font and restarted the Mac, but that didn’t solve the problem in my case.

FineReader engine is v.12.1.7
macOS Catalina v.10.15.4
DEVONthink 3.5

Any further suggestion? Thx.

nano5 · May 24, 2020, 9:38am

Problem solved. The text of any newly OCRed PDF is selectable after the above steps. I had kept testing with a document that was OCRed before I applied those steps.

Silverstone · May 25, 2020, 11:03am

You are welcome!

enGeo · May 25, 2020, 11:21am

How can I make the script scan only the first page of a multi-page document?

Silverstone · May 28, 2020, 7:21pm

Well, unfortunately FineReader’s export-to-PDF function has no parameter for saving only a selected page range, so you will need to create a temporary PDF from the first page of the document and then run the script on that.

For this I’d use frameworks “Foundation” and “Quartz”, like here.
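
A rough AppleScriptObjC sketch of that idea, assuming plain POSIX paths for input and output. The handler name extractFirstPage and the example paths are hypothetical, and this is not the code behind the link above. It only builds the one-page temporary PDF; passing that file to the OCR script is left out:

```applescript
use AppleScript version "2.4"
use scripting additions
use framework "Foundation"
use framework "Quartz"

-- Hypothetical helper: copies page 1 of the PDF at sourcePath into a new PDF at destPath.
-- Returns true if the new file was written.
on extractFirstPage(sourcePath, destPath)
	set sourceURL to current application's NSURL's fileURLWithPath:sourcePath
	set sourceDoc to current application's PDFDocument's alloc()'s initWithURL:sourceURL
	if sourceDoc is missing value then error "Could not open PDF: " & sourcePath
	if (sourceDoc's pageCount()) < 1 then error "PDF has no pages: " & sourcePath
	-- PDFKit page indexes are zero-based, so index 0 is the first page
	set firstPage to sourceDoc's pageAtIndex:0
	set singlePageDoc to current application's PDFDocument's alloc()'s init()
	singlePageDoc's insertPage:firstPage atIndex:0
	return (singlePageDoc's writeToFile:destPath) as boolean
end extractFirstPage

-- Example call with placeholder paths
extractFirstPage("/path/to/source.pdf", "/tmp/first-page.pdf")
```

The temporary file should be deleted once the OCR step has finished with it.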

omri · July 6, 2020, 3:35pm

Has anyone figured out how to queue invocations of the script?
I set the smart rule to trigger on import, and if I scan two items in rapid succession, the smart rule fails on the second one because FineReader is busy.
The error I get is: FineReader got an error: Connection is invalid.

omri · July 7, 2020, 6:28pm

Never mind. I think this was caused by the rule trying to do stuff (set a tag) on the document after the script ran.
I still sometimes have the old document not deleted, which is strange.

darrylmy · January 30, 2022, 7:24am

Hi @Silverstone, I’ve been using your script for some time now, but since getting a new M1 Mac and upgrading to the latest version of FineReader, the script no longer works. I double-checked the usual suspects from the posts above, but I consistently get the following error:

“ABBYY FineReader PDF got an error: Can’t continue «event FR 118».”

I see the event in the script, but can’t understand what it means. Have you by chance updated this script for the latest version of FineReader? Any help would be much appreciated, as this was such a great script!

Darryl

Silverstone · January 30, 2022, 12:11pm

Hey, Darryl

Unfortunately, the new ABBYY FineReader PDF app is not yet scriptable. They did say they are going to make it AppleScriptable, but it hasn’t happened yet.

You may want to use the previous version of the app with the script.

darrylmy · January 30, 2022, 5:47pm

Hi @Silverstone, thanks for taking the time to reply. That is unfortunate indeed. Thanks for clarifying this for me.