I have a lot of folders with thousands and thousand pdf files. A lot of them are searchable a lot of not.
I know how to show only non-searchable pdf files via DEVONthink - Data > New from Template > Smart Groups > PDFs (not searchable).
I need to automatically save all non searchable pdf files to searchable (OCR) to the same folder like original with filename (originalfilename_ocr.pdf). And keep original (non searchable) file.
Thank you for your help but I need something different.
My philosophy of work with folders and files is different. I dont preffer work with database like DEVONthink is designed. Indeed, maybe I will change my mind
I need resolve my problem - I have a lot of folders with a lot of files and I want to keep this structure. Is it possible to use DEVONThink for resolving my problem? I want to check all pdf files by DEVONthink and find all non-ocr (non searchable) pdf and OCR them and create new file of OCRed pdf in the same folder where is original non-ocr (non searchable) pdf.
Can I reach this with DEVONthink without import all files to database and work with database?
In the end, your app si really great I have tried maybe all popular OCR software (ABBYY, Adobe Acrobat, …) and any of these app can find only non-OCR (non searchable) pdf.
I set everything like you wrote but new OCRed file doesnt have the name originalname_ocr.pdf but originalname-1.pdf. How can I fix ti?
and the more important - is it possible to set language for OCR? I need czech language because now OCR cant recognize for example - Čeněk, Škoda, …
and is it possible to change “smart rules” for this condition: - if there is non-searchable pdf and also the same file with same filename + _ocr (filename_ocr.pdf) skip it and OCR next one? This is very important for future using after adding new pdf files. Because I dont want to OCR all files again.
E.g. copy (see Change Alias action) the name to the alias before OCR, then use the alias in the Change Name action after OCR.
This can be only changed via Preferences > OCR
There’s no such smart rule condition. The only workaround would be to replace the actions with a script which checks this condition first before handling the actions on its own.
If I set this after OCR the result is - original file dissaper and there is one file (originalname-1) and this is searchable (OCR) and second file (originalname_ocr) and its non-searchable.
I would prefer to keep original file without rename or any change, only created new one searchable after OCR. Is it possible please?
“There’s no such smart rule condition. The only workaround would be to replace the actions with a script which checks this condition first before handling the actions on its own.”
It coudl by possible to resolve it if you can add to next version this functionality - add choice of “date created” is “newer than” “01.04.2021” (for example)