Indexed PDFs not correctly imported

When I index a folder with a lot of PDFs, most of them are imported correctly as PDF+Text, but some appear as “PDF document” and are not searchable.
If I remove them from the directory and add them again, they appear correctly as PDF+Text.

Is there a way to re-index those files without having to manually import them again?

Just to clarify: those documents are PDF text documents and do not contain pictures of text.

Like a Smart Rule

@DTLow - I don’t want them to be OCR’ed; they currently have only plain text in them. Those documents are in Icelandic, and ABBYY does not handle Icelandic as well as I would like, so there is no reason to put them through an OCR engine.

Welcome @Geiri

  • Did you create the documents yourself?
    • If not, then on what basis are you determining they should be searchable? (And just being able to see the text is not a valid indicator.)

See this blog post…

Also, why are you indexing instead of importing?
Have you read the In & Out > Importing & Indexing section of the built-in Help and manual?

Are these documents very large?

Some are created by myself and others are downloaded from websites. In all cases the documents have actual text, not images or outlined fonts. (In Acrobat Pro I can edit the text and see the font size and font name.)
Also, when added again they show up as expected.

It looks like it does not matter, but most of the documents are rather small, maybe 10-30 pages and less than 1 MB in file size.

Which version of macOS & DEVONthink do you use? Are you able to reproduce this using a certain document?

macOS Ventura 13.6.7 | DEVONthink 3.9.6.

No, I am not able to reproduce this, because when re-adding the file it appears OK.
If I delete the database and re-index the whole folder (with 10,000+ files), the problem occurs, but not for the same files.

10,000+ documents is a very large amount. It’s possible the indexing isn’t finished. I would not recommend importing that many files in one go.
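For anyone who wants to follow the advice about not adding everything in one go, here is a minimal sketch of splitting a large folder into smaller groups of PDFs before indexing them. It does not talk to DEVONthink at all and is not an official tool; the function name `pdf_batches` and the default batch size of 250 are assumptions made up for illustration, and batching on its own may not change which files end up as “PDF document”.

```python
from pathlib import Path
from typing import Iterator, List


def pdf_batches(folder: Path, batch_size: int = 250) -> Iterator[List[Path]]:
    """Yield sorted lists of PDF paths, at most batch_size per list.

    Hypothetical helper: it only groups files on disk and does not
    call DEVONthink in any way.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    # Collect every PDF below the folder, matching the extension case-insensitively.
    pdfs = sorted(
        p for p in folder.rglob("*")
        if p.is_file() and p.suffix.lower() == ".pdf"
    )
    for start in range(0, len(pdfs), batch_size):
        yield pdfs[start:start + batch_size]


if __name__ == "__main__":
    import sys

    # Folder to scan; defaults to the current directory.
    root = Path(sys.argv[1]) if len(sys.argv) > 1 else Path(".")
    for number, batch in enumerate(pdf_batches(root), start=1):
        print(f"batch {number}: {len(batch)} files, first is {batch[0].name}")
```

Each batch could then be moved or copied into the indexed folder one group at a time, checking the kind of each item before adding the next group.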

Tried a folder with 250 files; 3 were sorted as “PDF document”. I removed them and imported them again, and then they showed up as PDF+Text.

In DEVONthink, hold the Option key and choose Help > Report bug to start a support ticket so we can look at the machine info and logs. Thanks.