Question about text recognition

MauriceK · April 9, 2026, 2:32pm

I made an observation while testing DT4 that I can’t quite explain. The following sequence:

Computer: Mac Mini M2 Pro with macOS 26.4
Installation: DT4 version 4.2.2
Created a test PDF without a text layer (Document.pdf)
Started DT4 and created a new database
In the settings under „Import“ enabled the option “Make text in PDF documents searchable”
Imported the file “Document.pdf” into DT4
The file appears in the inbox as kind “PDF Document” with 445 words
In the settings under “Import,” disabled the option “Make text in PDF documents searchable”
Imported the file “Document.pdf” into DT4 a second time
The file appears in the inbox as kind “PDF Document” without words
Selected the row and executed “Data → Recognition → Transcribe Text & Notes”
It then appears in the inbox as kind “PDF + Text” without words
Quit and restarted DT4
Now the file from the second import appears in the inbox as kind “PDF + Text” with 445 words

I would have expected that the result of the second import, after explicit recognition, would look the same as the first import with recognition enabled. What am I doing wrong?

Document.pdf (141.4 KB)

cgrunenberg · April 9, 2026, 3:37pm

Most likely just a temporary and harmless display glitch of the kind, we’ll check this.

cgrunenberg · April 15, 2026, 12:57pm

The next release will fix this inconsistency.

MauriceK · April 15, 2026, 1:41pm

Thank you. Dankeschön.