Question about text recognition

While testing DT4, I observed something I can’t quite explain. I went through the following sequence:

  • Computer: Mac Mini M2 Pro with macOS 26.4
  • Installation: DT4 version 4.2.2
  • Created a test PDF without a text layer (Document.pdf)
  • Started DT4 and created a new database
  • In the settings under “Import,” enabled the option “Make text in PDF documents searchable”
  • Imported the file “Document.pdf” into DT4
  • The file appears in the inbox as kind “PDF Document” with 445 words
  • In the settings under “Import,” disabled the option “Make text in PDF documents searchable”
  • Imported the file “Document.pdf” into DT4 a second time
  • The file appears in the inbox as kind “PDF Document” without words
  • Selected the row and executed “Data → Recognition → Transcribe Text & Notes”
  • It then appears in the inbox as kind “PDF + Text” without words
  • Quit and restarted DT4
  • Now the file from the second import appears in the inbox as kind “PDF + Text” with 445 words

I would have expected that the result of the second import, after explicit recognition, would look the same as the first import with recognition enabled. What am I doing wrong?


Document.pdf (141.4 KB)

Most likely just a temporary, harmless display glitch of the kind. We’ll check this.


The next release will fix this inconsistency.


Thank you. Dankeschön. :slightly_smiling_face: