Problem syncing the database ("no text" error)

raf · March 15, 2014, 5:53pm

I have a database.
I sync it with Dropbox.

When I try to import it from my other mac, i always, always, always, receive a problem with some files (150 more or less).

The log says (the same for all the 150 files; the database has 5000 files in total): 18:38:43: ~/database.dtBase2/Files.noindex/pdf/8/file.pdf No Text

What is the problem?
What is the “no text” thing?!?

raf · March 15, 2014, 6:28pm

I just discovered that all the “no text” advices are related to pdf files that were not with OCR.

So now I converted all of them to searchable pdf.

So the question is: is normal that in import the log tells me that file are no text files?

Bill_DeVille · March 15, 2014, 6:42pm

A “no text” PDF simply indicates that the PDF doesn’t contain searchable text. That’s not an error, but information.

For example, an image of a paper receipt saved as PDF by a scanner is merely a picture of the receipt. In order to make that PDF searchable, it can be subjected to OCR (optical character recognition), so as to produce a searchable PDF that contains a text layer as well as an image layer.

You can add a Kind column to a DEVONthink database view window (View > Columns > Kind). Searchable PDFs are identified as Kind = PDF+Text. Non-searchable PDFs are identified as Kind = PDF.

You can create a smart group that lists non-searchable PDFs using these criteria:
Kind = PDF/PS
Word Count = 0

Not all non-searchable PDFs need to be OCRed. For example, I’ve got a friend who occasionally sends image files (photos or graphics) as PDF files. They don’t include pictures of text, so there’s no point in OCRing them.

raf · March 15, 2014, 6:48pm

ok tks