Global Concordance

DEVONthink 4.1

Is there a “global concordance”?

I find that I’m frequently removing common “noise words” from my document concordance lists so that I can see the wood for the trees and get more value out of my documents. I appreciate that a “global” concordance list is probably an unwieldy beast, but is there a viable workaround that does not involve making the same Concordance edits across multiple documents?

Selecting multiple multi-page PDFs and then attempting to remove a batch of items (strings of 1 to 4 characters, numbers, non-English characters, etc.) from the Concordance list brings up a spinning beach ball for five minutes (if I’m lucky) each time I right-click for the context menu.

If you exclude words from the Concordance, they’re excluded for all documents, even across databases.

Here are two similar PDF documents, in different databases. I have excluded a word from one of them and as you can see it has applied to both documents and databases…


OMG! I am such a doofus; I have so many junk words across my PDF collection that I couldn’t see that the Concordance list was already global, and that removing an unwanted item from a single PDF would remove it globally. Thanks!

I now see that one of the many additional challenges with scanned documents is that the OCR will find an abundance of worthless words and characters that are all pulled into Concordance regardless.

You’re welcome.

And there is no 100% accurate OCR engine. ABBYY’s has long been the best or one of the top two, but the quality and contrast of the original still matter.
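
For anyone who wants to gauge how much OCR junk a document contains before cleaning up the Concordance, the noise categories mentioned above (strings of 1 to 4 characters, numbers, non-English characters) can be approximated outside DEVONthink. The following is only a rough sketch in Python, not DEVONthink's own Concordance logic or API: it assumes the document text has already been exported as plain text, and the function name `concordance` and the `min_len` threshold are made up for illustration.

```python
import re
from collections import Counter

def concordance(text, min_len=5):
    """Count words in plain text, skipping common OCR noise.

    Hypothetical helper: it only mimics the filters described in
    this thread and has no connection to DEVONthink itself.
    """
    counts = Counter()
    # \w+ yields runs of letters, digits and underscores (Unicode-aware).
    for token in re.findall(r"\w+", text.lower()):
        if len(token) < min_len:   # strings of 1 to 4 characters
            continue
        if not token.isascii():    # rough proxy for non-English characters
            continue
        if not token.isalpha():    # numbers and mixed tokens such as "abc123"
            continue
        counts[token] += 1
    return counts

# Made-up sample text standing in for an exported OCR layer.
sample = "The invoice 2023 total: invoice naïveté lorem l1 xx payment payment payment"
print(concordance(sample).most_common())
```

The ASCII check is deliberately crude: it also drops legitimate English words that carry accents, so the threshold and filters would need adjusting for a real collection.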