Boolean NOT (NEAR ...)?

I wish you to be as perceptive in analyzing the documents you find as you are in searching for them. :wink:

As do I! :slight_smile:

One of the dangers is that perceptivity in analysis is sometimes proportionate to the time available divided by the number of results one has to go through! Even narrowing the results by 10% could represent hundreds of hours saved over a decade. This all arose because I’d rather dedicate as much time as possible to analyzing results that match a given criteria to the greatest extent currently possible than to spend a lot of that precious time weeding stuff out that was irrelevant to begin with.

Our art is to be able to distinguish the important from the unimportant. More data does not automatically lead to more knowledge, even if many believe that … I’ll stop now. I don’t know any better than you. :slightly_smiling_face:

You’re completely right, and I agree.

I really don’t know anything about complicated/complex searches. Because that’s the case, I search for the most important thing first. Then I search only in the search results further and then (if necessary) again in the search results … but that’s probably useless in your case?

1 Like

If I always knew what I was looking for among personal files, that would be one thing. But I’m working with a growing archive of historical documents spanning 180 years. Multiple searches and sub-searches can help, depending on the study methodology and delimitations. Finding and knowing what is important is a process of discovery aided by the nature and strength of the queries.

1 Like

Shouldn’t a simple query like "new testament" OR "old testament" be sufficient? OPT doesn’t affect matching or highlighting but this might rank results including the word higher.

That’ll find all “articles”, but the OP wants to exclude those containing also “new/old covenant”. Since an “article” is simply an undefined part of the document, I doubt that is possible.

Hmmm… easiest approach would be ("new testament" OR "old testament) NOT ("new covenant" OR "old covenant"). Not very elegant but should work. Just like this approx. solution using wildcards:

"[no][el][wd] testament" -"[no][el][wd] covenant"

It really is more complicated than that. Or rather “not solvable”, in my opinion. Because we’re not talking about a complete document (in that case, your query would of course do the right thing). But rather something like:

  • A document has articles 1, 2, and 3 where
  • article 1 talks about any testament and the old covenant
  • article 2 talks about any testament and a marriage covenant
  • article 3 talks about the book of Hiob (or whatever)

What the OP would like to happen is that this document is found because one of its articles matches their “give me any testament and any covenant as long as the latter is neither old nor new”. But a query like "testament" NOT ("old" or "new") "covenant" will weed out the complete document, so that “article 2” will not be found, whatever we do.

As there’s no “limit my search to something that only humans can identify because there is apparently no clear boundary inside the document” operator, it simply can’t work. Cake have not eat. Or cake eat not have.

If there were clearly definable/defined boundaries between the articles, one could somehow script it by splitting the document into articles at the boundaries and then searching only in the articles.

I love the idea of a “now testament” :wink:

Interesting idea but not matched by the wildcards :slight_smile:

True but it would match the oew testament. :stuck_out_tongue:

Now how likely is that? But if it should be an OCR issue, then it’s unclear anyway whether it should be matched or not :grin: