I’ve been using DA a lot over the past couple of weeks to run scheduled searches on (mostly) News plugins - I thought I understood the principles behind Boolean operators and the Default vs. Secondary search fields…and was using the ‘unlimited’ nature of the Secondary to attempt very specific results. But the search results kept including pages that shouldn’t have been there…and not including pages that I thought should have been. So I did some very simple comparative searches today and the results really have me scratching my head. Can anyone (is Mr. DeVille in the house?) help explain these results?
[All searches were made using the Google News plugin @ 100 Results per, with a few domains previously placed in the Preferences=>Excluded list.]
Search 1: Why are these results not more or less the same?
Search 1a
Default Query: Kenya NEAR crisis (NOT food) (NOT economy)
Secondary: [empty]
Results: 23
Search 1b
Default Query: Kenya NEAR crisis NOT (food OR economy)
ditto: (Kenya NEAR crisis) NOT (food OR economy)
Secondary: [empty]
Results: 0
Search 1c
Default Query: Kenya NEAR crisis
Secondary: NOT (food OR economy)
Results: 69
Search 1d
Default Query: Kenya NEAR crisis
Secondary: Kenya NOT (food OR economy)
Results: 144 [inc. lots with food &/or economy!]
Search 2: Why do “NEAR (x OR y)” and “NEAR/1 (x OR y)” deliver identical results…And (per #1), why do similar queries in Default or Secondary fields deliver different results?
Search 2a
Default: Kenya NEAR (economy OR crisis)
Secondary: [empty]
Results: 70
Search 2b
Default: Kenya NEAR/1 (economy OR crisis)
Secondary [empty]
Results: 71
Search 2c
Default: (Kenya NEAR economy) OR (Kenya NEAR crisis)
Secondary: [empty]
Results: 73
Search 2d
Default: (Kenya NEAR/1 economy) OR (Kenya NEAR/1 crisis)
Secondary: [empty]
Results: 7
Search 2e
Default: Kenya
Secondary: Kenya NEAR (economy OR crisis)
Results: 35
Search 2f
Default: Kenya
Secondary: Kenya NEAR/1 (economy OR crisis)
Results: 36
Search 2g
Default: Kenya
Secondary: (Kenya NEAR economy) OR (Kenya NEAR crisis)
Results: 37
Search 2h
Default: Kenya
Secondary: (Kenya NEAR/1 economy) OR (Kenya NEAR/1 crisis)
Results: 3
Obviously, I’m dealing with at least 2 separate issues here:
Boolean: Why does x NOT (y OR z) ≠ x (NOT y) (NOT z)? (Similar for NEAR and NEAR/n).
Default vs Secondary: The Help literature states, “Secondary Query: When you enter something here, the primary term (the one that is entered or the default query) is only used for querying the search engines, but not for accepting or rejecting pages. Without a secondary query, DEVONagent uses the primary query for both querying search engines and post-filtering the results.” With this logic, I would think that # of Results for [Default=x and Secondary=x NEAR y] should be ≥ [Default=x NEAR y and Secondary=empty]…but it comes out the other way around.
I’ve emphasized the # of results as the main factor in these test searches; obviously, quality is the real issue - but unless the result counts make sense, I can’t trust the quality.
Can anyone help clarify this for me? I’ve never thought of myself as any more dense that the next fella, but now I’m having my doubts…
Thanks in advance!