DevonThink & Apple Intelligence

There is a notable difference between AI based on a static LLM and RAG AI, which queries the internet for current information.
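
To make the distinction concrete, here is a minimal toy sketch of the two patterns - nothing vendor-specific, just a stand-in retriever over an in-memory list instead of a live web index, and a stub where the actual LLM call would go:

```python
# Toy sketch only: illustrates "static LLM" vs. "retrieve-then-generate" (RAG).
# The document list, the keyword retriever, and the stubbed generation step are
# all placeholders; a real system would call a search API and an LLM here.
from dataclasses import dataclass

@dataclass
class Doc:
    source: str
    text: str

# Stand-in for a live web index.
WEB_SNAPSHOT = [
    Doc("vendor-press-release", "Emulated apps lose only about 10% performance."),
    Doc("independent-review", "Many emulated apps run poorly and drain the battery."),
]

def static_llm_answer(question: str) -> str:
    # No retrieval: the answer can only reflect whatever was in the training data.
    return "Answer drawn solely from parameters frozen at training time."

def rag_answer(question: str) -> str:
    # Naive keyword match stands in for the web search step.
    words = {w.strip("?.,").lower() for w in question.split()}
    hits = [d for d in WEB_SNAPSHOT if words & set(d.text.lower().strip(".").split())]
    context = " ".join(f"[{d.source}] {d.text}" for d in hits) or "no matching documents"
    # The generation step is stubbed; the point is that the output is
    # conditioned on whatever the retriever happened to surface.
    return f"Answer synthesized from retrieved context: {context}"

print(static_llm_answer("How do emulated apps perform on the X Elite?"))
print(rag_answer("How do emulated apps perform on the X Elite?"))
```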

No doubt responses from all of them need to be verified - just as you need to verify answers from a Google search. If you are searching for facts or reference sources, you will very consistently find that both Perplexity and Consensus are considerably more useful and time-saving than a Google search. You cannot use non-RAG AI for this purpose at all.

Not if the topic you are searching for has been plagued by ads. I just asked Perplexity about compatibility issues of the Snapdragon X Elite chip. It reiterated Qualcomm’s best-scenario-only claim that “There’s reportedly only about a 10% performance loss for emulated apps compared to native ARM apps.” It did not mention that independent tests have revealed that a very large proportion of emulated apps perform horribly and greatly reduce battery life, the number one selling point of Snapdragon-powered Windows laptops.

As a real human, I know to dig through Google results until I see the other side of the coin, before deciding on a purchase. AIGC services like Perplexity currently do not, and I’m afraid they don’t have the business incentive to do so in the future, either.

1 Like

No, they don’t. Not unless they build a business model in which they are paid by users for accurate results, rather than being paid by advertisers. (And a user-supported model will probably lead to a focus on industries – medical, legal, financial services – that are willing to pay a premium, rather than on general consumer use.)

2 Likes

I would suggest asking Perplexity questions like “What are the good and bad points of the X Elite Chip?” Or “Show me some articles expressing concerns about the X Elite Chip.” Or “What are some reasons I might want to buy the X Elite Chip and what are reasons to buy a competitor?”

I could coerce Perplexity into questioning the advertised claims, but the model ultimately relies on the same pool of articles to generate a response. It seems the pool does not include independent test results – which are the “facts and reference sources” that really matter, beyond content paid for by Qualcomm and manufacturers.

Consequently, Perplexity can tell me that many apps don’t run (expected before product launch; same was true when Apple introduced the M1). What it cannot tell me is that emulated apps that do run perform poorly (unexpected; Apple’s Rosetta was much better in this regard).

This is, more or less, why the entire discipline of library science hasn’t just been replaced by search engines. Deciding what sources are relevant is hard, and it gets harder the more specific the information you’re looking for.

7 Likes

This is the reason why I pay for kagi.com. I regularly use their Quick Answer feature, but always with a grain of salt, as it, too, hallucinates from time to time or jumps on the wrong bandwagon if the best-matching search results lead in the wrong direction.

1 Like

I think you are misunderstanding the Perplexity ads:

(1) The ads are separate from the AI response to your question. The actual AI algorithm that responds to your query and gives you a response and links/references is not advertising-related at all.

(2) It is true that Perplexity recently began showing ads next to, but distinct from, the AI response on free accounts. The ads do not appear on paid (“Perplexity Pro”) accounts.

That seems like a pretty reasonable solution to me. Surely Perplexity needs to cover its expenses and earn a profit like any other business - how else could it survive? So you have a choice of a free, advertising-based account or a paid, ad-free account. What’s wrong with that?

I’m sorry but you have misunderstood the entirety of what I was talking about.

I never mentioned ad banners, and I generally don’t care if these are displayed somewhere on the web page, since it’s trivial to hide them once and for all with StopTheMadness.

The problem is, if a topic is plagued by ads (that is, the first few pages of Google search results consist mostly of advertisements and ad-style content), then Perplexity is bound to reiterate some advertised claims in its responses. Thus your claim that

is simply untrue.

I repeat: The algorithm draws responses from the web. If most of the web is ads (or, more precisely, most of the web traffic has gone to advertised and ad-style content), then the algorithm will spit out ads, too. The system is designed – unapologetically – to behave this way.

Don’t want to be stuck in a bubble of ads? Google and process the results yourself.

Am I surprised by the favorite son of latter-day capitalism not trying to refute capitalism? Not at all.

OK, I see your point there. But the Library of Congress contains every book ever published - good, bad, and indifferent. Does that mean the library is full of junk books? No - it simply means you need to figure out how to sort through it for what you want.

I think prompt engineering can help address much of your concern. Perplexity will even criticize itself if you ask “What are some criticisms of Perplexity.ai?”

2 Likes

There are many ways to make use of information provided by the Library of Congress or something similar. It goes without saying that some ways work better than others, and some would not get you what you want at all. We as human users possess the capability to choose our own ways. An algorithm does not. When an algorithm doesn’t work, for whatever reason, it’s not going to work. Period. That’s why, as @kewms has mentioned, machines are not yet replacing humans in this regard.

This is wishful thinking unless proven true. Even if it is true, the user must not be blamed for not being sufficiently skilled if the relevant skills have not been explained in any official or semi-official handbook. It’s akin to blaming limitations of iOS/Android on the user not knowing how to jailbreak.

1 Like

No “blame” needed

If you have a specific case you wish to share it would be helpful to review

No software and no technology is perfect. I have found Perplexity.AI and Consensus to both be useful in a very practical sense - both for personal use and for professional use. Perplexity consistently and quickly finds academic citations not found in long-accepted search indexes which do have big manuals.

No search technology or AI should be accepted unless/until validated by other means. That said - I think if Perplexity or some other RAG AI tool is not in your toolbox, you may be missing something that could be a big help, whatever your reasons for searching.

My field is closely related to theology and religious studies. I frequently hear faithful individuals explain that «if you do not believe in ★★★, then you may (in a polite and respectful tone like yours) be missing something about life/nature/the universe/etc.» I consider most such arguments to be valid, even though they conflict with each other. And it’s always appreciated if ★★★ actually helps these individuals and/or makes them feel better.

You acknowledged that

and I consider this single observation sufficient on its own to explain why any specific software could be unhelpful to certain individuals.

There is only one thing I would still like to confirm: when you stated (which was the one statement I specifically object to)

did you take potential time and effort on prompt engineering into consideration?

Yes

I am not suggesting some huge page-long prompt

With Perplexity you will go very far with simple things like “Show me articles which disagree with XYZ” or “What are the reasons in the best and worst reviews for XYZ” or “What are reasons people choose competitors over XYZ”
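
If you would rather script these contrasting prompts than type them one by one, a sketch along these lines could work. It assumes Perplexity’s OpenAI-compatible chat-completions API; the base URL, environment-variable name, and model name below are assumptions to check against the current API docs.

```python
# Sketch of scripting "both sides" prompts against an OpenAI-compatible
# chat endpoint. Base URL, env var, and model name are assumptions - verify
# them against Perplexity's current API documentation before use.
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["PERPLEXITY_API_KEY"],  # assumed variable name
    base_url="https://api.perplexity.ai",      # assumed endpoint
)

PROMPTS = [
    "Show me articles which disagree with the Snapdragon X Elite's emulation claims.",
    "What are the reasons given in the best and worst reviews of the Snapdragon X Elite?",
    "What are reasons people choose competitors over the Snapdragon X Elite?",
]

for prompt in PROMPTS:
    reply = client.chat.completions.create(
        model="sonar",  # placeholder model name
        messages=[{"role": "user", "content": prompt}],
    )
    print(f"\n## {prompt}\n{reply.choices[0].message.content}")
```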

If you have a real-world case where you think Perplexity does not work well, let’s discuss that specific example.

I use (well, experiment with) several different AI platforms. I pay to use these AI platforms (except for one, where I am a beta tester).

IMO, none of them are ready for prime time. They may be great for generating bogus college papers, but that’s about it. Asking the AI platforms to summarize a document is fine, but 90% of the time the summary omits key provisions. The AI platforms also do a poor job of spotting inconsistencies or even finding key terms.

Our brains took millennia to evolve. We are not constrained by binary logic, which drives computer programs and AI. Generative AI might evolve to simulate the way proficient people analyze and solve problems, but it will never be the same.

In short, I still rely on DevonThink and plan on relying on it for quite some time.

4 Likes

Thanks for the reply!

In my own experience, it typically takes longer to get answers that satisfy me from AIGC than through a Google search. A possible reason is that I’m personally never comfortable with out-of-context statements. The context provided by AIGC always feels artificial and suspicious. Maybe this has to do with living in an ultra-low-trust society, though.

As for my specific case about the Snapdragon chip, Perplexity struggles because it apparently refuses to consider the comment sections of sites like Reddit, YouTube, and developer forums (a trove of valuable information if you know how to make use of it), as well as sources in languages other than English. (China has a vibrant hardware review community whose output is frequently referenced in English-language discussion, for example.) It is also not turning up very recent articles (published in the last week or so).

Maybe it would help to clarify the goal

I view Perplexity as an advanced Google search - I am not relying on its narrative summary except to help point me to which sources I may want to read.

Seems to me if Perplexity found the relevant discussions on Reddit, YouTube, and forums, then it did its job.

I use Google search to synthesize my own opinion (e.g. on whether the Snapdragon stuff is worth buying) from the search results and their linked content, which could be facts, observations, or opinions. This workflow is simply not possible without sufficient context. Search engines, for all the excess information they provide, are unparalleled at providing context on any topic.

I don’t use search or AIGC for this purpose. My sourced-from-the-web reading materials are either already in DT or to be discovered through RSS feeds and newsletters. To me, the search engine or its potential replacement is strictly for retrieval of information.

No, it did not, even when explicitly instructed to look at these sites.

A useful exercise is to test these tools with material you’ve already read or a topic you know well. They are, as noted above, BS engines, but it’s a lot easier to evaluate their responses when you already know the “correct” answer.

4 Likes

@meowky

I’m not getting involved in your debate on Perplexity generally (my scientific field has had a reliable search engine for years and I’ve never struggled to find what I’m looking for, and I use several non-Google search engines that I’m happy with for non-academic stuff), but I did just want to note:

Google, and likely some other AI developers, actively downrank certain types of human content (i.e. blogs, forums) in their training (in fact, Google does it in search too). This does mean valuable content can be missing. As an example, there are thousands of posts across the internet asking how to do things on macOS because Apple’s instructions are either missing or don’t work. Heck, in this forum alone there are probably dozens of queries about Apple software (not DT!) because people couldn’t find the answer in Apple’s support pages. This can make AI (and Google) less valuable than a decent search engine, since they don’t actually find the answer.

[I’m ignoring the ethics of all this content-stealing in this post. It’s a separate topic.]

2 Likes