This looks pretty cool. I was thinking that perhaps at some point DT4 might include its own scripting window like VSCode that would let users just type requests and that would support them along the way of performing actions. If you’ve seen Google’s colab tool, for example, it has an agent mode that lets you drop some files and ask a question and it will build out a notebook with comments and code step by step showing you all the visualizations you requested. You can then go back to any step and ask it to make improvements.
If DT4 had a gemini cli built into the interface, it might make it easier to run scripts, commands, and other actions with simple text shortcuts. That same text-based interface on iPhone or on Web where we can transcribe text by talking would allow typical humans to ask questions about our databases and get better agent-driven answers without needing to do the multi-step process of opening databases, searching, and then summarizing.
I think it’s for people trying to reduce cognitive load like this guy:
Personally, with all the spelling errors I see in code, I wish everybody would use AI because it seems that if they can’t even spell in English, how can I trust that they can write logical and efficient methods in 15 other languages and databases.
By the way, when the user prompts for feedback, he’s expecting positive feedback and (perhaps) simple requests that he can manage. I sometimes wonder if humans need a “/memory add” command like we have in gemini cli to explain what we learn in kindergarten about human psychology so that we can have a conversation that doesn’t always dig into the weeds. For example, if I could open a thread and see that everybody is there and being supportive…. I almost wish that these forums and chatrooms would have AI built in that would let me set my preferences so that it would rewrite my version of reality and change all the sideways comments by rephrasing them so as to give me a Happy Little Clouds view of the forum instead…
Have you seen AI code, eg the one posted here? It may not contain spelling errors. But it certainly is full of hallucinated method calls, useless error checks, and overall reminds me of a person just starting out with CS 101.
You can be certain that the only languages and databases AI can write code for/in are those for which a huge corpus is “freely” available on the net. Free in the sense of “who cares for copyright.” So, AI-written C#, Python and JavaScript code might be ok. JXA, AppleScript, Cobol – not so much so (I haven’t seen Cobol yet).
This looks and works well using it with Amazon Q Dev CLI. I think DEVONthink should raise the bar and consider further improving this MCP server or creating their own that better matches their processes. Current limitations on which model hosting providers I can use (e.g., no support for AWS Bedrock/Agents) or no support for syncing files via an AWS S3 bucket are very disappointing. I think the DEVONthink team can do better when it comes to this more technical persona of their customer base.