Am I understanding correctly that I only have to tick arXiv in the search settings and then the LLM can access material on arXiv via RAG? If so, that is exciting. However, one should remember that this is a preprint server and the preprints have not been peer-reviewed. (The same probably applies to a lot of the regular training material.)
Will RAG make it possible to search arXiv from DEVONagent?
No, that is not correct and your premise is wrong. RAG isn’t some kind of super-search functionality to be used in web searches. And it certainly isn’t something that could be generically pointed at a website to force deeper searches.
The setting in DEVONthink just says external AI can specifically use results from the arXiv site, e.g., you can turn off all web-based options but still let it search and return results based on what it finds there. This is no different than the PubMed or Wikipedia options.
Will RAG make it possible to search arXiv from DEVONagent?
See above. Also, DEVONagent has long had technology built in to locate documents by content.
After removing arXiv from being excluded from search in DEVONagent I can now search it. Was this exclusion done by default? I don’t remember putting it there (but as you say I don’t remember everything).
The arXiv, PubMed and Wikipedia options explicitly enable searching these sites and use the first n results (depending on Settings > AI > Chat > Usage and the model used). Irrelevant results are automatically filtered out using the cheapest commercial models, as LLMs are easily confused by irrelevant context.
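For anyone trying to picture the flow described above, here is a minimal sketch in Python of a generic "take the first n results, drop the irrelevant ones, hand the rest to the model as context" pipeline. It is not DEVONthink's actual implementation: `SearchResult`, `is_relevant` and `build_prompt` are hypothetical names, and the keyword-overlap check is only a stand-in for the cheap-model relevance filter mentioned above.

```python
from dataclasses import dataclass


@dataclass
class SearchResult:
    title: str
    snippet: str


def is_relevant(question: str, result: SearchResult) -> bool:
    """Hypothetical stand-in for the cheap-model filter: simple keyword overlap."""
    question_words = {w.lower().strip("?.,") for w in question.split() if len(w) > 3}
    text = f"{result.title} {result.snippet}".lower()
    return any(word in text for word in question_words)


def build_prompt(question: str, results: list[SearchResult], n: int = 3) -> str:
    """Keep the first n results, filter out irrelevant ones, prepend the rest as context."""
    top_n = results[:n]
    kept = [r for r in top_n if is_relevant(question, r)]
    context = "\n".join(f"- {r.title}: {r.snippet}" for r in kept)
    return f"Context:\n{context}\n\nQuestion: {question}"
```

Calling `build_prompt(question, results, n=2)` would look only at the first two results and pass along whichever of those survive the relevance check; anything beyond the first n never reaches the model.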
Thanks for this info. I’m trying to understand what RAG does and so far understand that the info gathered from arXiv, PubMed and/or Wikipedia (or whatever site is being used) is used to modify the prompt that is given to the LLM. Is this correct?
When you say access arXiv here, do you mean using its search results, or results based on the actual contents of every PDF file? The latter would be very difficult, right?
The search in my illustration above shows a match made inside a PDF on arXiv, so yes, the contents. I searched for “Planck* wall”. However, what the OP is asking of RAG / AI is infeasible in this situation, as you can’t enter searches like “Show me PDFs that mention something about solar winds”. DEVONagent searches for specific content.