I am often saving web pages to DevonThink as PDF files.
It seams that for web pages, the DevonThink hallmark Move To/See Also is not working as well as for ordinary documents. The suggestions are seldom useful. I assume a large part of the reason for this, is that a web page contains so much more information and navigation than just the main text - and a lot of the extra info is not closely related to the main text.
I think the result would be better if a algorithm like the one used by Readability where applied, to find the main text of the page, and index just that. This would give a text similar to the one presented by Safaris Reader function.
-
Readability, lab.arc90.com/experiments/readability/
-
Ruby port of the algorithm, github.com/iterationlabs/ruby-readability There is also a javascript version.