I am not happy with the current clipper that DT offers.
And worse, I can’t find a better solution.
Failings
Clutter-free cuts pictures away.
I have no way to configure the clutter-freeing behaviour to make it match the source site.
I have no way to intervene and perform retouches in the process.
I believe the current clutter-free does not only happen locally, but uses some server-side processing.
My ideal clutter-free tool and workflow
When I clip a webpage with the sorter…
I am proposed different clutter-free masks (pre-selected, depending on domain).
Then the “masking” (clutter-removal) happens
and I am given the possibility to retouch the resulting intermediate (webarchive) document (like cutting an uninteresting part that the mask did not remove, or resize a picture)
before the document is finally converted to the desired format (for me PDF).
Masks
Central to this ideal, is a locally performed clutter-removal, which the user can configure through masks (a description of what the clutter-free engine has to perform). Masks that I can manage, configure, tweak, or share with the community.
That has all kind of possibilities, is completely user configurable etc. You could even use it to inject you own CSS, depending on web site.
Re-inventing that wheel in DT would be a wast of efforts, in my opinion. Especially since a completely user configurable clutter free option would require a huge and complicated UX.
Hello, all.
I am also noticing issues when using the clutter-free option.
I am even observing a case where I am saving a web article that has multiple headings along the text, and all the headings are removed in the clutter-fee PDF.
Evernote’s webclipper browser plugin has options for selecting what part of a webpage to save to Evernote. Perhaps DEVONthink’s clipper would benefit from having something similar.
You can already highlight part of webpage and make it into an RTF or markdown (or plain text) note (it’s on the service menus, or use the short cuts cmd-shift-) or cmd-shift-()
I’ve found that the easiest way to get just the important stuff on a website is to use Safari’s Reader mode, select all the text, then use one of those two shortcuts.
Cmd-shift-r, cmd-a, cmd-shift-), job done…
This seems to work more consistently and more accurately then any of the other methods, and it has the advantage that you see what you’re getting, but I’ve not run any detailed comparative tests.
You could of course just highlight the text and run the clipper from the share menu if you want to go direct to PDF / tag / label at the same time.
Thank you for the advice. I didn’t know about this method.
However, this isn’t ideal for me for a couple of reasons: A) it imports it to a DEVONthink database, whereas I organize my content in file-indexing mode and B) It does not allow me to specify where to save it.
I also found something interesting: the problem I was observing, where multiple headings of sections of a text in a page were being removed in reader mode, is also happening when I view that page using Safari’s Reader.
Has anyone seen this happening with other pages?
I agree and I tried to add the URL in my previous post, but I wasn’t allowed as I saw a message saying that I can’t post URLs.
Any chance this restriction can be disabled?
Yes, I see the same thing. As to why, I can’t say at the moment.
Safari’s Reader mode shows there’s something beyond out browser extension involved in the situation; something about the way the page is designed.
The browser extension isn’t going to prompt you to save outside of DEVONthink. If you have an indexed group in DEVONthink you could select that in DEVONthink’s group selector.
Is this a response to my comment above where I wrote “A) it imports it to a DEVONthink database, whereas I organize my content in file-indexing mode”?
I am interested in saving it to DEVONthink, but not store it. I want to index it. And the extension allows me to do it.
But what I wrote was in response to @brookter’s suggestion, which was to use a service to save a rich note.
Do you agree my understanding is correct?
I think no, but not sure. However, I always pass all my stuff into global inbox and then move into wanted folder (indexed or not) via Control + CMD + M shortcut and the start typing the folder name.
Wow, thank you for this. This looks exactly how it should look like!
I just downloaded Microsoft Edge and I was able to produce an identical one.
So, it looks like Microsoft Edge’s Immersive Reader is better than Safari’s and DEVONthink’s clutter-free algorithm, as it didn’t strip out content that is supposed to be kept!