I want to import web pages into DT, typically pages from a web magazine. Normally, I use the DT clipper with Chrome as the browser.
The import seldom gives a good result. For example, the ‘Accept Cookies’ window is imported too, although I dismissed it before the import. At other times, one or two images on the page are imported but no text, etc. I have tried all available formats.
In another post, I saw this problem mentioned, and a suggested solution was to install an add-on: print to DevonThink. The add-on should be located in Install Add-ons, but I cannot find it. Should I look for that add-on elsewhere?
I have tried many methods to convert web pages to PDF documents.
What (always) gives me perfect results is to drag and drop the web page into DT manually and then convert it to PDF within DT.
The PDF looks identical to the web page.
I don’t understand why the clipper doesn’t deliver the same quality. Maybe something is different “in the background”, or is there a specific explanation?
For comparison: import from Safari with “Save PDF to DT”.
Drag and drop from Chrome isn’t comparable to what you produced in Safari. They are not using the same mechanism at all. Safari is printing to PDF via the PDF services. Converting a bookmark to PDF in DEVONthink is not doing the same thing.
PS: It has nothing to do with Chrome. You’d get the same results if you dragged a bookmark from Safari.
There is no 100% accurate solution to this. Clipping to PDF without the clutter-free option is one approach. However, it can also yield very long PDFs that may be difficult to view in DEVONthink To Go.
For the most part I use webarchive to save the webpage. This does a fresh capture of the page instead of capturing the page as it currently exists in the browser. That means if I delete the ads, promos, etc. in the HTML inspector and then capture, those elements still appear in the webarchive. What I do instead is select and remove them from the captured archive. This removes all the unnecessary elements from the page and keeps the archive clean. Some pages don’t look good as a web capture: they have a dark background and dark font, for example Healthline sites. For these I use the clutter-free option and make minor edits. PDFs of sites don’t give the exact look and feel most of the time, so I use them only sometimes, when other options don’t work. When nothing works, I convert the capture to plain text.