There would appear to be much experience on here with the various PDF manipulation tools.
I was recently given access to some hurriedly done photoscans of a very hard to find book 15 years out of print. I don’t intend to distribute the results of my work publicly but do intend to share it with a few people with similar interests. It’s a cookbook for a restaurant that no longer exists except in the memories of all the people that ate there.
The photo-scans were done a little haphazardly so it required lots of de-skewing and cropping/stamping out fingertips most of which I have accomplished in Photoshop. I’ve got it to the point where it’s readable as a 104 page PDF, but the skew another distortions are never completely gone.
I’ve run OCR on it a few different ways (Devonthink, Acrobat, QuickScan) and they’re all quite far off, especially on the worst quality pages.
What are the options to edit the OCR’ed text without getting into changing how the scans look? I was aiming for just having an accurate and searchable text layer for each page but Acrobat leads one down the road of editing all the graphics and text blocks. DT doesn’t really give you access to the OCR text layer.
Hope this isn’t too far off topic.