I was given a number of scanned documents that the provider had OCR’d before handing them over. In the original files, the text is selectable and the OCR accuracy seems high: copying and pasting is almost 100% accurate, search works, and looking up definitions works almost perfectly.
The file metadata states:
PDF Producer: Adobe Acrobat 9.0 Paper Capture Plug-in with ClearScan
Content Producer: Adobe Acrobat 9
Oddly, when I add a few highlights and then save the file, the metadata changes to:
PDF Producer: Mac OS X 10.10.2 Quartz PDFContext
Content Producer: Adobe Acrobat 9
After this, the text is still selectable, but the OCR accuracy is effectively 0%. Copying and pasting yields nothing, definitions cannot be looked up for any word, and search finds nothing.
This happens in both Preview and DTPO’s built-in viewer, and only with some files, not all: mostly scans of text that were run through some type of OCR.
The file size also increases. For example, an 11 MB original PDF grows to about 16 MB once it has been viewed and saved in DTPO or Preview.
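In case it helps anyone reproduce this, the Producer change described above can be checked from the raw file rather than from a viewer's info panel. This is only a rough sketch, not a real PDF parser: it assumes the Producer entry is stored as an uncompressed literal string (it will miss entries held in compressed object streams, hex strings, or XMP metadata), and the function names `find_producers` and `producers_in_file` are my own.

```python
import re

# Matches an uncompressed literal-string entry such as "/Producer (Some Tool 1.0)".
# Escaped characters (e.g. "\)") are allowed inside; nested parentheses are not.
_PRODUCER_RE = re.compile(rb"/Producer\s*\(((?:\\.|[^\\()])*)\)", re.DOTALL)


def find_producers(pdf_bytes: bytes) -> list[str]:
    """Return every literal /Producer value found in the raw bytes, in file order."""
    return [m.group(1).decode("latin-1") for m in _PRODUCER_RE.finditer(pdf_bytes)]


def producers_in_file(path: str) -> list[str]:
    """Read a file from disk and list the literal /Producer values it contains."""
    with open(path, "rb") as fh:
        return find_producers(fh.read())
```

Calling `producers_in_file` on a copy of the original and on the re-saved copy (whatever the two files are named) should show whether the re-saved file was rewritten by a different producer. An empty list only means no literal entry was found, not that the file has no Producer.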
Your thoughts are appreciated!