My most common scanning need is to scan 1-10 paragraphs or 1-10 pages from a book or magazine/journal. After they are in DTPO, I use the search function to find the sections I want, and pull paragraphs out to use for writing and webpages – so for both searching and copying I want the content to be exactly what is in the book or journal/magazine.
I’m not finding an easy way to do this with the DPTO scanning function and a Canon Lide 70 scaner. When a page is scanned in, it give me a PDF image of the page, but compared to a dedicated OCR program such as Omnipage, it is difficult to select several paragraph in the middle of a multicolumn journal page, and difficult to keep readjusting the scan area to just give me the right hand page or the left hand page of a book. (Yes, facing book pages can be scanned together, but then, as best I can make out, it is not possible to select in DTPO a paragraph on just one page.)
Also, I haven’t yet found an easy way to proof the DTPO scan. I’ve discovered that if I select all or part of the PDF scan in DTPO and use the service Deventhink Pro – Take Rich note, I can create a file with the underlying text, which is good. However, I also find that the IRIS OCR engine frequently incorrectly identifies a number of characters and words in the text, even when scanning from new books with uncomplicated layout. Correcting them is much more painstaking than proofing in Omnipage.
I am also finding that if I work just with the DTPO scanned pages, often searches don’t work correctly if the word in the pdf has been hyphenated across a line break or if the phrase extends across a line break.
The conclusion I’m coming to is that DTPO scanning works well for large number of PDF or single column documents – I’m thinking of Billie deVille and the 100,000 documents in his DTPO databases. The loss in accuracy is counterbalanced by the ease of bringing in a huge number of documents.
However, for people like me who periodically want an accurate rendering of a few paragraphs or pages from a book or journal, a better workflow might be to use the zoning and proofing functions in a dedicated OCR program like Omnipage, and then bring it over to DTPO as a rich text file.
I am still a novice with regard to both the MAC and DTPO, so I might well be missing one or more tools in DTPO which would make it easier to use the scanning function in DTPO for the work I am doing.
Comments and suggestions would be appreciated.