Since the document is standardised, I could grab the value according to the line number.
Could you point me to a good place in the internet where I could look up how to script that?
Must be so easy I maybe can teach myself?
Since the document is standardised, I could grab the value according to the line number.
Could you point me to a good place in the internet where I could look up how to script that?
Must be so easy I maybe can teach myself?
lin 7 connects to line 18 and so on.
This proves the underlying text layer has little relationship to what youāre looking at in the file.
Are you just trying to pull the NORSEDE71XXX value from the file?
try running this on your original file:
tell application id "DNtp"
set theRecords to selected records
repeat with theRecord in theRecords
set documentText to plain text of theRecord
set lns to paragraphs of documentText
set theContent to item 18 of lns # use whatever line number you need
display dialog theContent
end repeat
end tell
Iām playing with this myself, so I canāt actually point you to a source (it being a combination of scripts I have written, searching my brain and the 'net). I was inspired by this post in Stack Overflow.
As I said it is more an educational project to learn something.
I chose this entry point because I got a lot of them and if something goes wrong it doesnāt matter.
I wanted to learn automise renaming files.
So the date and recipient as filename via regex as a first exercise.
Regex isnāt really a ābeginnerās topicā but if youāve got the time and energy, itās an interesting and very powerful technology.
Hereās a famous quote from a well known hacker back in the dayā¦
Some people, when confronted with a problem, think āI know, Iāll use regular expressions.ā
Now they have two problems.
~ Jamie Zawinski,
Results stay the same. (Printing pdf or jpg as pdf.)
A couple of years ago I had Adobe Acrobat Pro and there you could create Textfields and then define their order.
Meaning you could guide the user through the document he is filling it out.
So you could set where the marker jumps to next when pressing tab. First I thought that is the reason why the fields are stacked liked this.
But since it seems OCR it anyway like this, there must be different logic behind it.
Anyway problem solved, even tough @Blanc helped far too muchā¦ I think I can cross this item from my bucket list anyway since I came up with the idea to grab the line.
(Thank you all very much)
This is unrelated. Youāre not using input fields here.
I suppose that the OCR engine treats your document as if it had a columnar layout. Which would be ok and desired for a normal text (e.g. a page in a magazine or a newspaper article). However, for a bank statement, it is certainly not what one wants. Unfortunately, you canāt influence the OCR engineās behaviour, so you probably have no chance to correct it.
Out of interest: Is this by any chance happening with account statements from german ING bank? I see similar behaviour here with mine.