Is it possible to extract a number/few words after some words and add it to custom metadata?

I have a number of transcripts in searchable PDFs. Inside each PDF is a table and on the bottom, there is a row goes " Weighted Average Score: 88.19". What I would like to achieve is to automatically extract the number after "Weighted Average Score: " and add it to the custom metadata “WA” or at least add it to the end of the name of each transcript (each transcript is named with the student name). Do you guys think it’s possible to do that?

Try a smart rule targeting the PDFs. Use the scan text action with a regular expression like Weighted Average Score: (\d+\.\d+) followed by a change ...to \1 action, where … is the name of you meta data field.

The exact form of three regular expression depends on your data. I assumed that it airways consists of at least one digit followed by dot followed by at least one digit.

My friend, you are genius. Love you.

1 Like

That’s very kind, but I’m already spoken for :wink:

1 Like