Plucking amounts out of PDFs

Get help. Get answers. Let others lend you a hand.

Moderator: Mr_Noodle

Plucking amounts out of PDFs Thu Feb 02, 2017 10:10 pm • by WTL
I have a pile of PDFs like this one (an image of one in this case):
Image

That have been OCR'd via my Scansnap scanner.

The scanned text looks like this:
Patty Bo land's 101 Clarence St. Ottawa, Ont.
Table #200
Trans #: 210809 Serv; Tiffers
10/16/2015 5:38 PM # Cust:1 Quan Descript Cost
1 Soft Drink $3.75 1 Bacon Delight Bu:ger $16.00 1 ->1/2 Fry 1/2 Salad $1.00
Net Total: $20.75 HST $2.70
TOTAL: $23.45 Amount Due: $23.45
Watch for New Daily Specials


What I'd like to do is pluck out the HST amount ($2.70) and the final total ($20.75) and put those amounts in the file name.

My rules already can pull the date out and rename it correctly but how to get those two amounts out is a bit puzzling.

Suggestions? Thanks!
WTL
 
Posts: 1
Joined: Mon Sep 16, 2013 12:25 pm

Re: Plucking amounts out of PDFs Fri Feb 03, 2017 1:06 pm • by Mr_Noodle
Make use of the text preceding it to grab the amounts. Like "HST $(number)". I forget if (number) includes decimal points so you may need to account for that, or use (anything) instead.
Mr_Noodle
Site Admin
 
Posts: 11872
Joined: Sun Sep 03, 2006 1:30 am
Location: New York City


Return to Support