Hazel not able to find first Date in Document



Hello,

I have a workflow in which documents are scanned first; SynOCR then processes each document and makes it searchable via OCR.
Once this is done, Hazel searches for keywords and the first date in the document, renames the file to DATE+keyword.pdf, and sorts it into the appropriate folder.

Unfortunately, Hazel treats the date in the middle of the document as the first occurrence and the actual first date as the second. For some reason I can't figure out, the order is reversed.

The document looks like this:

[Image]

My question: is this a mistake by Hazel, or do I have to change something in SynOCR so that the order is correct in future?

Thank you for answering my question.
Ruhrpottjung
 

Text in the PDF may not be in the order you expect. The PDF format is so open-ended that text can be stored in the file in arbitrary order and still appear correct visually. You can use Hazel's preview function to see the text as Hazel does.
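
To illustrate the point, here is a minimal Python sketch. It is not how Hazel or SynOCR actually work internally; the fragment list, the coordinates, and the `first_date` helper are all made up for the example. It only shows that "the first date" depends on whether text is read in stored order or in visual (top-to-bottom) order.

```python
import re

# Hypothetical text fragments as (x, y, text); y grows downward from the top
# of the page. The list order mimics the order in which the fragments happen
# to be stored in the file, which need not match what you see on the page.
fragments = [
    (50, 400, "Payment due by 15.04.2024"),
    (50, 100, "Invoice date: 01.03.2024"),
    (50, 250, "Keyword: Insurance"),
]

# Matches dates written as DD.MM.YYYY (an assumption about the date format).
DATE = re.compile(r"\b\d{2}\.\d{2}\.\d{4}\b")


def first_date(frags):
    """Return the first date found when reading the fragments in the given order."""
    for _x, _y, text in frags:
        match = DATE.search(text)
        if match:
            return match.group()
    return None


stored_order = fragments
# Visual reading order: top to bottom, then left to right.
visual_order = sorted(fragments, key=lambda f: (f[1], f[0]))

print(first_date(stored_order))  # 15.04.2024 (date from the middle of the page)
print(first_date(visual_order))  # 01.03.2024 (date at the top of the page)
```

If the text in Hazel's preview appears in an unexpected order, that would suggest the ordering comes from how the text layer was written into the PDF rather than from the date match itself.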
Mr_Noodle
Site Admin
 

