Hazel not able to find first Date in Document

Posted: Sat May 18, 2024 9:14 am
by Ruhrpottjung
Hello,

I have a workflow where documents are scanned first, then SynOCR runs over the document and makes it searchable via OCR.
Once this is done, Hazel will search for keywords and the first date in the document and then change the file name to DATE+keyword.pdf and sort it into the appropriate folder.

Unfortunately, Hazel treats the date in the middle of the document as the first occurrence and the actual first date as the second. For some reason I can't figure out, the order is reversed.

The document looks like this here:

[Image]

My question now: is this a bug in Hazel, or do I have to change something in SynOCR so that the order is correct in the future?

Thank you for answering my question.

Re: Hazel not able to find first Date in Document

Posted: Mon May 20, 2024 8:35 am
by Mr_Noodle
Text in the PDF may not be in the order you expect. The PDF format is so open-ended that text can be stored in the file in any order and still appear correct visually. You can use Hazel's preview function to see the text as Hazel sees it.
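
To illustrate the point about text order, here is a rough Python sketch. It does not read a real PDF and has nothing to do with how Hazel or SynOCR work internally. The text runs, their positions, the dates, and the DD.MM.YYYY pattern are all invented for the example. It only shows how "first date in stored order" can differ from "first date in visual order".

```python
import re

# Hypothetical text runs as they might be stored inside a PDF:
# (distance from the top of the page, text). The stored order is
# arbitrary and need not match the visual top-to-bottom order.
stored_runs = [
    (400, "Payment due by 15.06.2024"),  # middle of the page, stored first
    (60, "Invoice"),
    (90, "Date: 02.05.2024"),            # top of the page, stored later
    (700, "Thank you for your business"),
]

# Assumed date format for this sketch: DD.MM.YYYY
DATE_RE = re.compile(r"\b\d{2}\.\d{2}\.\d{4}\b")


def first_date(runs):
    """Return the first date found when reading the runs in the given order."""
    for _, text in runs:
        match = DATE_RE.search(text)
        if match:
            return match.group()
    return None


# Reading in stored order, as a plain text extractor may do.
stored_first = first_date(stored_runs)

# Reading in visual order: sort by vertical position, top of the page first.
visual_first = first_date(sorted(stored_runs, key=lambda run: run[0]))

print("stored order:", stored_first)
print("visual order:", visual_first)
```

If the preview shows the dates in an unexpected order, something like the first case is probably what is happening in your file.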