hello,
I´m new with hazel and no english native speaker.
I have lots of scanned documents here (manuals, receipts, invoices, contracts, etc.) , which were saved as PDF's right after scanning.
My OCR engine at the time did not reliably convert the PDF's to text PDF, so there are still lots of unprocessed image PDF's.
My problem is that I can´t recognize from the "outside" whether it is an image or text PDF.
I would have to open each PDF and try to highlight text. After about 3000 PDFs.
Here are my questions:
1) How do I teach hazel what to look for in the PDF content so that it can clearly distinguish image PDF from text PDF?
2) Finally, how do I search for image PDFs so that I can move them to a separate directory so that an OCR can subsequently convert all image PDFs to text PDF?