Scanned image document / images preprocessing pipeline for bank and financial documents
Has anyone worked with preprocessing of documents before sending it to parsers? I am mainly working on a use case involving bank statements, financial statements and kyc documents that are mainly scanned and messy. I plan on using open source vlms for extraction post preprocessing. Have you seen any results with a good preprocessing pipeline?
u/East-Agent9391 — 6 days ago