How well does local AI actually work for messy internal documents?
Most demos/examples I see are around clean internal knowledge bases.
Curious if anyone here has had success using local/self-hosted AI for more chaotic real-world document environments:
- PDFs
- contracts
- reports
- mixed folders/network drives
- scanned documents
Does retrieval quality actually hold up in practice?