Has anyone seen AI successfully understand and reflow complex PDF layouts at scale yet?
We looked at this problem years ago and the technology just wasn’t there yet.
Extracting text is easy enough, but actually understanding reading order, multi-page articles, layout hierarchy, images, captions, etc. always seemed to break down fast once layouts became more complex.
Curious if anyone has seen recent AI approaches that genuinely solve this well now?
u/Environmental-Ad1175 — 13 days ago