u/Independent-Diver929

When Reconstruction Survives the Easy Cases but Fails the Hard Ones

I think the project has now reached the point where the real question is no longer:

“Can AI reconstruct a messy file?”

It is:

“Can the reconstruction survive hostile edge cases?”

Because the dangerous failures are not usually obvious hallucinations anymore.

They are procedural distortions that still produce perfectly reasonable-looking summaries.

A few of the edge cases we are now stress-testing against:

- conflicting timestamps across systems
- forwarded emails that subtly change evidentiary meaning
- duplicate-looking records that are procedurally distinct
- downstream documents inheriting unresolved assumptions
- summaries that quietly convert uncertainty into settled fact
- contradictory narratives that are both internally coherent
- missing participants later restored into chronology
- records that only become contradictory after awareness timing is reconstructed

The interesting part is that many of these files still generate “clean” outputs.

The distortion only becomes visible once chronology and procedural context are restored.

That has shifted the testing focus away from:
“Did the AI summarize the file correctly?”

and toward:
“Did the reconstruction preserve the uncertainty structure of the file?”

At this point, the hardest problems are no longer document problems.

They are procedural-state problems.

Meaning:
- when did an assumption start propagating?
- when did uncertainty stop being treated as uncertainty?
- when did later records begin inheriting disputed context as if it were settled fact?

That is where a lot of the reconstruction logic either survives or breaks.

For people here who deal with messy litigation, investigations, compliance reviews, or chronology-heavy disputes:

What are the nastiest reconstruction failure modes you have personally encountered?

Not legal theory.

The actual procedural edge cases that turn a file into a reconstruction sinkhole.

reddit.com
u/Independent-Diver929 — 6 days ago

The biggest reconstruction failures we found were not hallucinations. They were timeline distortions.

One thing that keeps surprising us while testing reconstruction workflows:

A lot of contradictions do not exist inside individual records.

They emerge across chronology.

A message can look perfectly reasonable in isolation, but become contradictory once:
- awareness timing changes
- missing participants are restored
- assumptions migrate downstream
- or later documents inherit disputed interpretations from earlier ambiguity

The weird part is that most “clean summaries” accidentally remove the exact uncertainty signals that explain why the dispute exists in the first place.

We originally thought we were building better summarization.

At this point it feels much closer to procedural reconstruction.

——-

reddit.com
u/Independent-Diver929 — 11 days ago

A couple people in my last thread pointed out important edge cases around chronology reconstruction and duplicate-looking records, so I updated the system and reran the methodology.

The core issue:

Standard AI summarization tends to normalize and flatten records early.

That works fine until:
- chronology matters
- contradictory statements exist
- or the same communication appears in multiple contexts with different evidentiary meaning

One example from the updated reconstruction:

An original approval email, a forwarded copy of that same email, and a later invoice referencing that approval all looked superficially similar.

A normal summarizer tends to collapse them into one event.

But they are not actually the same thing.

The forwarded version changed the evidentiary meaning because it captured internal uncertainty after the alleged approval occurred.

So the system now preserves:
- chronology
- contradiction context
- duplicate-looking but distinct records
- confidence levels
- and decision weighting

instead of flattening everything into a clean narrative too early.

Current demo:

https://www.notion.so/What-Actually-Happened-Standard-AI-vs-Source-Backed-Chronology-357c42abce4080c9832ecba60617eaa2?source=copy\_link

Still looking for edge cases, failure modes, and places where the reconstruction logic breaks down.

u/Independent-Diver929 — 15 days ago

I asked a question here recently about where time actually goes in contract disputes, especially with email-heavy records.

A lot of you said the same thing:

It’s not finding documents.
It’s reconstructing what actually happened.

So I took that and built a small demo using a realistic case file.

Same dataset. Two outputs.
One is a normal AI summary.
The other reconstructs the sequence with sources and contradictions.

Happy to share if anyone wants to see it.

Curious if this lines up with how you experience these cases, or if I’m still missing something.

reddit.com
u/Independent-Diver929 — 17 days ago

I am trying to understand the workflow burden in small-firm / solo contract-dispute work when the file is messy.

I mean the kind of matter where the relevant story is spread across emails, agreements, invoices, messages, attachments, and conflicting accounts of what was said or promised.

At a certain point, the hard part stops being the legal theory and becomes reconstructing what actually happened, in what order, and which conflicts matter enough to change strategy.

For people who handle that kind of work, where does the time actually go?

Is it mostly spent on:

- gathering and normalizing the documents

- building the chronology

- reconciling conflicting statements

- isolating contradictions that matter

- or turning the file into something usable for strategy, filing, or client communication

I am especially interested in what makes one of these matters a quick reconstruction job versus a multi-hour sinkhole.

Not asking for legal advice. I am trying to understand the workflow burden in this specific kind of file.

reddit.com
u/Independent-Diver929 — 26 days ago