u/Erius_Fayre

It is two models working together:

Professor Compressor🧑🏼‍🏫
Johnny Guesser🙋🏻

Professor Compressor🧑🏼‍🏫

sees currentWorldState, converts it to compressed(currentState)
sees nextWorldState, converts it to compressed(nextState)

The constraint is that Professor Compressor🧑🏼‍🏫 needs to compress a state with like 100% detail to like 5% ^(example) detail.

Professor Compressor🧑🏼‍🏫 gives Johnny Guesser🙋🏻 the compressed(currentState), and asks him to guess the compressed(nextState).

Professor Compressor🧑🏼‍🏫 is learning from trial&error – which 5% details he should keep, and which 95% to discard – to help Johnny Guesser🙋🏻 guess better.

and our Johnny Guesser🙋🏻, of course, is trying his best to guess the next state through trial&error.

Over time, we see that Professor Compressor🧑🏼‍🏫 gives better and better compressions for Johnny Guesser🙋🏻 to guess from/about.

and Johnny Guesser🙋🏻 gives better and better predictions of what the next state will be.

Is this an accurate analogy for JEPA?

just watched the world 7s highlights