u/thegreatestrang

Temporal data splitting

Temporal data splitting

Really need help with solving this paper’s problem to avoid data leakage. I need to think of a way to deal with overlapping nodes while splitting by edges. I’m thinking of creating 2 graphs
(1) training graph where loss and metrics are scored on nodes that have a timestamp =< cut off timestamps. Overlapping nodes will still appear in graph but take no roles other than message passing.
(2) inference graph where metrics are scored on future nodes and overlapping nodes.
(Noted: most “message” are from edges)
Is this okay?

u/thegreatestrang — 1 day ago