That leaks node-level features from the train set into the test/validation set then no? Like if 90% of my channels are in the train set, and the top performing feature is "positional encoding" then the train set can basically learn my nodes positional encoding and whether my channels tend to be balanced or not. Then it just extrapolates that to my channels in the test/validation set.
reply