Dataset Splitting

Global: fixed point in time for all users
Given-n: train with the first n items, test with the rest

A dataset has to be split into one for training a ML model, and one to test/evaluate it.

Random splitting … use specific ratio for splitting

Time based splitting: choose Cut-Off point in time to separate testing/training data

K-Fold Cross Validation: split dataset in (1/k) different fragments

Ederbit