@c3josh wrote:
If we are doing feature engineering using, for example, category means, should we compute them inside or outside the cross-validation loop?
E.g., for 5-fold cross-validation, should we use mean("whole_data_set") for both training and holdout, or should we use mean("4_training_sets") for training and mean("holdout_set") for the holdout?
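
To make the two options concrete, here is a rough sketch in plain Python. It assumes "category means" refers to the mean of a numeric column `y` within each category; the toy `data`, the helper `category_means`, and the interleaved fold split are all made up for illustration and are not from any particular library.

```python
from collections import defaultdict

def category_means(rows):
    """Return {category: mean of y} for a list of (category, y) pairs."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for cat, y in rows:
        totals[cat] += y
        counts[cat] += 1
    return {cat: totals[cat] / counts[cat] for cat in totals}

# Hypothetical toy data: (category, y) pairs.
data = [("a", 1.0), ("b", 4.0), ("a", 3.0), ("b", 6.0), ("a", 5.0),
        ("b", 8.0), ("a", 7.0), ("b", 10.0), ("a", 9.0), ("b", 12.0)]
K = 5  # 5-fold cross-validation

# Option 1 (outside the loop): mean("whole_data_set"),
# reused unchanged for every fold.
whole_means = category_means(data)

# Option 2 (inside the loop): recompute the means for each fold.
fold_means = []
for fold in range(K):
    # Simple interleaved split: row i is held out when i % K == fold.
    train = [row for i, row in enumerate(data) if i % K != fold]
    holdout = [row for i, row in enumerate(data) if i % K == fold]
    train_means = category_means(train)      # mean("4_training_sets")
    holdout_means = category_means(holdout)  # mean("holdout_set")
    fold_means.append((train_means, holdout_means))

print(whole_means)
print(fold_means[0])
```

With option 1 every holdout row has already contributed to the feature it is later scored on, whereas with option 2 the training-fold means never see the holdout rows. That difference is what the question comes down to.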