vendredi 12 mai 2023

Dividing the datasets into train, valid and test based on condition

Objective: Randomly divide a data frame into train, valid and test based on condition

one sample with 60% of the rows other two samples with 20% of the rows

The data looks like here

enter image description here

Data needs to be divided based on unique ID. so that training and valid or test should not have same unique ID

Thanks




Aucun commentaire:

Enregistrer un commentaire