If I want to use an undersampling approach to construct the machine learning model, I am wondering if there are any criteria to determine how many times I should sample the data from the majority group (the minority is 14% and the majority is 86%) and build the ML model? I am working with biological data and I am recommended not to use oversampling approaches. To determine which sampling approach we should use, are there any criteria to determine before constructing the ML model? Is it really dependent on the field of study?
Aucun commentaire:
Enregistrer un commentaire