I have a Spark DataFrame with one column that contains a lot of zeros and very few ones (only 0.01% of the values are ones). I'd like to take a random subsample, but a stratified one, so that it keeps the ratio of 1s to 0s in that column.
Is it possible to do this in Spark? Thanks a lot!