Lets say that I want to randomly sample a database with id
of the following form
id
A
A
A
B
C
C
D
D
E
F
Instead of doing a completely random sample how would I sample 50% of the distinct groups. The difference is that a random sample would give me half of all of the records while a grouped sample would give me half of all of the distinct ids
.
This is necessary because I would not want to partially sample an id
. If an A
is selected then all 3 A
s should be included in this kind of grouped random sample.
Lets say for example the group set {A
,B
,C
} was randomly selected. The query would produce the following
id
A
A
A
B
C
C
Aucun commentaire:
Enregistrer un commentaire