Gist:
I'm looking for a randomization algorithm that can for each subject, look at all previous measurements, determine if a subject would fall into the high or low end, then ensure that approximately half of each of these groups get assigned to an experimental condition.
Details:
I'm building a research application that has two independent variables. One is an experimental manipulation which I can assign. We'll call these X (experimental) and C (control). The other is personal characteristic with two categorical types, measured through a scale. We'll call these P1 (type 1) and P2 (type 2).
So it's essentially a 2x2 where I have 4 conditions (P1X, P1C, P2X, P2C). I'm recruiting about 120 subjects so ideally I'd have a distribution of 30 subjects in each condition.
I have three problems.
1) Based on the literature I'm expecting a natural 50/50 split between P1 and P2 characteristics in my sample. However, I can't be sure as my population isn't what I'd consider the general population which is where the split estimate is derived from.
2) A simple randomization of the X or C manipulation won't necessarily guarantee equal distribution. This exacerbates the first problem as if I see e.g. P1 40% and P2 60% split, 50% (X or C) of 48 leaves me with 24 people. Of course, it could be worse. If the random assignment of the experimental manipulation ends up less than 50% for this smaller sub-sample, then again, it could be worse. The fear is I could be left with too small a sample to run my analyses.
3) Another complication is that the categories, P1 and P2 are sometimes less definitive and more relative. Usually we'd just split P1 and P2 via median. It is determined on the aggregate value of several scale measurements (total range 0-20). However, in my sample we might see some bias toward one end. In this case, I might have to do a relative comparison. In this case, I don't know where the median would lie to split my sample. But whatever the median becomes, I would then say something like, "these people are more P1 than P2" or vice versa. But I don't know what the average or SD would be for my sample.
What I can do in my experiment:
What I can do is measure people's P1/P2 types before assigning them to either X or C. I can't pre-test everyone before assignment so I'll only know the bigger picture 1 subject at a time until it stabilizes with enough sample.
Question:
So the question in short is, is there an randomization algorithm that can adapt to these unknowns as I know more from one experiment to the next?
Basically I want to measure the subject's P1/P2 measure, compare it against the entire sample up to that point and find out if they would be more or less likely to be in the upper-half toward P2 or lower-half toward P1. Then after I figure this out, I want to assign them to either X or C in a way that ensures better than simple random that I'll get equal number of participants in each condition.
I'm not sure if this is the right place to ask, maybe Stack Cross-Validated? Anyway, thanks ahead of time if you have some suggestions.
Extra note:
The application is programmed completely in Javascript.
Aucun commentaire:
Enregistrer un commentaire