QUOTE (tigre @ Mar 11 2003 - 08:32 AM)
Isn't it just a matter of taste somehow? If you want more security (and have enough time and patience) why not going for p<=0.01?
In a strictly proper test, the bar is set beforehand, and then you determine whether you achieved it after the completion of the test. One problem with setting the bar at 0.01 instead of at 0.05, is that you have a greater chance of not getting there -- and no fair lowering the bar to 0.05 post-test if you fail to get to 0.01!
Also, if you set the bar too high, you run the risk of making an error in the opposite direction (i.e., saying there isn't a difference when there really is one). However, in practice, almost nobody here worries about these types of errors, being much more concerned instead with not making the mistake of saying there is a difference when there really isn't one.
QUOTE
Let's say I do an ABX test on a hard-to-spot sample/setting. I'll need some practice/training rounds to be prepared. When I start the test and my 1st choice is wrong (0/1) I say "this belongs to the training" and press Reset. If the 1st choice is right, but the second is wrong (1/2) I say "I was unconcentrated and need to focus better" and press Reset.
That's called cherry-picking, and in order for a test to be strictly proper, that sort of thing can't be allowed. That is, every trial must count. So if you think you need training, train yourself before you make your first choice. The best way to avoid cherry-picking is either to make all the trials blind (so you can't see your results in real-time) or to use some sort of ABX profile (which nobody has implemented, so far).
ff123