These 3 pictures will help to explain the problem:
Picture (1)
Picture (3)
Picture (4)A typical example: A tester wants to get a p-value < 0.05, but he doesn't decide to perform e.g. 8 trials before the test. Instead he would stop the test whenever a p-value < 0.5 is reached (-> yellow numbers in Picture (1)). These results (5/5), (7/8), ... are the "stop points" of the test.
Problem: After each stop point the possibilities of 'movement' or 'ways' in the triangle are reduced. E.g. (5/6) can't be reached from (5/5) anymore because (5/5) will stop the test. This is why the numbers in pascal's triangle in picture (4) are changed compared to picture (3). The number of ways to reach the 2nd stop point (7/8) is reduced from 8 to 5.
Now the total proability (corrected p-value or 'c-value') to finish this test 'successfully' (= max. number of trials: 8, test stops when p-value < 0.05) is:
c-value (7/8) = 1/32 + 5/256 = 0.051
That doesn't seem very bad, but here are some c-values for bigger number of trials (p-value that stops the test 0.05):
15 trials -> 0.079
30 trials -> 0.129
50 trials -> 0.158
100 trials -> 0.202
Now the question is: Should we just modify ABX software to force people to specify a fixed number of trials before testing - showing p-values or show c-values if the number of trials hasn't been fixed before?
With this approach there might another problem - I quote a PM Schnofler sent me recently about this:
QUOTE
If I understood it correctly the idea goes like this:
The p-value, that is "the probability to get c or more correct in n trials if you guess blindly", doesn't give an accurate measure of "probability that you were guessing" because it doesn't take into account that the listener might just stop as soon as he has the value he wants and continue otherwise. So what we do is, we calculate the "probability to reach your current or a better p-value with up to n trials" and call this "corrected p-value" or c-value as you do in your source. Sounds nice, but why don't we go a step further and calculate the "probability to reach your current or a better c-value with up to n trials", because after all the listener could just stop as soon as he has his desired c-value or continue otherwise.
That's why I called it a "hack", that is c-values don't take a fundamentally different approach to calculate the measurement we'd like to have ("probability you were guessing"), but just try to "patch up" the approach we already have, and in the end leave you with the same problem you started out with.
I'm not sure about this, but if the tester isn't forced to specify a p-value that stops the test before the test starts - and the software stops or continues the test based on this automatically, Schnofler's thought is probably right. If a tester is allowed to watch c-values and stop the test based on them, we would need 'corrected c-values', 'corrected corrected c-values' ...
I think I have found a sollution for this problem - there might be better ones, but anyway - here it is:
The goal is: no matter how long the test is going to take, the c-value must not become higher than e.g. 0.05. Every stop point will 'consume' a part of this c-value. It's necessary to make sure that adding the probabilities of each stop point, the sum can never be bigger than the c-value we want to reach (here 0.05). A simple approach for something like this:
2^(-1) + 2^(-2) + 2^(-3) + ... + 2^(-n) < 1 , no matter how big n gets.
We have to choose the stop points like this (easier for me to explain from an example):
Picture (5)Desired c-value: c = 0.05 or lower.
1/2*0.05 = 0.025, so the 1st stop point must have a probability p < 0.025. This is the case for 6/6 correct trials with p = 1/64 = 0.0156
So 1st stop point: 6/6
c -> c - p = 0.05 - 0.0156 = 0.0344, the remaining "c" for the rest of the stop points.
Condition for the next stop point:
p < 0.5 * 0.0344 = 0.0172
From the table it's obvious that for the next stop point (n-1)/n the p is 6/2^n
For n=9: p = 6/512 = 0.0117
So 2nd stop point: 8/9
c -> c - p = 0.0344 - 0.0117 = 0.0227
p < 0.5 * 0.0227 = 0.0114
p = 33/2^n for next stop point (n-2)/n
for n=12: p = 33/4096 = 0.0081
so 3rd stop point: 10/12
c -> c - p = 0.0227 - 0.0081 = 0.0146
p < 0.5 * 0.0146 = 0.00730
p = 182/2^n for next stop point (n-3)/n
for n=16: p = 182/32768 = 0.0056
so 34d stop point: 12/15
c -> c - p = 0.0146 - 0.0056 = 0.0090
...
This way, it would be possible that the tester specifies a "probability that you could get that score by guessing" = c-value he wants to reach, and the ABX software tells where the stop points are - or just works as we're used to: It displays the current c-value based on the stop points it calculated from the 'goal c-value'.
Puh... writing this was hard, reading too I guess. As reward here's a little toy: I created some small dos-box program that can calculate c-values. It's attatched to this post. Enjoy

Edited: "probability that you're guessing" replaced with "probability that you could get that score by guessing"