Statistical Testing Error
In science and in life, we are always dealing with a large amount of annoying uncertainty. That’s just how it goes. If we waited to be certain in all situations before speaking or acting, we’d never say or do anything. When we do our probabilistic testing of hypotheses, we can expect to make errors periodically. There are two types of errors that we make, and from which we hope to learn.
TYPE I, OR ALPHA ERROR
We reject the null hypothesis when it is in reality true. (We accuse our friend of using a loaded coin when our friend was really playing fair.)
The alpha level not only sets the cutoff point at which we will reject the null hypothesis; it also sets the likelihood of committing this type of error. When we set an alpha level of .05, we are committing ourselves to a 5% chance of rejecting a true null hypothesis, at a p-level of .01 it’s 1%, at .001, it’s .1%. The lower the alpha level, the less chance of committing this type of error. But this certainty has a cost. The lower we set the alpha level, the greater the chance that we will commit a beta error and accept a false null hypothesis. We sacrifice what we call POWER in order to use a very low alpha level. This other competing type of error, which we must try to balance out the chance of, is:
TYPE II, OR BETA ERROR
We fail to reject the null hypothesis when it is in reality false. (We assume our friend is playing fair when in fact he or she is playing with a loaded coin.)
Beta is the probability of making this type of error. The implication of this type of error is that in our quest to be scrupulous about not accepting a chance or random effect as an actual systematic effect of interest, we ignore a possible effect of interest.