<< Hide Menu
5 min read•june 18, 2024
Josh Argo
Jed Quiaoit
Josh Argo
Jed Quiaoit
The first step in actually performing any statistical significance test is to calculate the test statistic used for that particular test. Since we are carrying out a significance test for a population mean, we should calculate a t-score. 💯
The t-score is a test statistic that is used in a t-test to evaluate the significance of the difference between the sample mean and a hypothesized population mean.
The t-score is calculated by dividing the difference between the sample mean and the hypothesized mean by the standard error of the mean. (The standard error of the mean is a measure of the variability of the sample mean and is calculated by dividing the standard deviation of the sample by the square root of the sample size.)
The larger the t-score, the more significant the difference between the sample mean and the hypothesized mean is.
When calculating a t-score, we use the general formula for critical values:
Ricardo has a bag of 30 🍊s. The bag says that each orange weighs an average of 4.5 oz.
Ricardo weighs all of the oranges in his bag and finds that they have an average of 4.65 oz with a standard deviation of 0.8 oz.
The type of hypothesis test (one-tailed or two-tailed) is determined by the alternative hypothesis that you specify. If the alternative hypothesis is directional (e.g. "the population mean is greater than X"), then you would perform a one-tailed test because you're looking for a t-score only in one tail of our curve.
On the other hand, if the alternative hypothesis is non-directional (e.g. "the population mean is not equal to X"), then you would perform a two-tailed test since we are looking to find a t-score in either tail of our curve.
It's important to note that the choice of one-tailed or two-tailed test can have a significant impact on the results of the hypothesis test. A one-tailed test is more powerful than a two-tailed test because it allows you to detect a difference in a specific direction (e.g. the population mean is greater than X). However, this increased power comes at the cost of increased risk of a type I error (rejecting the null hypothesis when it is true). A two-tailed test is less powerful than a one-tailed test, but it reduces the risk of a type I error because it is not biased towards any particular direction. ✊
Back to the hypothesis test, we'll then need to reference our t-score chart to calculate our p value.
First, let's determine the degrees of freedom, which is always one less than our sample size.
Then, we find our t-score in the column matching our given degrees of freedom (df) to estimate our p-value. If the degrees of freedom can not be found in the chart, round down to the nearest df.
In our example above with Ricardo's 🍊s, our df = 29. So we need to locate 29 df on the chart and then try to find 1.027.
Perhaps a much easier way to perform a one sample t test would be to use technology such as a graphing calculator. The most commonly used calculator for AP Statistics is the Texas Instruments TI-84. 📱
When performing a one sample t test, you will first enter into the stats menu:
If the p-value is less than the predetermined level of significance (usually 0.05), it means that the observed results are unlikely to have occurred by chance alone, and you can reject the null hypothesis in favor of the alternative hypothesis. On the other hand, if the p-value is greater than the level of significance, it means that the observed results are not statistically significant and you cannot reject the null hypothesis.
🎥 Watch: AP Stats - Inference: Hypothesis Tests for Means
© 2024 Fiveable Inc. All rights reserved.