<< Hide Menu
4 min read•june 18, 2024
Avanish Gupta
Jed Quiaoit
Avanish Gupta
Jed Quiaoit
When we collect data, we will record data from two different variables at the same time. Sometimes we have two categorical variables such as “class” and “does homework on time,” for example. We will want to know whether certain classes, such as juniors, finish their homework on time more than compared to other classes.
Bivariate categorical data refers to data that consists of two categorical variables. To analyze and understand the relationship between these two variables, we can use various graphical and statistical techniques.
One way to visualize bivariate categorical data is by using a histogram, which is a bar chart that shows the frequency or count of data points that fall into different categories. For example, if we have data on the class level (junior, senior, etc.) and whether or not students in each class finish their homework on time, we can create a histogram that shows the frequency of students in each class who do and do not finish their homework on time. 📚
A scatterplot is a useful tool for visualizing the relationship between two quantitative variables. To create a scatterplot, we plot the values of one variable on the x-axis and the values of the other variable on the y-axis. This allows us to see if there is a pattern or trend in the data and to assess the strength and direction of the relationship between the two variables. ✊
In the plant example, we could create a scatterplot of plant height and amount of fertilizer used to see if there is a relationship between these two variables. If we observe a positive relationship, it would suggest that using more fertilizer is correlated with taller plants. On the other hand, if we observe a negative relationship, it would suggest that using more fertilizer is correlated with shorter plants. 🌱
In both cases, we are trying to see whether both variables are related to each other. If we know that two variables are related to each other, then we may be able to predict the behavior of another variable if we know the value of one variable. Keep in mind that finding that two variables do not influence each other can also be just as strong of a analytical discovery than finding that they do influence each other. 🧲
Remember that the relationship between two variables can be positive, negative, or no relationship at all.
🎥 Watch: AP Stats - Scatterplots and Association
© 2024 Fiveable Inc. All rights reserved.