Distinguishing Correlation and Causation

Help Questions

Statistics › Distinguishing Correlation and Causation

Questions 1 - 10
1

A news headline states: “Drinking more water leads to lower afternoon fatigue.” The article cites an analysis of 500 office workers showing that those who reported drinking more cups of water per day (quantitative) also reported lower fatigue ratings on a 1–10 scale (quantitative). Workers were not assigned to drink specific amounts; the data came from a survey (observational study). Can we conclude that drinking more water causes lower fatigue? Why or why not?

No, because fatigue ratings are quantitative only if measured in minutes, not on a 1–10 scale.

No, because the study is observational; it shows an association between water intake and fatigue but cannot establish causation without random assignment.

Yes, because with 500 workers the sample size guarantees a causal conclusion.

Yes, because the headline matches the observed pattern in the survey data.

Explanation

This focuses on correlation versus causation in health-related surveys. Correlation describes an association, like more water intake relating to lower fatigue ratings. Causation requires random assignment in an experiment to isolate the effect. The survey is observational, with no assigned water amounts, so it shows association but not that water causes less fatigue—lifestyle factors could confound it. Answer B is justified as the observational design doesn't support the headline's causal claim. Many think large sample sizes prove causation, but that's false; design is key over size. A strategy: always check 'Were participants randomly assigned to treatments?' to assess causation.

2

A supermarket manager compared weekly data over a year and found that weeks with higher advertising spending (quantitative) tended to have higher total sales revenue (quantitative), a positive association. The manager did not randomly assign ad spending levels; spending varied based on season and promotions (observational study). Which statement is the most reasonable conclusion?

Higher advertising spending is associated with higher sales, but the observational design does not justify concluding that increasing ad spending causes sales to increase.

Increasing advertising spending causes sales to increase because the relationship is positive.

Higher sales cause the manager to spend more on advertising, so the causal direction is confirmed.

Because the data cover a full year, confounding variables cannot affect the relationship.

Explanation

We're examining correlation versus causation in business data. Correlation means variables are associated, like higher ad spending with higher sales. To claim causation, random assignment or control is needed to rule out confounders. This observational study with varying ad levels doesn't allow concluding ad spending causes sales increases—seasonal factors might influence both. Answer A is justified by the design limiting us to association. It's common to assume long-term data eliminates confounders, but without control, causation isn't proven. Check: 'Were participants randomly assigned to treatments?' for causation evaluation.

3

A teacher looked at her class records and found that students who spent more minutes per day on a math practice website (quantitative) tended to have higher end-of-unit test scores (quantitative). Students chose how much to practice; the teacher only collected practice logs and test scores (observational study). Which statement best describes what the data show?

More practice time causes higher test scores because students who practiced more scored higher.

There is no relationship because not every student who practiced more had a higher score.

Higher test scores cause students to spend more time practicing, so the direction of causation is established.

More practice time is associated with higher test scores, but the study design does not justify concluding that practice time causes higher scores.

Explanation

We're distinguishing correlation from causation in educational data. Correlation is an observed association, such as more practice time linking to higher test scores. For causation, we need random assignment to ensure no other factors are influencing the results. This is an observational study with students choosing their practice time, so we can't say practice causes better scores—motivation might affect both. Answer A is right because the design only supports association, not causation. A frequent misconception is that if most who practice more score higher, it proves causation, but individual variations and confounders prevent that. Ask yourself: 'Were participants randomly assigned to treatments?' for similar scenarios.

4

Researchers randomly assigned 120 students to either listen to music while studying or study in silence for 30 minutes. Afterward, each student took the same vocabulary quiz and received a score (quantitative). The music group had a lower average quiz score than the silence group. Which statement is the most reasonable conclusion?

Because the quiz scores are quantitative, the study can only show correlation, not causation.

The results do not allow any conclusion because experiments cannot be used to study cause-and-effect.

Because students were randomly assigned, the results support the conclusion that studying with music can reduce vocabulary quiz performance compared with studying in silence.

The results show that lower quiz scores cause students to choose to listen to music while studying.

Explanation

The concept is differentiating correlation and causation in study results. Correlation is just an association, but experiments can go further. Causation is supported by random assignment, which helps control for other variables. Here, students were randomly assigned to music or silence, so the lower scores in the music group suggest music can cause reduced performance. Answer A is correct due to the experimental design enabling causal inference. A misconception is that quantitative measures alone imply causation, but random assignment is what matters. To apply this, ask: 'Were participants randomly assigned to treatments?'

5

A school counselor reviewed records from 300 students and found that students who reported more hours of sleep per night (quantitative) tended to have higher average quiz scores (quantitative), showing a positive association. The counselor did not assign sleep amounts; the data came from a voluntary survey and existing grade records (observational study). Which statement is the most reasonable conclusion?

Higher quiz scores cause students to sleep more, so the direction of causation is established.

Because the sample is fairly large, the association proves that sleep hours cause quiz scores to increase.

Because this is an observational study, we can say sleep hours are associated with quiz scores, but we cannot conclude that more sleep causes higher quiz scores.

Getting more sleep causes higher quiz scores because the association is positive.

Explanation

The key concept here is distinguishing between correlation and causation in data analysis. Correlation means there is an association or relationship between two variables, like how more sleep hours tend to go with higher quiz scores in this study. To establish causation, we need an experiment with random assignment to control for confounding factors that might explain the association. This scenario is an observational study because the counselor just reviewed existing records without assigning sleep amounts, so we can only say there's an association, not that more sleep causes better scores. The correct answer, B, is justified because the observational design doesn't rule out confounders like study habits affecting both sleep and scores. A common misconception is that a strong or positive correlation proves causation, but that's not true without experimental control. To apply this elsewhere, always ask: 'Were participants randomly assigned to treatments?'

6

A researcher found a negative correlation between the number of absences a student had during a semester (quantitative) and the student’s final course grade (quantitative): more absences tended to go with lower grades. The researcher used existing attendance and grade records and did not assign absences (observational study). Which statement best describes what the data show?

Absences and grades are associated, but because the study is observational we cannot conclude that absences cause grades to change.

The data prove that improving grades will reduce absences, so the causal direction is from grades to absences.

More absences cause lower grades because the correlation is negative.

Because the data come from school records rather than a survey, the relationship must be causal.

Explanation

This question addresses correlation and causation in school performance data. Correlation is an association, seen in the negative link between absences and grades. Causation needs random assignment to treatments to isolate effects. As an observational study using records without assigning absences, it shows association but not that absences cause lower grades—illness might affect both. Answer B is correct because the design doesn't support causation. People often think data from reliable sources like records prove causation, but study type is crucial. Always ask: 'Were participants randomly assigned to treatments?' to determine if causation is possible.

7

A fitness app company analyzed data from its users and found a negative correlation between daily minutes of exercise (quantitative) and resting heart rate (quantitative): users who exercised more tended to have lower resting heart rates. The company did not assign exercise levels; it only observed self-reported exercise and measured heart rate from wearable devices (observational study). Can we conclude that increasing exercise causes resting heart rate to decrease? Why or why not?

No, because correlation can only be positive; a negative relationship cannot be meaningful.

Yes, because wearable-device measurements are accurate, so causation is proven.

No, because without random assignment this observational study shows association but does not establish that exercise causes changes in resting heart rate.

Yes, because a negative correlation means exercise directly lowers resting heart rate.

Explanation

We're exploring the difference between correlation and causation when interpreting relationships in data. Correlation indicates an association, such as the negative link where more exercise minutes are tied to lower resting heart rates. Causation requires an experiment with random assignment to groups to minimize the influence of other variables. Since this is an observational study with self-reported data and no assigned exercise levels, we can't conclude that exercise causes lower heart rates—other factors like overall health might be at play. Answer A is correct because the lack of random assignment in the observational design prevents causal claims. People often mistakenly think a negative correlation directly implies causation, but correlation alone, regardless of direction, doesn't prove cause-and-effect. A useful strategy is to check: 'Were participants randomly assigned to treatments?'

8

A city’s monthly report shows that when average outdoor temperature (quantitative) is higher, electricity use per household (quantitative) is also higher, indicating a positive correlation. The report uses historical meter readings and weather records; no variables were controlled or assigned (observational data). Which statement is the most reasonable conclusion?

Higher temperature causes electricity use to rise because the relationship is positive.

Because the data come from official records, no other variables could explain the association.

Temperature is associated with electricity use, but because the data are observational we cannot conclude temperature causes the change in electricity use from this report alone.

Electricity use causes outdoor temperature to increase, so reducing electricity use will cool the weather.

Explanation

The core idea is separating correlation from causation when analyzing patterns in data. Correlation means two variables move together, like higher temperatures associating with higher electricity use. Establishing causation demands controlling variables through random assignment, which isn't present here. This observational data from records shows association but can't confirm temperature causes increased electricity use, as confounders like air conditioning habits could explain it. Answer C is correct because the observational nature limits us to association without causal proof. It's a misconception that official or large datasets automatically imply causation, but study design matters more than data source. To transfer this, always inquire: 'Were participants randomly assigned to treatments?'

9

In a randomized experiment, 80 volunteers were randomly assigned to drink either a caffeinated beverage (about 100 mg caffeine) or a caffeine-free beverage each morning for one week. At the end of the week, their average reaction time on a computer test (quantitative) was recorded. The caffeinated group had faster average reaction times than the caffeine-free group. Which statement best describes what the data show?

Because this was an experiment, we can only conclude caffeine and reaction time are associated, not that caffeine affects reaction time.

The results show that faster reaction time causes people to choose caffeinated drinks.

Because the study used random assignment, the results support the conclusion that caffeine can affect reaction time.

The results prove that caffeine will always make every person faster, with no exceptions.

Explanation

This question highlights the distinction between correlation and causation in research findings. Correlation is simply an association between variables, but here we have more because it's an experiment. To infer causation, random assignment to treatments is essential to balance out potential confounders across groups. In this randomized experiment, volunteers were assigned to caffeinated or non-caffeinated drinks, so the faster reaction times in the caffeinated group support that caffeine can cause improved reaction times. Answer A is justified by the experimental design with random assignment, which allows for causal conclusions. A common misconception is that experiments prove effects for everyone without exceptions, but they show average effects in the studied group. Remember to ask: 'Were participants randomly assigned to treatments?' to evaluate causation.

10

A company tested a new phone setting designed to reduce screen time. In a randomized experiment, 200 users were randomly assigned to either have the setting turned on or left off for two weeks. At the end, each user’s average daily screen time in minutes (quantitative) was recorded. The group with the setting turned on had lower average daily screen time. Can we conclude that turning on the setting causes a reduction in screen time? Why or why not?

No, because screen time is quantitative, which prevents drawing causal conclusions.

Yes, because users were randomly assigned to the setting conditions, so the difference in average screen time supports a causal effect of the setting.

Yes, because any difference in group averages proves the setting will reduce screen time for every individual user.

No, because random assignment only shows correlation and cannot support cause-and-effect conclusions.

Explanation

Here, we differentiate correlation from causation in tech experiments. Correlation is an association, but random assignment elevates it to potential causation. Causation is inferred when groups are randomly assigned, balancing confounders. In this randomized experiment, assigning the phone setting led to lower screen time, supporting that the setting causes reduction. Answer B is justified by the design allowing causal conclusions. A misconception is that experiments guarantee effects for all individuals, but they indicate average effects. Use this strategy: ask 'Were participants randomly assigned to treatments?' to check for causation.

Page 1 of 5