Representing Two Categorical Variables

Help Questions

AP Statistics › Representing Two Categorical Variables

Questions 1 - 10
1

A restaurant tracked day type (Weekday or Weekend) and whether customers order dessert (Yes or No). The results are shown in the table. Which comparison is appropriate for assessing the relationship between day type and ordering dessert using conditional distributions?

Compare the percent who order dessert on weekdays to the percent who order dessert on weekends.

Compare the percent of weekend visits among dessert orders to the percent of weekend visits among non-dessert orders.

Compare the number of dessert orders on weekdays to the number of dessert orders on weekends.

Compare the overall percent of visits that occur on weekends to the overall percent that occur on weekdays.

Compare the overall percent who order dessert to the overall percent who do not.

Explanation

This question tests understanding of conditional distributions when analyzing the relationship between day type and dessert ordering. To assess this relationship, we need to compare the proportion of customers who order dessert within each day type - specifically, the percent who order dessert on weekdays versus the percent who order dessert on weekends. Choice C correctly identifies this comparison. Options B and E look at marginal distributions, D compares raw counts, and A conditions on dessert ordering instead of day type. When examining associations between categorical variables, we typically condition on the explanatory variable (day type) and compare how the response variable (dessert ordering) is distributed across those conditions.

2

A streaming service sampled users and recorded subscription type (Basic or Premium) and whether the user watches at least 10 hours per week (Yes or No). The results are shown in the table. To evaluate whether weekly viewing time is associated with subscription type, which comparison is appropriate using conditional distributions?

Compare the overall percent of users who are Premium to the overall percent who are Basic.

Compare the percent of Premium users among those who watch at least 10 hours per week to the percent of Premium users among those who watch less than 10 hours per week.

Compare the number of users who watch at least 10 hours per week to the number who watch less than 10 hours per week.

Compare the overall percent who watch at least 10 hours per week to the overall percent who watch less than 10 hours per week.

Compare the percent who watch at least 10 hours per week among Basic users to the percent who watch at least 10 hours per week among Premium users.

Explanation

This question asks about comparing conditional distributions to assess the relationship between subscription type and viewing habits. To determine if weekly viewing time is associated with subscription type, we should compare the proportion of users who watch at least 10 hours per week within each subscription category - that is, the percent who watch heavily among Basic users versus the percent who watch heavily among Premium users. Choice A correctly identifies this comparison of conditional distributions. Options B and E examine marginal distributions, D compares raw counts, and C conditions on the wrong variable (viewing time instead of subscription type). The key insight is that we condition on the explanatory variable (subscription type) and examine how the response variable (viewing time) differs across those conditions.

3

A school surveyed 200 students about whether they participated in a school sport this year and whether they reported sleeping at least 8 hours on most school nights. The two-way table shows the results. Which comparison is appropriate for assessing whether sleep amount is associated with sports participation using conditional distributions?

$\ge 8$ hours__MATH_EXPR_1__lt;8$ hours
Plays a sport5446
Does not play a sport3862

Compare the overall proportion who play a sport to the overall proportion who do not play a sport.

Compare the proportion who play a sport among those who sleep $\ge 8$ hours to the proportion who play a sport among those who sleep __MATH_EXPR_1__lt;8$ hours.

Compare the counts of students who sleep $\ge 8$ hours in the two sport categories.

Compare the overall proportion who sleep $\ge 8$ hours to the overall proportion who sleep __MATH_EXPR_1__lt;8$ hours.

Compare the proportion who sleep $\ge 8$ hours among those who play a sport to the proportion who sleep $\ge 8$ hours among those who do not play a sport.

Explanation

This question assesses the skill of representing two categorical variables, specifically using conditional distributions to evaluate the association between sports participation and sleep amount. The appropriate comparison is to condition on sports participation and compare the proportions of students sleeping at least 8 hours across the two sports categories, as this reveals whether the distribution of sleep differs by sports group. For example, the conditional proportion of >=8 hours given plays a sport is 54/100 = 0.54, while given does not play is 38/100 = 0.38, indicating an association. A common distractor is choice B, which conditions on sleep instead and compares sports participation across sleep groups, but the question's setup with sports as rows suggests conditioning on sports for the comparison. Another distractor is choice A, which uses overall proportions without conditioning, failing to assess association properly. In a mini-lesson on conditional distributions: these are found by dividing cell counts by the relevant row or column total, allowing us to examine how one variable's outcomes vary depending on the category of the other variable, which is key for detecting associations in two-way tables.

4

A university sampled 220 students, recording whether each student lives on campus and whether they have a meal plan. The two-way table shows the results. Which comparison is appropriate for assessing whether having a meal plan is associated with living on campus using conditional distributions?

Has meal planNo meal plan
On campus8822
Off campus4466

Compare the proportion with a meal plan among on-campus students to the proportion with a meal plan among off-campus students.

Compare the counts of students with a meal plan in the on-campus and off-campus groups.

Compare the proportion living on campus among those with a meal plan to the proportion living on campus among those without a meal plan.

Compare the overall proportion living on campus to the overall proportion living off campus.

Compare the overall proportion with a meal plan to the overall proportion without a meal plan.

Explanation

The skill involved is representing two categorical variables with conditional distributions to assess association between living situation and meal plan status. The appropriate comparison conditions on living situation and compares meal plan proportions: 88/110 = 0.80 for on-campus versus 44/110 = 0.40 for off-campus, showing association. This aligns with row conditioning. Distractor choice B reverses by conditioning on meal plan and comparing living situation. Choice A uses overall meal plan proportions without conditioning. Mini-lesson: Conditional distributions are row or column percentages (cell divided by marginal), used to compare how one variable's distribution differs across categories of the other, with differences evidencing association.

5

A company surveyed employees about whether they work remotely at least 3 days per week (Remote/Not Remote) and whether they report high job satisfaction (High/Not High). The results are shown in the table. Which comparison is appropriate for assessing whether job satisfaction is associated with remote work status?

Categorical variables: Remote work status and Job satisfaction level.

Compare the overall percent high satisfaction to the overall percent not high satisfaction.

Compare the counts of high satisfaction employees between remote and not-remote groups.

Compare the percent remote among employees with high satisfaction to the percent remote among employees without high satisfaction.

Compare the marginal totals in the Remote and Not Remote rows.

Compare the percent with high satisfaction among remote employees to the percent with high satisfaction among not-remote employees.

Explanation

This question examines the skill of representing two categorical variables, remote work status and job satisfaction level, through conditional distributions for association. The correct approach is comparing the percent with high satisfaction among remote employees versus non-remote, conditioning satisfaction on remote status as explanatory. This aligns with evaluating if job satisfaction is associated with remote work, implying remote status influences satisfaction. Choice A distracts by reversing the conditioning, comparing remote status given satisfaction levels. For a mini-lesson, conditional distributions are computed as percentages of the response within explanatory groups; high satisfaction rates in remote and non-remote categories. Differences indicate association. Phrasing helps identify the explanatory variable correctly.

6

A streaming service sampled users and recorded whether they have a premium subscription (Premium/Standard) and whether they watched at least 10 hours last week (10+ hours/Under 10). The two-way table summarizes the sample. Which comparison is appropriate for assessing whether weekly viewing level is associated with subscription type?

Categorical variables: Subscription type and Weekly viewing level.

Compare the percent 10+ hours among premium users to the percent 10+ hours among standard users.

Compare the marginal percent 10+ hours to the marginal percent under 10 hours.

Compare the percent premium among 10+ hour viewers to the percent premium among under-10-hour viewers, because viewing level should be the explanatory variable.

Compare the counts of 10+ hour viewers in each subscription group, without using conditional percents.

Compare the overall percent premium to the overall percent standard.

Explanation

This question evaluates the skill of representing two categorical variables, subscription type and weekly viewing level, via conditional distributions to check for association. The suitable comparison is the percent of 10+ hours viewers among premium users versus among standard users, conditioning viewing level on subscription type as the explanatory variable. This fits the question's inquiry into whether viewing level is associated with subscription type, suggesting we examine viewing distributions given subscription. Choice A is a typical distractor, reversing the conditioning to treat viewing as explanatory, which contradicts the question's phrasing. For a mini-lesson, conditional distributions compute the proportions of one variable's categories within levels of another; compare percent 10+ hours for premium and standard groups. Significant differences in these percentages signal an association. Proper identification of the explanatory variable is key to selecting the right conditional comparison.

7

A company surveyed employees about work arrangement (Remote or In-office) and whether they report high job satisfaction (High/Not high). Use conditional distributions to describe the association between work arrangement and job satisfaction.

Which comparison is appropriate?

Compare the percent with high satisfaction among remote employees to the percent with high satisfaction among in-office employees.

Compare the percent who are remote among those with high satisfaction to the percent who are remote among those without high satisfaction.

Compare the number of remote employees with high satisfaction to the number of in-office employees with high satisfaction.

Compare the overall percent of employees who work remotely to the overall percent who work in-office.

Compare the overall percent with high satisfaction to the overall percent without high satisfaction.

Explanation

This question requires identifying the appropriate conditional distribution to examine the association between work arrangement and job satisfaction. To determine if work arrangement affects satisfaction, we need to condition on work arrangement and compare satisfaction rates. Option C correctly compares the percent with high satisfaction among remote employees to the percent with high satisfaction among in-office employees - this conditional comparison directly reveals whether satisfaction differs by work arrangement. Options A and B examine marginal distributions that don't address the relationship, while option D reverses the conditioning by looking at work arrangement given satisfaction level. Option E uses counts rather than percentages, failing to account for potentially different group sizes. The principle remains consistent: condition on the explanatory variable (work arrangement) and compare the response variable (satisfaction) across its categories.

8

A gym surveyed members about membership plan (Monthly or Annual) and whether they attend at least 3 times per week (Yes or No). The results are shown in the table. Which comparison is appropriate for assessing the relationship between membership plan and frequent attendance using conditional distributions?

Compare the percent of Annual-plan members among frequent attendees to the percent of Annual-plan members among non-frequent attendees.

Compare the overall percent who attend at least 3 times per week to the overall percent who do not.

Compare the number of frequent attendees on the Monthly plan to the number of frequent attendees on the Annual plan.

Compare the percent who attend at least 3 times per week among Monthly-plan members to the percent who attend at least 3 times per week among Annual-plan members.

Compare the overall percent of members on the Annual plan to the overall percent on the Monthly plan.

Explanation

This question asks about comparing conditional distributions to assess the relationship between membership plan and gym attendance frequency. To determine if these variables are associated, we should compare the proportion of members who attend frequently within each membership plan - that is, the percent who attend at least 3 times per week among Monthly-plan members versus the percent who attend at least 3 times per week among Annual-plan members. Choice D correctly identifies this comparison of conditional distributions. Options A and E examine marginal distributions, C compares raw counts rather than proportions, and B conditions on attendance frequency instead of membership plan. The key is to condition on the explanatory variable (membership plan) and examine how the response variable (attendance frequency) differs across those conditions.

9

A teacher recorded homework completion (Usually completes or Does not usually complete) and whether students passed the unit test (Pass or Fail). The results are shown in the table. To interpret the relationship using conditional distributions, which comparison is appropriate?

Compare the percent of students who usually complete homework among those who passed to the percent of students who usually complete homework among those who failed.

Compare the percent who passed among students who usually complete homework to the percent who passed among students who do not usually complete homework.

Compare the overall percent of students who usually complete homework to the overall percent who do not usually complete homework.

Compare the number of students who passed to the number who failed.

Compare the overall percent who passed to the overall percent who failed.

Explanation

This question tests understanding of conditional distributions when analyzing the relationship between homework completion and test performance. To interpret this relationship, we need to compare the proportion of students who passed within each homework completion category - specifically, the percent who passed among students who usually complete homework versus the percent who passed among students who don't usually complete homework. Choice A correctly identifies this comparison. Options B and E look at marginal distributions, C compares raw counts, and D conditions on test outcome instead of homework completion. When examining associations between categorical variables, we typically condition on the explanatory variable (homework completion) and compare how the response variable (test performance) is distributed across those conditions.

10

A city health department collected data on neighborhood type (Urban or Suburban) and whether residents get a flu shot (Yes or No). The results are shown in the table. Which comparison is appropriate for assessing the relationship between neighborhood type and flu-shot status using conditional distributions?

Compare the percent of Urban residents among those who got a flu shot to the percent of Urban residents among those who did not.

Compare the number of Urban residents who got a flu shot to the number of Suburban residents who got a flu shot.

Compare the percent who got a flu shot within Urban neighborhoods to the percent who got a flu shot within Suburban neighborhoods.

Compare the overall percent of residents who are Urban to the overall percent who are Suburban.

Compare the overall percent who got a flu shot to the overall percent who did not.

Explanation

This question asks about using conditional distributions to assess the relationship between neighborhood type and flu-shot status. To determine if these variables are associated, we should compare the proportion of residents who get flu shots within each neighborhood type - that is, the percent who get flu shots among Urban residents versus the percent who get flu shots among Suburban residents. Choice C correctly identifies this comparison of conditional distributions. Options A and E examine marginal distributions, B compares raw counts instead of proportions, and D conditions on flu-shot status rather than neighborhood type. The principle here is to condition on the explanatory variable (neighborhood type) and examine how the response variable (flu-shot status) varies across those conditions.

Page 1 of 5