Selecting Appropriate Inference Procedures: Categorical Data
Help Questions
AP Statistics › Selecting Appropriate Inference Procedures: Categorical Data
A school district wants to know whether a new text-message reminder system changes the proportion of parents who attend parent-teacher conferences. Two similar middle schools are chosen: one uses reminders (n = 180 parents invited) and the other does not (n = 200 parents invited). Attendance is recorded as Yes/No, and the counts are 108 Yes in the reminder school and 90 Yes in the no-reminder school. Which inference procedure is most appropriate to determine whether the reminder system is associated with a different attendance rate?
Two-sample $z$ test for a difference in proportions
Matched-pairs $t$ test for a mean difference
One-sample $z$ interval for a population proportion
Chi-square test for goodness of fit
Two-sample $t$ interval for a difference in means
Explanation
This question assesses the skill of selecting appropriate inference procedures for categorical data in AP Statistics, specifically comparing proportions from two independent samples. The scenario involves two similar middle schools, one with a reminder system and one without, recording Yes/No attendance, making it a comparison of two proportions to see if the reminder is associated with a different rate. The two-sample z-test for a difference in proportions is most appropriate because it handles independent samples and tests for a difference in population proportions based on the given counts. Other options like the one-sample z-interval are incorrect as they apply to single proportions, while the chi-square goodness of fit tests distributions against expected values, not comparisons between groups; the matched-pairs t-test is for quantitative data, and the two-sample t-interval is also for means. A common distractor is choosing chi-square for association, but since there are only two groups and binary outcomes, the two-sample z-test is equivalent and often preferred for direct proportion comparison. To align methods, remember that for two independent samples with categorical responses, check conditions like large sample sizes (np and n(1-p) ≥10 for each) before proceeding with z-procedures. This mini-lesson highlights that procedure selection depends on the number of samples, variable type, and whether you're estimating or testing.
A company compares customer satisfaction (Satisfied/Not Satisfied) between three store locations (North, Central, South). A random sample of customers from each location is surveyed and the results are summarized in a $3\times 2$ table of counts. The company wants to know whether satisfaction depends on location. Which inference procedure is most appropriate?
One-sample $z$ test for a population proportion
Two-sample $z$ interval for a difference in proportions
Chi-square test for goodness of fit
Chi-square test for association (independence) in a two-way table
Matched-pairs $t$ test for a mean difference
Explanation
In AP Statistics, inference for categorical data requires selecting procedures based on study design, here a 3x2 table comparing satisfaction across three locations to test dependence. The chi-square test for association (independence) is appropriate as it examines if satisfaction and location are related in the two-way table of counts. This method computes expected frequencies under independence and tests for significant differences. Distractors like the two-sample z-interval are for two groups only, not three; goodness of fit is for one variable; the one-sample z-test is single-proportion; and matched-pairs t-test is quantitative paired data. Mini-lesson: For more than two groups with categorical outcomes, use chi-square for homogeneity (equivalent to association here), checking expected counts ≥5. This ensures the method matches the multi-group comparison without reducing to pairwise tests.
Researchers want to determine whether there is an association between smoking status (smoker, nonsmoker) and exercise frequency (0–1 days/week, 2–4 days/week, 5–7 days/week) among adults in a city. They take a random sample of 420 adults and record each person’s smoking status and exercise category. Which inference procedure is most appropriate?
Chi-square test for independence
Two-sample $z$ interval for a difference in proportions
One-sample $z$ test for a population proportion
Matched-pairs $t$ test for a mean difference
Chi-square test for goodness of fit
Explanation
This question involves examining the relationship between two categorical variables: smoking status (2 categories) and exercise frequency (3 categories). The chi-square test for independence is specifically designed to test whether two categorical variables are associated. The chi-square goodness of fit (option B) tests a single variable against expected frequencies. The z-tests (options C and D) are for proportions, typically with binary outcomes. The matched-pairs t-test (option E) requires paired quantitative data. When investigating associations between two categorical variables in a contingency table, always use the chi-square test for independence.
A school district wants to know whether the proportion of students who prefer taking tests on a computer differs between middle school and high school students. A random sample of 120 middle school students and 150 high school students is selected. Each student is asked whether they prefer computer-based tests (Yes/No). Which inference procedure is most appropriate to compare the two groups’ preferences?
Chi-square test for goodness of fit
Matched-pairs $t$ test
One-sample proportion $z$ test
Chi-square test for association (independence) with more than two response categories
Two-proportion $z$ test
Explanation
This question tests your ability to select the appropriate procedure for comparing proportions between two independent groups. We have two distinct populations (middle school and high school students) and want to compare the proportion who prefer computer-based tests in each group. The two-proportion z test is designed specifically for this scenario - comparing proportions between two independent samples. The chi-square test for association would also work here but is less direct, while the one-sample proportion test only handles a single group. The matched-pairs test requires paired data, which we don't have since these are different students. When comparing proportions between two independent groups with a binary response (Yes/No), the two-proportion z test provides the most straightforward and powerful approach.
A university wants to determine whether the proportion of students who live on campus is the same across four class years (First-year, Sophomore, Junior, Senior). A random sample of 80 students from each class year is selected, and each student is recorded as living on campus (Yes/No). Which inference procedure is most appropriate?
Chi-square test for association (independence) in a $4\times2$ table
One-sample proportion $z$ interval
Matched-pairs two-proportion interval
Chi-square test for goodness of fit with four categories
Two-proportion $z$ test
Explanation
This question involves examining whether the proportion of students living on campus is the same across four class years. We can view this as testing for association between two categorical variables: class year (4 categories) and residence status (2 categories: on/off campus). The chi-square test for association in a 4×2 table tests whether these variables are independent - that is, whether the proportion living on campus is the same across all class years. The goodness of fit test would require specific expected proportions for each class, which we don't have. The two-proportion z test can only compare two groups at a time, not four simultaneously. When testing whether proportions are equal across more than two groups, the chi-square test for association provides a single, comprehensive test of the null hypothesis that all proportions are equal.
A public health official wants to estimate the difference in the proportion of residents who have received a flu shot between two counties. In County A, a random sample of 250 residents finds 160 vaccinated. In County B, a random sample of 220 residents finds 120 vaccinated. Which inference procedure is most appropriate to estimate the difference in proportions?
Chi-square test for association (independence)
One-sample proportion $z$ interval
Two-proportion $z$ interval for $p_A - p_B$
One-sample $t$ interval for a mean
Two-proportion $z$ test for $p_A - p_B$
Explanation
This problem asks for estimation rather than hypothesis testing, as indicated by the word 'estimate' in the question. We have two independent samples (County A and County B) and want to estimate the difference in vaccination proportions between them. The two-proportion z interval provides a confidence interval for the difference p₁ - p₂, which directly answers the question. The two-proportion z test would test a hypothesis about the difference but wouldn't provide an estimate. The one-sample proportion interval only handles a single proportion, while the chi-square test for association tests for relationships but doesn't estimate differences. When the goal is to estimate (not test) the difference between two proportions from independent samples, use the two-proportion z interval to construct a confidence interval for the difference.
A cafeteria manager believes that students choose lunch options in the following proportions: 50% hot meal, 30% salad, 20% sandwich. On one day, a random sample of 200 students is observed: 86 choose hot meal, 70 choose salad, and 44 choose sandwich. Which inference procedure is most appropriate to evaluate whether the observed distribution matches the manager’s stated proportions?
Chi-square test for goodness of fit
One-sample $z$ test for a population proportion
Chi-square test for independence
Two-sample $t$ test for a difference in means
Two-sample $z$ interval for a difference in proportions
Explanation
This problem involves testing whether observed frequencies for lunch choices match the manager's stated distribution (50% hot meal, 30% salad, 20% sandwich). The chi-square test for goodness of fit is specifically designed to test if observed data fits a hypothesized distribution for a single categorical variable. The chi-square test for independence (option A) requires two variables. The z-procedures (options B and D) work with proportions or binary outcomes. The t-test (option E) is for quantitative data. When testing if observed frequencies match expected proportions for one categorical variable, use chi-square goodness of fit.
A political scientist is studying whether political party (Democrat, Republican, Independent) is associated with preferred news source (TV, online, print). She surveys a random sample of 600 registered voters and records both categorical variables for each voter. Which inference procedure is most appropriate?
Two-sample $z$ test for a difference in proportions
Chi-square test for independence
Chi-square test for goodness of fit
Matched-pairs $t$ interval for a mean difference
One-sample $t$ interval for a population mean
Explanation
This scenario examines whether two categorical variables (political party with 3 categories and news source with 3 categories) are associated. The chi-square test for independence is the appropriate procedure for testing associations between two categorical variables in a contingency table. The two-sample z test (option A) compares proportions between two groups only. The chi-square goodness of fit (option C) tests a single variable against expected frequencies. The t-procedures (options D and E) are for quantitative data. When investigating whether two categorical variables are related, use the chi-square test for independence.
A marketing team claims that customers choose among three package designs (A, B, C) in equal proportions. In a random sample of 300 customers, 120 chose A, 96 chose B, and 84 chose C. Which inference procedure is most appropriate to assess whether the observed choices are consistent with the company’s claim?
Chi-square test for goodness of fit
Chi-square test for independence
One-sample $z$ test for a population proportion
Two-sample $z$ test for a difference in proportions
One-sample $t$ test for a population mean
Explanation
This scenario involves testing whether observed frequencies match expected frequencies for a single categorical variable with three categories (package designs A, B, C). The chi-square test for goodness of fit is designed exactly for this purpose - testing if observed data fits a hypothesized distribution (equal proportions: 33.3% each). The chi-square test for independence (option B) requires two categorical variables, not one. The z-tests (options C and E) are for proportions with binary outcomes, not multiple categories. The t-test (option D) is for quantitative data. When testing if a single categorical variable follows a specific distribution, use chi-square goodness of fit.
A researcher wants to know whether the proportion of commuters who use public transportation is different for people who work day shift versus night shift. An independent random sample of 110 day-shift workers finds 28 who use public transportation; an independent random sample of 90 night-shift workers finds 34 who use public transportation. Which inference procedure is most appropriate?
Chi-square test for goodness of fit
One-sample $t$ test for a population mean
Chi-square test for independence
Two-sample $z$ test for a difference in proportions
One-sample $z$ interval for a population proportion
Explanation
This question asks about comparing the proportion of public transportation users between two independent groups (day-shift and night-shift workers). The two-sample z test for a difference in proportions is the appropriate choice for testing whether two population proportions differ. The chi-square goodness of fit (option A) tests a single variable against expected frequencies. The one-sample procedures (options C and D) work with single populations. The chi-square test for independence (option E) requires examining association between two categorical variables, not comparing proportions. When testing for differences in proportions between two independent groups, use the two-sample z test.