Statistics: a Process of Making Inferences

Help Questions

Statistics › Statistics: a Process of Making Inferences

Questions 1 - 10
1

A school wants to estimate the proportion of all students at the school who bring lunch from home on a typical day. The office randomly selects 80 student IDs from the full student roster and asks those students whether they brought lunch from home today; 52 say yes. Why is random sampling important in this situation?

It guarantees that exactly 52 out of every 80 students at the school bring lunch from home

It allows the school to conclude that bringing lunch from home causes better grades

It means students were randomly assigned to bring lunch from home or buy lunch

It helps reduce bias so the sample is more likely to represent all students at the school

Explanation

Statistics uses samples to infer characteristics of populations. The population here is all students at the school, and the sample is the 80 randomly selected students surveyed about bringing lunch. The parameter is the proportion of all students who bring lunch from home, with the statistic being 52/80 from the sample. Random sampling is crucial because it reduces bias, making the sample more representative and supporting valid inferences to the population. A reasonable inference is that the population proportion is likely near 52/80, though not exactly, and it doesn't imply causation like better grades. People often confuse random sampling with random assignment, but sampling selects who to measure, not assigns treatments. To transfer this: ask 'Who do we want to know about?' (all students) and 'Who did we measure?' (the 80 selected).

2

A town wants to estimate the proportion of all households in the town that have a backyard garden. A researcher randomly selects 150 addresses from the town’s address database and finds that 45 of the selected households report having a backyard garden. Which statement is a reasonable inference?

If a household has a backyard garden, it must have been randomly selected

Exactly 45 households in the entire town have backyard gardens

Having a backyard garden causes households to use less water

About $45/150$ of all households in the town likely have backyard gardens, though the exact town proportion may differ

Explanation

We use statistics to infer from samples to populations. The population is all households in the town, and the sample is the 150 randomly selected addresses surveyed. The parameter is the proportion of all households with backyard gardens, with the statistic 45/150. Random sampling helps reduce bias for better representation and generalization. Inference: the population proportion is probably around 45/150, but may differ, and doesn't imply causation like water usage. Common mistake: equating sample results exactly to the population, overlooking variability. Transfer: ask 'Who do we want to know about?' (all households) and 'Who did we measure?' (150 selected).

3

A streaming service wants to estimate the proportion of all active subscribers in the United States who watched at least one documentary last week. It randomly selects 500 U.S. active subscriber accounts and finds that 210 watched at least one documentary. What population parameter is being estimated?

The proportion of all active U.S. subscribers who watched at least one documentary last week

The mean number of documentaries watched by the 500 sampled accounts last week

The claim that watching documentaries causes subscribers to stay active longer

The proportion of the 500 sampled accounts that watched at least one documentary last week

Explanation

Statistics uses samples for population inferences. The population is all active U.S. subscribers, and the sample is the 500 randomly selected accounts. The parameter is the proportion who watched at least one documentary last week, statistic 210/500. Random sampling minimizes bias, supporting reliable generalizations. Reasonable: parameter likely near 210/500, but not exactly, and no causation like watching causing longer activity. Misconception: confusing parameter (population) with statistic (sample). Ask: 'Who do we want to know about?' (all U.S. subscribers) and 'Who did we measure?' (500 selected).

4

A city library wants to estimate the proportion of all registered library card holders in the city who prefer e-books over printed books. The library uses a computer to randomly select 120 card holders from its registration list and surveys them; 48 say they prefer e-books. What population parameter is being estimated?

The mean number of e-books read per month by all city residents

The proportion of the 120 surveyed card holders who prefer e-books

The proportion of all registered library card holders in the city who prefer e-books

The claim that preferring e-books causes people to visit the library less often

Explanation

In statistics, we use samples to make inferences about larger populations when it's impractical to survey everyone. Here, the population is all registered library card holders in the city, and the sample is the 120 randomly selected card holders who were surveyed. The parameter of interest is the proportion of the entire population who prefer e-books, while the sample statistic is the proportion of the 120 (48/120) who prefer e-books. Random sampling is important because it reduces bias and helps ensure the sample represents the population, allowing for reliable generalizations. Based on this, we can reasonably infer that the population proportion is likely around 48/120, but we cannot know it exactly without surveying everyone. A common misconception is that the sample proportion equals the population proportion exactly, but samples provide estimates that may vary. To apply this, ask: 'Who do we want to know about?' (all card holders) and 'Who did we measure?' (the 120 surveyed).

5

A university wants to estimate the proportion of all students currently enrolled who own a bicycle. It randomly selects 100 students from the enrollment list and finds that 34 own a bicycle. Which inference is supported by this random sample?

Owning a bicycle causes students to be selected into the sample

At least 34 students in every class at the university own a bicycle

The university’s true proportion of students who own a bicycle is likely near $34/100$, though it may not be exactly $34/100$

Exactly 34% of all enrolled students own a bicycle

Explanation

Statistics infers from samples to populations. Population: all currently enrolled students, sample: 100 randomly selected. Parameter: proportion owning a bicycle in population, statistic: 34/100. Random sampling reduces bias for generalization. Inference: population proportion likely near 34/100, but not exactly, no causation or per-class guarantees. Error: thinking sample proportion is exactly population's, ignoring sampling error. Ask: 'Who do we want to know about?' (all enrolled) and 'Who did we measure?' (100 selected).

6

A pet supply store wants to estimate the mean amount (in dollars) spent per purchase among all purchases made at the store this weekend. The store’s register system randomly selects 40 receipts from all weekend receipts and computes a sample mean of $18.50. Which statement best describes how the sample is used?

The sample mean $18.50$ is a statistic used to estimate the population mean spending per purchase this weekend

The random sample proves that spending $18.50$ causes customers to buy pet supplies again

The $40$ receipts are the population, and the weekend receipts are the sample

The sample mean $18.50$ is the population mean, so no estimation is needed

Explanation

We use statistics to estimate populations from samples. Population: all purchases this weekend, sample: 40 randomly selected receipts. Parameter: mean spending per purchase in population, statistic: sample mean $18.50. Random sampling cuts bias, aids inference. The statistic estimates the parameter, likely close but not identical, no causation like spending causing repeats. Misconception: sample mean as exact population mean, but it's an estimate. Transfer: ask 'What do we want to know about?' (all weekend purchases) and 'What did we measure?' (40 receipts).

7

A bike manufacturer wants to estimate the mean time (in minutes) it takes to assemble all bikes produced on a particular day. The quality team uses a random-number generator to select 25 bike serial numbers from that day’s production list and records each selected bike’s assembly time. Why might the sample mean differ from the true population mean?

Because random samples can vary from sample to sample, even when taken from the same population

Because the sample mean is a population parameter, not a statistic

Because the bikes were randomly assigned to be assembled faster or slower

Because random sampling forces the sample mean to equal the population mean every time

Explanation

In statistics, samples estimate population values. The population is all bikes produced that day, sample the 25 randomly selected for timing. Parameter: mean assembly time for population, statistic: sample mean. Random sampling reduces bias for valid inferences. The sample mean may differ from the population mean due to variability, not because it equals it every time or involves assignment. Misconception: random sampling vs. assignment—sampling selects, assignment allocates treatments. Strategy: ask 'What do we want to know about?' (all bikes that day) and 'What did we measure?' (25 selected).

8

A cereal company wants to estimate the population mean fill weight (in grams) of all cereal boxes produced at Plant A today. Every 10 minutes, a computer randomly selects one box from the conveyor belt; by the end of the day, 30 boxes are weighed. Which statement correctly describes the population and the sample?

Population: all boxes produced at Plant A today; Sample: the 30 randomly selected boxes that were weighed

Population: the 30 boxes weighed; Sample: all boxes produced at Plant A today

Population: the 30 boxes weighed; Sample: the 30 boxes not weighed

Population: all cereal boxes produced by the company this year; Sample: the 30 boxes weighed at Plant A today

Explanation

Statistics involves using data from samples to draw conclusions about populations. In this case, the population is all cereal boxes produced at Plant A today, and the sample is the 30 randomly selected boxes that were weighed. The parameter is the mean fill weight of all boxes in the population, and the statistic is the mean weight of the 30 sampled boxes. Random sampling matters as it minimizes bias and supports inferences from the sample to the population. We can infer that the population mean is likely close to the sample mean, but it's not guaranteed to be identical due to sampling variability. A misconception is confusing the sample with the population, like thinking the 30 boxes represent the entire year's production instead of just today's. Remember: ask 'Who (or what) do we want to know about?' (all today's boxes) and 'Who (or what) did we measure?' (the 30 selected).

9

A school district wants to estimate the mean number of minutes of homework completed per night by all 9th-grade students in the district. The district randomly selects 50 9th-grade students from an enrollment list and records each student's minutes of homework last night. Which statement correctly describes the population and the sample?

Population: the 50 selected students; Sample: all 9th-grade students in the district

Population: all students in the district; Sample: the 50 selected 9th-grade students

Population: all 9th-grade students in the state; Sample: the 50 randomly selected 9th-grade students

Population: all 9th-grade students in the district; Sample: the 50 randomly selected 9th-grade students

Explanation

Statistics uses samples to make inferences about populations, helping us understand groups like students without measuring everyone. Here, the population is all 9th-grade students in the district, and the sample is the 50 randomly selected 9th-graders whose homework minutes were recorded. The parameter is the mean homework minutes for the entire population, and the statistic is the mean from the sample. Random sampling matters as it minimizes bias, making the sample more representative and allowing better generalizations to the population. We can reasonably infer the population mean is close to the sample mean, but we can't know it exactly without checking everyone. A misconception is confusing population with sample, thinking the sample is the whole group. Remember: ask 'Who do we want to know about?' (all 9th-graders in the district) and 'Who did we measure?' (the 50 selected).

10

A bike shop wants to estimate the mean repair cost for all repairs completed last month. The owner randomly selects 30 repair invoices from all invoices dated last month and computes the sample mean repair cost. Why might the owner get a different sample mean if a different random sample of 30 invoices were selected?

Because random sampling forces all samples to have the same mean

Because random samples can vary, different subsets of invoices may have different average costs

Because selecting invoices at random proves which mechanic caused higher costs

Because random sampling assigns repair costs to invoices at random

Explanation

Statistics uses samples to infer about populations, estimating costs without reviewing every invoice. The population is all repair invoices from last month, and the sample is the 30 randomly selected invoices whose mean cost was calculated. The parameter is the mean repair cost for the population, with the statistic being the sample mean. Random sampling is important to reduce bias, allowing representation and generalization. Different samples might yield different means due to variability, but not because it forces equality or assigns costs randomly. Misconception: assuming random sampling eliminates all differences, but variability persists. Transfer: ask 'What do we want to know about?' (all invoices last month) and 'What did we average?' (the 30 selected).

Page 1 of 5