Statistical Inference

Help Questions

USMLE Step 1 › Statistical Inference

Questions 1 - 10
1

Which of the following is the most accurate interpretation of this 95% confidence interval?

If the study were repeated many times, 95% of the calculated confidence intervals would contain the true mean difference.

There is a 95% probability that the true mean difference in LDL reduction lies between 15 and 35 mg/dL.

95% of patients in the study had an LDL reduction between 15 and 35 mg/dL.

There is a 5% chance that the new statin is ineffective.

Explanation

A 95% confidence interval provides a range of plausible values for the true population parameter. The correct frequentist interpretation is that if the same study were conducted 100 times, 95 of the resulting confidence intervals would be expected to contain the true mean difference in the population.

2

In this scenario, the researchers have made which of the following types of error?

Type II error

Selection bias

Confounding

Type I error

Explanation

A Type I error occurs when the null hypothesis is incorrectly rejected. The null hypothesis states there is no difference between the groups. Here, the researchers concluded there was a difference (rejected the null) when in fact no true difference existed. This is the definition of a Type I error (false positive). The probability of making a Type I error is denoted by α.

3

The initial study's failure to detect a real effect is an example of which of the following?

Type II error

Lead-time bias

Type I error

Recall bias

Explanation

A Type II error occurs when one fails to reject a null hypothesis that is actually false. In this case, the initial study failed to find a significant effect (did not reject the null hypothesis of no difference) when a true effect existed. This is the definition of a Type II error (false negative). The probability of making a Type II error is denoted by β.

4

Which of the following measures of central tendency would be the most appropriate to describe the typical income in this community?

Range

Mean

Mode

Median

Explanation

In a skewed distribution, the mean is heavily influenced by extreme values (outliers). The median represents the 50th percentile and is resistant to outliers. Therefore, for skewed data such as income, the median provides a more accurate representation of the central or 'typical' value than the mean.

5

Which of the following is the most effective way for the researchers to increase the power of their study?

Increase the p-value threshold for significance (e.g., from 0.05 to 0.10)

Enroll a more heterogeneous patient population

Increase the number of participants in the study

Decrease the number of participants in the study

Explanation

Power is the ability of a study to detect a true effect (1 - β). The power of a study is primarily influenced by the sample size, effect size, and alpha level. Increasing the sample size is the most common and effective method to increase statistical power, as it reduces the standard error and makes it easier to detect a true difference between groups.

6

Which of the following best describes the standard error of the mean (SEM)?

It measures the variability of individual newborn weights around the population mean.

It is an estimate of the standard deviation of the distribution of sample means.

It describes the 95% range for the individual measurements.

It is always larger than the standard deviation.

Explanation

The standard deviation (SD) measures the variability or spread of individual data points within a single sample. The standard error of the mean (SEM) estimates the variability of the means of multiple samples taken from the same population. It quantifies how precisely the sample mean estimates the true population mean and is calculated as SD / √n.

7

Which of the following best describes the relationship between the measures of central tendency for this distribution?

Mean = Median = Mode

Mode < Median < Mean

Mean < Median < Mode

Mean = Mode, but different from Median

Explanation

This describes a negatively skewed (left-skewed) distribution. In such a distribution, the outliers are on the lower end. The mode is the most frequent value (the peak), which is 8 hours. The median is less affected by the low-value outliers than the mean. The mean is pulled downward by the low values. Therefore, the relationship is Mean < Median < Mode.

8

How would the 95% confidence interval (CI) for the mean reduction in recovery time in Study B compare to that of Study A?

The CI width would be identical in both studies.

The CI in Study B would be shifted to the right.

The CI in Study B would be wider.

The CI in Study B would be narrower.

Explanation

The width of a confidence interval is inversely related to the square root of the sample size. A larger sample size (like in Study B) leads to a smaller standard error of the mean, resulting in a more precise estimate of the true population parameter. This increased precision is reflected by a narrower confidence interval.

9

Which of the following statements represents the most critical consideration when interpreting this result?

Statistical significance does not necessarily imply clinical significance.

The study must have been underpowered.

The result is not valid because the p-value is close to 0.05.

A Type II error may have occurred.

Explanation

While the result is statistically significant (p < 0.05), the absolute difference in survival is only 3%. A critical step in interpreting research is to evaluate whether a statistically significant finding is also clinically meaningful. A small, clinically unimportant difference can become statistically significant if the sample size is very large. Clinicians must decide if a 3% survival benefit justifies the potential costs, side effects, and risks of the new therapy.

10

This pre-specified probability threshold is known as which of the following?

The power of the study

The effect size

The beta (β) level

The alpha (α) level

Explanation

The alpha (α) level, or significance level, is the pre-specified probability of committing a Type I error. It is the threshold below which a p-value is considered statistically significant. By convention, α is typically set to 0.05, meaning the researchers accept up to a 5% chance of incorrectly rejecting a true null hypothesis.

Page 1 of 2