Fit Lines to Scatter Plots
Help Questions
8th Grade Math › Fit Lines to Scatter Plots
A scatter plot shows (x = number of chores completed, y = allowance earned). Two students drew different lines. Student 1’s line has many points about 1–2 dollars away from it. Student 2’s line has many points about 5–6 dollars away from it. Which line is a better fit, and why?
Student 2’s line, because being farther from the points means the line is more accurate.
Student 2’s line, because it is steeper and steeper lines are always better.
Student 1’s line, because the vertical distances (residuals) from the points to the line are smaller for most points.
Both lines are equally good, because a best-fit line must have all points on one side of it.
Explanation
This question tests informally fitting a straight line to a scatter plot showing linear association and assessing fit by judging point closeness to the line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of chores vs allowance, comparing two lines where one has smaller vertical distances (1-2 dollars away) vs larger (5-6 away), the one with smaller residuals fits better. Student 1’s line is better, because the vertical distances (residuals) from the points to the line are smaller for most points. Common errors include preferring steeper lines or thinking farther distances are more accurate. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
A line of best fit for a scatter plot of (x = minutes practiced, y = free throws made) is drawn as $y \approx 0.8x + 2$. Using this line, about how many free throws would be predicted for $x=20$ minutes?
About 10
About 22
About 16
About 18
Explanation
This question tests informally fitting a straight line to a scatter plot showing linear association and assessing fit by judging point closeness to the line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of minutes practiced vs free throws made showing upward linear trend, using the line y ≈ 0.8x + 2 to predict for x=20 gives y ≈ 18, assessing fit by how well it approximates actual points. The correct prediction is about 18, calculated as 0.8*20 + 2 = 18, using the line for estimation. Common errors include miscalculating the prediction like choosing 16 (forgetting to add 2) or 22 (using wrong slope). Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
A student drew a line of best fit for a scatter plot of (x = number of texts sent, y = phone battery percent used). One point is far away from the rest (an outlier). Which choice best describes how the line should be drawn?
Draw the line through the outlier and ignore the rest of the points.
Draw a curved line so it hits every point exactly, including the outlier.
Draw the line to fit the main cluster of points, without letting the one outlier determine the slope.
Draw a horizontal line, because outliers mean there is no relationship at all.
Explanation
This question tests informally fitting a straight line to a scatter plot showing linear association and assessing fit by judging point closeness to the line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of texts sent vs battery used with one outlier, fitting line through the main cluster's center, assessing fit by closeness of majority points, ignoring the outlier's pull. The best approach is to draw the line to fit the main cluster of points, without letting the one outlier determine the slope. Common errors include drawing through the outlier only or using a curved or horizontal line. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
A scatter plot shows the relationship between distance biked (x) and time (y) for a student biking at a steady pace. Which statement best explains why a straight line is a reasonable model?
A straight line is reasonable because the points follow a roughly straight upward trend, showing a linear relationship.
A straight line is reasonable because the line should be drawn through the highest point and the lowest point only.
A straight line is reasonable only if every point is exactly on the line.
A straight line is not reasonable because time and distance can never be related.
Explanation
This question tests informally fitting a straight line to a scatter plot showing linear association and assessing fit by judging point closeness to the line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of distance biked vs time showing upward linear trend at steady pace, fitting line through approximate center sloping upward, assessing fit by observing most points close to the line (good fit for linear relationship). The best explanation is that a straight line is reasonable because the points follow a roughly straight upward trend, showing a linear relationship. Common errors include thinking lines must hit every point or only connect extremes. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
A student plotted the number of push-ups completed (x) and the time in seconds to finish (y). The points show a negative linear trend (more push-ups means more time). Two lines are suggested. Which line is the better fit?
Line A is worse, because a best-fit line must pass through every point exactly.
Line B, because it is as steep as possible and passes through an extreme point.
Line A, because it goes through the middle of the point cloud and leaves about the same number of points above and below.
Line B, because a best-fit line should make all points fall above the line.
Explanation
This question tests informally fitting a straight line to a scatter plot showing linear association and assessing fit by judging point closeness to the line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of push-ups vs time showing downward linear trend, fitting line through approximate center sloping downward, assessing fit by observing most points within ±5 seconds of line vertically (good fit), or comparing two possible lines where one passes closer to majority of points (better fit). The correct choice is Line A, because it goes through the middle of the point cloud and leaves about the same number of points above and below, capturing the negative trend. Common errors include prioritizing extreme points or expecting the line to pass through every point. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
The scatter plot shows the relationship between the number of practice shots made in basketball ($x$) and free-throw percentage in a game ($y$). Two students drew possible lines.
Line A: $y=4x+40$
Line B: $y=6x+30$
Based on the scatter plot, which line is a better fit (closer to more points overall)?
Line B, because it is steeper and reaches higher scores faster.
Line A, because it goes through the middle of the point cloud and has smaller vertical distances for most points.
Line B, because a best-fit line should pass through the highest point.
Neither line can fit because a line must pass through every point.
Explanation
Tests informally fitting straight line to scatter plot showing linear association and assessing fit by judging point closeness to line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of practice shots vs free-throw percentage showing upward linear trend, fitting line through approximate center sloping upward, assessing fit by observing most points close to the line vertically (good fit), or comparing two possible lines where one passes closer to majority of points (better fit). The correct line is Line A (y=4x+40) because it goes through the middle of the point cloud and has smaller vertical distances for most points (choice A). A common error is choosing a steeper line because it reaches higher values faster or passes through the highest point, but the best fit balances closeness to all points, not extremes. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
A scatter plot shows the relationship between the number of text messages sent in a day ($x$) and the number of minutes spent on the phone ($y$). A student drew the line $y=-2x+120$.
The points on the scatter plot clearly trend upward from left to right.
What is the best critique of the student’s line?
The line is wrong because it crosses the $y$-axis.
The line is correct because negative slopes always balance points above and below.
The slope is the wrong direction; the line should have a positive slope to match the upward trend.
The line is wrong because a best-fit line must go through every point.
Explanation
Tests informally fitting straight line to scatter plot showing linear association and assessing fit by judging point closeness to line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of text messages vs phone minutes with upward trend, a downward-sloping line fails to capture the positive association. The best critique is that the slope is the wrong direction; the line should have a positive slope to match the upward trend (choice A). A common error is accepting a negative slope because it balances points or crosses the y-axis, but the slope must match the trend direction. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
A scatter plot shows the relationship between the number of laps swum ($x$) and total time in minutes ($y$). A student drew a line of best fit.
To judge how well the line fits, which distances should you mainly look at?
The vertical distances from the points to the line (how far up or down each point is from the line).
The horizontal distances from the points to the line (how far left or right each point is from the line).
The distance from each point to the origin.
Only whether the line touches at least two points.
Explanation
Tests informally fitting straight line to scatter plot showing linear association and assessing fit by judging point closeness to line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of laps swum vs time, assessing fit by measuring vertical distances from points to the line, judging smaller averages as better. The correct distances to look at are the vertical distances from the points to the line (choice A). A common error is using horizontal distances or distances to the origin, but vertical residuals matter for predicting y from x. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
A science class measured the temperature outside ($x$ in °F) and the number of cups of lemonade sold at lunch ($y$). The scatter plot points trend upward in a roughly straight pattern.
Should a straight line be a reasonable model for this relationship?
Yes, because the best-fit line should always be horizontal to balance points.
No, because a line is only reasonable if every point is exactly on the line.
No, because upward trends must be modeled with a curve, not a line.
Yes, because the points show a positive linear association (they follow an upward straight trend).
Explanation
Tests informally fitting straight line to scatter plot showing linear association and assessing fit by judging point closeness to line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of temperature vs lemonade sales showing upward linear trend, fitting line through approximate center sloping upward, assessing fit by observing most points close to the line vertically (good fit). The correct assessment is yes, a straight line is reasonable because the points show a positive linear association following an upward straight trend (choice A). A common error is thinking a line is only reasonable if it passes through every point or that upward trends require a curve, but lines approximate linear trends even with scatter. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.
A scatter plot shows the relationship between the number of items bought at a school store ($x$) and the total cost in dollars ($y$). A line is drawn on the plot.
If about 80% of the points are within about $\pm 1$ dollar vertically of the line, what is the best description of the line’s fit?
Good fit, because points far from the line mean the line captures more variation.
Poor fit, because some points are not exactly on the line.
Poor fit, because the line should be drawn through the highest and lowest points only.
Good fit, because most points are close to the line (small residuals).
Explanation
Tests informally fitting straight line to scatter plot showing linear association and assessing fit by judging point closeness to line. Linear association (points trending along straight direction) modeled by straight line: fit line informally by drawing through middle of point cloud, balancing points above and below (roughly equal numbers each side). Assess fit: good fit has most points close to line (small vertical distances from points to line—predictions accurate), poor fit has points far from line (large vertical deviations—predictions less reliable). Line captures linear trend, allows prediction (use line to estimate y for new x). For example, in a scatter plot of items bought vs cost showing linear trend, assessing fit as good if 80% of points are within ±1 dollar vertically of the line. The best description is good fit because most points are close to the line with small residuals (choice A). A common error is calling it poor fit because not all points are exactly on the line or suggesting the line should only connect extremes. Fitting: (1) observe trend direction (upward→positive slope, downward→negative), (2) estimate center of points (where is middle of cloud?), (3) draw line through center following trend (balance points above/below), (4) verify reasonable (does line capture pattern? points fairly close?). Assessing: (1) observe vertical distances from points to line (how far off?), (2) count how many close (within 1-2 grid squares) vs far (5+ squares away), (3) judge: most close=good fit, many far=poor fit, (4) compare alternatives (if multiple lines, which has points closer on average?). Real data: rarely perfect (scatter means variability, line approximates), outliers exist (don't force line to hit outlier—fit majority), linear adequate if points roughly straight trend (even if imperfect). Mistakes: line through outlier ignoring majority, all points above or below (unbalanced), fit by horizontal distance (wrong—vertical distance for y-prediction matters), claiming poor fit is good or vice versa.