High Impact Tutoring Built By Math Experts
Personalized standards-aligned one-on-one math tutoring for schools and districts
Here you will learn about a box plot, including how to draw a box plot to represent a set of data, how to read data from a box plot, and how to interpret and compare box plots.
Students will first learn about box plots as part of statistics and probability in 6 th grade.
A box plot is a diagram showing the following information for a set of data:
This set of descriptive statistics is called the five-number summary. The box plot must be featured on a scale to show these values clearly.
Each box plot should be featured on a numerical scale.
The lower quartile can also be referred to as the 25 th percentile.
The median can also be referred to as the 50 th percentile
The upper quartile can also be referred to as the 75 th percentile.
Box plots are sometimes called box and whisker plots, with the ‘whiskers’ being the ends representing the lowest and highest values.
Box plots are particularly useful for data analysis when comparing two or more data sets; it is easy to make visual comparisons of average (median) and spread (range and interquartile range).
When data is skewed (the data distribution is not symmetrical or near-symmetrical), or there are many outliers or extreme values, a box plot provides better data visualization than other chart types or graphs.
In high school, you will learn about measures of skewness that use the quartiles, and how to identify different types of skewness visually on a box plot. If a box plot is perfectly symmetrical, the data could have a normal distribution.
Use this quiz to check your grade 6th to 7th students’ understanding of representing data. 10+ questions with answers covering a range of 6th and 7th grade representing data topics to identify areas of strength and support!
DOWNLOAD FREEUse this quiz to check your grade 6th to 7th students’ understanding of representing data. 10+ questions with answers covering a range of 6th and 7th grade representing data topics to identify areas of strength and support!
DOWNLOAD FREEHow does this relate to 6 th and 7 th grade math?
In order to draw a box plot:
Draw a box plot using the following information.
Minimum value | 10 |
Lower quartile (Q1) | 15 |
Median (Q2) | 21 |
Upper quartile (Q3) | 28 |
Maximum value | 35 |
In this example, all of the values are given.
2Draw a scale, and mark the five key values: minimum value, lower quartile \textbf{(LQ)} , median, upper quartile \textbf{(UQ)} , and maximum value.
The scale needs to be long enough to mark the lowest and highest values, so in this example, the scale could be 0 to 40.
Mark the five key values with vertical lines.
3Join the lower quartile and upper quartile to form the box, and draw horizontal lines to the minimum and maximum values.
The box runs from the lower quartile (15) to the upper quartile (28). The whiskers join to the box; the lower whisker ends at the minimum value (10) and the upper whisker ends at the maximum value (35).
Draw a box plot for the following data points.
1, \, 1, \, 2, \, 3, \, 5, \, 7, \, 7, \, 8, \, 10, \, 12, \, 15
Determine the median and quartiles.
The five-number summary for this box plot is therefore
Minimum value | 1 |
Lower quartile (Q1) | 2 |
Median (Q2) | 7 |
Upper quartile (Q3) | 10 |
Maximum value | 15 |
Draw a scale, and mark the five key values: minimum value, lower quartile \textbf{(LQ)} , median, upper quartile \textbf{(UQ)} , and maximum value.
The scale needs to be long enough to mark the lowest and highest values, so in this example, the scale could be 0 to 20.
Mark the five key values with vertical lines.
Join the lower quartile and upper quartile to form the box, and draw horizontal lines to the minimum and maximum values.
The box runs from the lower quartile (2) to the upper quartile (10). The whiskers join to the box; the lower whisker ends at the minimum value (1) and the upper whisker ends at the maximum value (15).
This table shows some descriptive statistics for a set of data.
Minimum value | 4 |
Lower quartile (Q1) | 13 |
Median (Q2) | 18 |
Interquartile range (IQR) | 10 |
Range (R) | 23 |
Determine the median and quartiles.
In this problem, you are given the IQR and range, and you need to find the upper quartile and maximum value.
U Q=L Q+I Q R \text {, so } U Q=13+10=23
Similarly, \text{range }=\text{highest value}-\text{lowest value}, so
\text{lowest value}+\text{range}=\text{highest value}
So you do 4+23=27; the highest value is 27.
The key values for the box plot are as follows.
Minimum value | 4 |
Lower quartile (Q1) | 13 |
Median (Q2) | 18 |
Upper quartile (Q3) | 23 |
Maximum value | 27 |
Draw a scale, and mark the five key values: minimum value, lower quartile \textbf{(LQ)} , median, upper quartile \textbf{(UQ)} , and maximum value.
The scale needs to be long enough to mark the lowest and highest values, so in this example, the scale could be 0 to 30.
Mark the five key values with vertical lines.
Join the lower quartile and upper quartile to form the box, and draw horizontal lines to the minimum and maximum values.
The box runs from the lower quartile (13) to the upper quartile (23). The whiskers join to the box; the lower whisker ends at the minimum value (4) and the upper whisker ends at the maximum value (27).
It is important to be able to read key information from a box plot, and also to compare data distributions of two box plots.
When comparing two box plots, you should make a comment about:
The comparison must be put into context of the question.
For example,
Box plot A shows the length of words in a book for a 5 year old child.
Box plot B shows the length of words in a book for an 8 year old child.
If the median is higher for box plot B, the contextual solution would be:
The median word length is longer in book B than in book A.
Or
The median word length is lower in book A than in book B.
When describing the spread of the data, if the interquartile range of the data is a larger value for book B than book A, the contextual solution would be:
The word lengths in book B are more spread out than in book A.
Or
The word lengths in book A are more concise than in book B.
In order to read a box plot:
This table shows some descriptive statistics for a set of data.
Minimum value | 5 |
Lower quartile (Q1) | |
Median (Q2) | 16 |
Upper quartile (Q3) | |
Maximum value |
This box plot represents the same set of data.
Use the box plot to fill in the missing information in the table.
Identify the five-number summary values.
The minimum value and median have already been provided.
The lower quartile is the vertical line on the box plot at the lower end (left end) of the box. Reading the data value off the x -axis, the lower quartile is 10.
The upper quartile is the vertical line at the upper end (right end) of the box. Reading the data value off the x -axis, the upper quartile is 23.
The upper whisker ends at the highest value; this is 25.
The completed table is therefore
Minimum value | 5 |
Lower quartile (Q1) | 10 |
Median (Q2) | 16 |
Upper quartile (Q3) | 23 |
Maximum value | 25 |
Calculate the interquartile range.
The interquartile range is the difference between the upper quartile and the lower quartile.
IQR = 23-10 = 13.
In order to compare two box plots:
Two classes of students took the same math test. The two box plots below show a summary of their results rounded to the nearest whole number.
Class A
Class B
Compare these two sets of data.
Compare the median and the five-number summaries.
The median for Class A is 14. In Class B, \, 14 is before Q1. This means that at least 75 \% of the data in Class B is greater than the median for Class A.
The median for Class B is 20. In Class A, \, 20 is after Q3. This means that between 25 \%-50 \% of the data in Class B is larger than any data point in Class A.
Compare the \textbf{IQR} (or range) to comment on the spread or consistency.
The interquartile range (IQR) for Class A is 6. The overall range for Class A is 16.
The IQR for Class B is 12. The overall range for Class B is 24.
In both values, Class A has less range. You can say that, ‘Class A was more consistent than Class B’. This just means that scores for Class A were more closely bunched together, whereas the scores for Class B were more spread out.
Alternatively, you could state ‘Class B had a larger interquartile range and overall range than Class A.’
Class A (see Example 5 ) also took an English test. Their scores are summarized below. Each test score is rounded to the nearest whole number. Compare the data distributions of scores on the math and English tests.
English
Minimum value | 10 |
Lower quartile (Q1) | 20 |
Median (Q2) | 24 |
Upper quartile (Q3) | 27 |
Maximum value | 30 |
Math
Before you can compare the box plots, you need to convert the data into the same format.
If you read the key data value from the math box plot, we get the following five-number summary for math scores:
Minimum value | 7 |
Lower quartile (Q1) | 10 |
Median (Q2) | 14 |
Upper quartile (Q3) | 16 |
Maximum value | 23 |
Compare the medians and five-number summaries.
The median value for English is 24. In Math, 24 over the maximum value. This means that at least 50 \% of the English scores were greater than any of the Math scores.
The median value for math is 14. In English, this is lower than the Q1 score. This means that at least 50 \% of the Math scores are below the bottom 25 \% of the English scores.
Remember, your answer must compare the two values; it is not sufficient to just state the two medians. You also need to give context, so in this example, refer to both of the subjects and scores on the tests rather than just ‘English has a smaller median.’
Compare the \textbf{IQR} (or range) to comment on the spread or consistency.
The IQR in English is 7 and the overall range is 20 and the IQR in math is 6 and the overall range is 16.
You could say that, ‘there was similar variability of scores in math and English.’ There was a similar bunching together of the middle 50 \% of the data and the overall data.
Remember, your answer must compare the two values; it is not sufficient to just state the two IQRs or ranges. You also need to give context, so in this example, refer to both of the subjects and marks scored on the tests rather than just ‘the IQRs are the same.’
Note that you could also answer this question by drawing a box plot of the English data – and it’s a very common type of assessment question to be asked to draw a box plot in the first part of a question, then compare this box plot with another in the second part of the question. Whichever method is required, the analysis is the same.
1. Draw a box plot to show this five-number summary.
The ends of the whiskers are plotted at the minimum and maximum values. Draw lines for the QI, median (Q2), and Q3, and connect these to form the box.
2. Draw a box plot to show this set of data.
The data set is in order and so the five key values for the box plot are as follows:
The ends of the whiskers are plotted at the minimum and maximum values. Draw lines for the Q1, median (Q2), and Q3, and connect these to form the box. Draw the line from the lowest value to the lower quartile, and from the upper quartile to the highest value.
3. Look at this box plot.
Which could be the value of the lower quartile?
The lower quartile is the lower end of the box; it is somewhere between 10 and 12 so the value could be 11.
4. Look at this box plot.
What is the value of the median?
The median is the vertical line running through the middle of the box between the lower quartile and the upper quartile; the value is 30.
5. Look at these two box plots and choose the incorrect statement:
A:
B:
A and B have the same median.
A and B have the same range.
The maximum value of B is larger than the maximum value of A.
The interquartile range of A is smaller than the interquartile range of B.
The IQR of A is 40-24=16.
The IQR of B is 36-16=10.
So the IQR of A is larger than the IQR of B.
6. The two box plots show the English and math test results of a class of high school students. Choose the statement below that is incorrect.
English
Math
Up to 25 \% of the math scores were higher than the highest English score.
The highest score in English was greater than the highest score in math.
The lowest score in English was less than the lowest score in math.
There was similar variability in scores in English and math.
The highest score in English was 52.
The highest score in Math was 56.
Therefore, the highest score in English was less than the highest score in math, and the statement is incorrect.
A box plot is a standardized way to represent a dataset that summarizes the key features based on the five-number summary: the minimum value, the first quartile, the median, the third quartile, and the maximum value.
Other names for a box plot are box and whisker plot and box and whisker diagram.
A box plot displays summary statistics of a dataset, such as the median and quartiles, while a violin plot combines a box plot with a density plot to show the data distribution more comprehensively.
At Third Space Learning, we specialize in helping teachers and school leaders to provide personalized math support for more of their students through high-quality, online one-on-one math tutoring delivered by subject experts.
Each week, our tutors support thousands of students who are at risk of not meeting their grade-level expectations, and help accelerate their progress and boost their confidence.
Find out how we can help your students achieve success with our math tutoring programs.
Prepare for math tests in your state with these 3rd Grade to 8th Grade practice assessments for Common Core and state equivalents.
Get your 6 multiple choice practice tests with detailed answers to support test prep, created by US math teachers for US math teachers!