9.1 Measures of Central Tendency

Applications

Objectives

When you’ve finished this chapter, you’ll be able to accomplish the following:

  1. Differentiate among mean, median and mode.
  2. Select the best measurement for your data and purposes.
  3. Calculate and report mean, median, and mode for data in a spreadsheet.

As long as we have Bull Trout (Salvelinus confluentus) on our minds…

We’re sampling Bull Trout in the Bull River watershed. Let’s say we electrofish, collect, and measure 240 fish. We have total length in millimeters and weight in grams for each fish. What do we do with this data next? What would you want to know? What might biologists managing this species want to know?

 

The Bull River Guard Station was built in 1908 as the Ranger's house and office. This structure was a primary ranger station from 1908 to 1920, surviving the 1910 fire. The cabin was home to Granville "Granny" Gordon (District Ranger), his wife and three daughters. When the 1910 fires roared thru the country, Mrs. Gordon prepared for the worst by soaking gunny sacks in a tub of water. If they had to escape the fire, they would wrap themselves in the gunny sacks and race to the Bull River to wait out the fire. As the fire closed in on the ranger station, it shifted direction and swept up Pilik ridge sparing their home. The cabin is a two story building, containing 700 square feet.
When working as a fisheries technician, you may very well stay at a remote guard station close to your project site. This is the Bull River Guard Station. The Bull River Guard Station was built in 1908 as the Ranger’s house and office. This structure was a primary ranger station from 1908 to 1920, surviving the 1910 fire. The cabin was home to Granville “Granny” Gordon (District Ranger), his wife and three daughters. When the 1910 fires roared thru the country, Mrs. Gordon prepared for the worst by soaking gunny sacks in a tub of water. If they had to escape the fire, they would wrap themselves in the gunny sacks and race to the Bull River to wait out the fire. As the fire closed in on the ranger station, it shifted direction and swept up Pilik ridge sparing their home.

Applications

What do technicians in our fields do? Well – much of the time we are sampling from a larger population to make inferences about that population. We might be sampling from a population of Ponderosa pine (Pinus ponderosa), a population of Bull Trout (Salvelinus confluentus), or a population of streams within a watershed.

Understanding the relationship between a population and samples we take from it (and the mathematical symbols we use when working with samples) will help you navigate this short section on statistics.

 

Large Bull Trout in Montana showing typical light spotting on a darker body with large head and mouth.
Large Bull Trout in Montana showing typical light spotting on a darker body with large head and mouth.

Objectives

When you’ve finished this chapter, you’ll be able to describe the relationship between a population and samples taken from it and to use symbols to describe these.



We often describe data using a measure of central tendency. This is a number that we use to describe the typical data value. In this module, we will look at three measures of central tendency: the mean, the median, and the mode. Each of these has pros and cons, depending on the particular data set.

Mean

The mean of a set of data is what we commonly call the average: add up all of the numbers and then divide by how many numbers there were.

Exercises

  1. The table below shows the amount of time, rounded to the nearest half minute, it took Marty to complete the Friday crossword puzzle in the New York Times. Calculate the mean completion time for these thirteen puzzles.
    Oct 6, 2023 [latex][/latex]10.0\text{ min}[latex][/latex]
    Oct 13, 2023 [latex][/latex]13.0\text{ min}[latex][/latex]
    Oct 20, 2023 [latex][/latex]11.0\text{ min}[latex][/latex]
    Oct 27, 2023 [latex][/latex]9.0\text{ min}[latex][/latex]
    Nov 3, 2023 [latex][/latex]8.5\text{ min}[latex][/latex]
    Nov 10, 2023 [latex][/latex]9.5\text{ min}[latex][/latex]
    Nov 17, 2023 [latex][/latex]11.0\text{ min}[latex][/latex]
    Nov 24, 2023 [latex][/latex]12.0\text{ min}[latex][/latex]
    Dec 1, 2023 [latex][/latex]11.5\text{ min}[latex][/latex]
    Dec 8, 2023 [latex][/latex]9.5\text{ min}[latex][/latex]
    Dec 15, 2023 [latex][/latex]11.0\text{ min}[latex][/latex]
    Dec 22, 2023 [latex][/latex]11.0\text{ min}[latex][/latex]
    Dec 29, 2023 [latex][/latex]7.0\text{ min}[latex][/latex]
  2. The table below shows the average price of a gallon of regular unleaded gasoline in the Seattle metro area for ten weeks in late 2023.[1] Compute the mean price over this time period.
    Oct 23, 2023 [latex][/latex]\textdollar4.81[latex][/latex]
    Oct 30, 2023 [latex][/latex]\textdollar4.70[latex][/latex]
    Nov 6, 2023 [latex][/latex]\textdollar4.63[latex][/latex]
    Nov 13, 2023 [latex][/latex]\textdollar4.57[latex][/latex]
    Nov 20, 2023 [latex][/latex]\textdollar4.49[latex][/latex]
    Nov 27, 2023 [latex][/latex]\textdollar4.45[latex][/latex]
    Dec 4, 2023 [latex][/latex]\textdollar4.39[latex][/latex]
    Dec 11, 2023 [latex][/latex]\textdollar4.34[latex][/latex]
    Dec 18, 2023 [latex][/latex]\textdollar4.28[latex][/latex]
    Dec 25, 2023 [latex][/latex]\textdollar4.22[latex][/latex]

Median

The median is the middle number in a set of data; it has an equal number of data values below it as above it. The numbers must be arranged in order, usually smallest to largest but largest to smallest would also work. Then we can count in from both ends of the list and find the median in the middle.

If there are an odd number of data values, there will be one number in the middle, which is the median.

If there are an even number of data values, there will be two numbers in the middle. The mean of these two numbers is the median.

Exercises

  1. Here are Marty's Friday crossword puzzle completion times again, in minutes, listed in order from fastest to slowest. What is the median completion time? [latex][/latex]7.0, 8.5, 9.0, 9.5, 9.5, 10.0, 11.0, 11.0, 11.0, 11.0, 11.5, 12.0, 13.0[latex][/latex]
  2. Here are the Seattle gas prices again, listed in order from lowest to highest. What is the median price? [latex][/latex]\textdollar4.22[latex][/latex], [latex][/latex]\textdollar4.28[latex][/latex], [latex][/latex]\textdollar4.34[latex][/latex], [latex][/latex]\textdollar4.39[latex][/latex], [latex][/latex]\textdollar4.45[latex][/latex], [latex][/latex]\textdollar4.49[latex][/latex], [latex][/latex]\textdollar4.57[latex][/latex], [latex][/latex]\textdollar4.63[latex][/latex], [latex][/latex]\textdollar4.70[latex][/latex], [latex][/latex]\textdollar4.81[latex][/latex]

The five houses on a block have these property values: [latex][/latex]\textdollar250,000[latex][/latex]; [latex][/latex]\textdollar300,000[latex][/latex]; [latex][/latex]\textdollar320,000[latex][/latex]; [latex][/latex]\textdollar190,000[latex][/latex]; [latex][/latex]\textdollar220,000[latex][/latex].

  1. Find the mean property value.
  2. Find the median property value.

A new house is built on the block, making the property values [latex][/latex]\textdollar250,000[latex][/latex]; [latex][/latex]\textdollar300,000[latex][/latex]; [latex][/latex]\textdollar320,000[latex][/latex]; [latex][/latex]\textdollar190,000[latex][/latex]; [latex][/latex]\textdollar220,000[latex][/latex]; [latex][/latex]\textdollar750,000[latex][/latex].

  1. Find the mean property value.
  2. Find the median property value.
  3. Which of these measures appears to give a more accurate representation of the typical house on the block?

The mean is better to work with when we do more complicated statistical analysis, but it is sensitive to extreme values; in other words, one very large or very small number can have a significant effect on the mean. The median is not sensitive to extreme values, which can make it a better measure to use when describing data that has one or two numbers very different from the remainder of the data.

Mode

The mode is the value that appears most frequently in the data set. On the game show Family Feud, the goal is to guess the mode: the most popular answer.

If no numbers are repeated, then the data set has no mode. If there are two values that are tied for most frequently occurring, then they are both considered a mode and the data set is called bimodal. If there are more than two values tied for the lead, we usually say that there is no mode.[2] (It's like in sports: there is usually one MVP, but occasionally there are two co-MVPs. Having three or more MVPs would start to get ridiculous.)

Exercises

  1. Here are Marty's Friday crossword puzzle completion times one last time, in minutes. What is the mode of the completion times? [latex][/latex]7.0, 8.5, 9.0, 9.5, 9.5, 10.0, 11.0, 11.0, 11.0, 11.0, 11.5, 12.0, 13.0[latex][/latex]
  2. Here are the Seattle gas prices one last time. What is the mode of the prices? [latex][/latex]\textdollar4.22[latex][/latex], [latex][/latex]\textdollar4.28[latex][/latex], [latex][/latex]\textdollar4.34[latex][/latex], [latex][/latex]\textdollar4.39[latex][/latex], [latex][/latex]\textdollar4.45[latex][/latex], [latex][/latex]\textdollar4.49[latex][/latex], [latex][/latex]\textdollar4.57[latex][/latex], [latex][/latex]\textdollar4.63[latex][/latex], [latex][/latex]\textdollar4.70[latex][/latex], [latex][/latex]\textdollar4.81[latex][/latex]
  3. One hundred cell phone owners are asked which carrier they use. What is the mode of the data?
    AT&T Mobility Verizon Wireless T-Mobile US Dish Wireless U.S. Cellular
    [latex][/latex]43[latex][/latex] [latex][/latex]29[latex][/latex] [latex][/latex]24[latex][/latex] [latex][/latex]2[latex][/latex] [latex][/latex]2[latex][/latex]
  4. Fifty people are asked what their favorite type of Girl Scout cookie is. What is the mode?
    S’Mores Samoas Tagalongs Trefoils Thin Mints
    [latex][/latex]4[latex][/latex] [latex][/latex]16[latex][/latex] [latex][/latex]5[latex][/latex] [latex][/latex]9[latex][/latex] [latex][/latex]16[latex][/latex]

Let's put it all together and find the mean, median, and mode of some data sets. Sportsball!

Exercises

From 2001-2019, these are the numbers of games won by the New England Patriots each NFL season.[3]
[latex][/latex]11[latex][/latex], [latex][/latex]9[latex][/latex], [latex][/latex]14[latex][/latex], [latex][/latex]14[latex][/latex], [latex][/latex]10[latex][/latex], [latex][/latex]12[latex][/latex], [latex][/latex]16[latex][/latex], [latex][/latex]11[latex][/latex], [latex][/latex]10[latex][/latex], [latex][/latex]14[latex][/latex], [latex][/latex]13[latex][/latex], [latex][/latex]12[latex][/latex], [latex][/latex]12[latex][/latex], [latex][/latex]12[latex][/latex], [latex][/latex]12[latex][/latex], [latex][/latex]14[latex][/latex], [latex][/latex]13[latex][/latex], [latex][/latex]11[latex][/latex], [latex][/latex]12[latex][/latex].
  1. Find the mean number of games won from 2001 to 2019.
  2. Find the median number of games won from 2001 to 2019.
  3. Find the mode of the number of games won from 2001 to 2019.
  4. Do any of these measures appear to be misleading, or do they all represent the data fairly well?

From 2001-2019, these are the numbers of games won by the Buffalo Bills each NFL season.[4]
[latex][/latex]3[latex][/latex], [latex][/latex]8[latex][/latex], [latex][/latex]6[latex][/latex], [latex][/latex]9[latex][/latex], [latex][/latex]5[latex][/latex], [latex][/latex]7[latex][/latex], [latex][/latex]7[latex][/latex], [latex][/latex]7[latex][/latex], [latex][/latex]6[latex][/latex], [latex][/latex]4[latex][/latex], [latex][/latex]6[latex][/latex], [latex][/latex]6[latex][/latex], [latex][/latex]6[latex][/latex], [latex][/latex]9[latex][/latex], [latex][/latex]8[latex][/latex], [latex][/latex]7[latex][/latex], [latex][/latex]9[latex][/latex], [latex][/latex]6[latex][/latex], [latex][/latex]10[latex][/latex].

  1. Find the mean number of games won from 2001 to 2019.
  2. Find the median number of games won from 2001 to 2019.
  3. Find the mode of the number of games won from 2001 to 2019.
  4. Do any of these measures appear to be misleading, or do they all represent the data fairly well?

Some sets of data may not be easy to describe with one measure of central tendency.

Exercises

Thirteen clementines are weighed. Their masses, in grams, are
[latex][/latex]82[latex][/latex], [latex][/latex]90[latex][/latex], [latex][/latex]90[latex][/latex], [latex][/latex]92[latex][/latex], [latex][/latex]93[latex][/latex], [latex][/latex]94[latex][/latex], [latex][/latex]94[latex][/latex], [latex][/latex]102[latex][/latex], [latex][/latex]107[latex][/latex], [latex][/latex]107[latex][/latex], [latex][/latex]108[latex][/latex], [latex][/latex]109[latex][/latex], [latex][/latex]109[latex][/latex].

  1. Determine the mean. Does the mean appear to represent the mass of a typical clementine?
  2. Determine the median. Does the median appear to represent the mass of a typical clementine?
  3. Determine the mode. Does the mode appear to represent the mass of a typical clementine?

Suppose that the [latex][/latex]108[latex][/latex]-gram clementine is a tiny bit heavier and the masses are actually
[latex][/latex]82[latex][/latex], [latex][/latex]90[latex][/latex], [latex][/latex]90[latex][/latex], [latex][/latex]92[latex][/latex], [latex][/latex]93[latex][/latex], [latex][/latex]94[latex][/latex], [latex][/latex]94[latex][/latex], [latex][/latex]102[latex][/latex], [latex][/latex]107[latex][/latex], [latex][/latex]107[latex][/latex], [latex][/latex]109[latex][/latex], [latex][/latex]109[latex][/latex], [latex][/latex]109[latex][/latex].

  1. Determine the new mean. Is the new mean different from the original mean?
  2. Determine the new median. Is the new median different from the original median?
  3. Determine the new mode. Is the new mode different from the original mode? Does it represent the mass of a typical clementine?
Clementine the cat weighs more than 109 grams.

  1. Source: https://www.eia.gov/petroleum/gasdiesel/
  2. The concepts of trimodal and multimodal data exist, but we aren't going to consider anything beyond bimodal in this textbook.
  3. Source: https://www.pro-football-reference.com/teams/nwe/index.htm 
  4. Source: https://www.pro-football-reference.com/teams/buf/index.htm 

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Technical Math: Applications for the Environmental Sciences Copyright © by Marilyn Nielson and Morgan Chase is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.

Share This Book