"

1.5 Interpreting and Presenting Data

Learning Objectives

By the end of this section, you will be able to:

  • Identify different types of data used by scientists.
  • Differentiate between qualitative and quantitative data.
  • Describe different ways data is presented.
  • Understand how to read and createtables, and line, bar, and scatter plot graphs.

Types of Data

There are different types of data that can be collected in an experiment. Typically, biologists try to design experiments that collect objective, quantitative data.

Objective data is fact-based, measurable, and observable. This means that if two people made the same measurement with the same tool, they would get the same answer. The measurement is determined by the object that is being measured. The length of a worm measured with a ruler is an objective measurement. The observation that a chemical reaction in a test tube changed color is an objective measurement. Both of these are observable facts.

Subjective data is based on opinions, points of view, or emotional judgment. Subjective data might give two different answers when collected by two different people. The measurement is determined by the subject who is doing the measuring. Surveying people about which of two chemicals smells worse is a subjective measurement. Grading the quality of a presentation is a subjective measurement. Rating your relative happiness on a scale of 1-5 is a subjective measurement. All of these depend on the person who is making the observation – someone else might make these measurements differently.

Quantitative measurements gather numerical data. For example, measuring a worm as being 5cm in length is a quantitative measurement.

Qualitative measurements describe a quality, rather than a numerical value. Saying that one worm is longer than another worm is a qualitative measurement.

Quantitative Qualitative
Objective The chemical reaction has produced 5cm of bubbles. The chemical reaction has produced a lot of bubbles.
Subjective I give the amount of bubbles a score of 7 on a scale of 1-10. I think the bubbles are pretty.

Example Experiment

An experiment might be conducted to test the hypothesis that phosphate limits the growth of algae in freshwater ponds. A series of artificial ponds are filled with water and half of them are treated by adding phosphate each week, while the other half are treated by adding a salt that is known not to be used by algae. The independent variable here is the phosphate (or lack of phosphate). The experimental or treatment cases are the ponds with added phosphate and the control ponds are those with the salt that is known to not be used by algae. Just adding something is also a control against the possibility that adding extra matter to the pond has an effect. If the treated ponds show lesser growth of algae, then we have found support for our hypothesis. If they do not, then we reject our hypothesis. Be aware that rejecting one hypothesis does not determine whether or not the other hypotheses can be accepted; it simply eliminates one hypothesis that is not valid. Using the scientific method, the hypotheses that are inconsistent with experimental data are rejected. How many times should you perform your test? How many samples should be in each test? The answer is “as many as is feasible”. For the purposes of educational laboratory experiences, that answer is typically around three times. However, if you were testing a new drug, you would need many more than three samples in order to show that the drug was safe and effective!

Presenting Data

After you have collected data in an experiment, you need to figure out the best way to present that data in a meaningful way. Depending on the type of data, and the story that you are trying to tell using that data, you may present your data in different ways.

Descriptive titles

All figures that present data should stand alone – this means that you should be able to interpret the information contained in the figure without referring to anything else (such as the methods section of the paper). This means that all figures should have a descriptive title that gives information about the independent and dependent variable. Another way to state this is that the title should describe what you are testing and what you are measuring. A good starting point to developing a title is “the effect of [the independent variable] on the [dependent variable].”

Here are some examples of good titles for figures:

  • The effect of exercise on heart rate
  • Growth rates of E. coli at different temperatures
  • The relationship between heat shock time and transformation efficiency

Here are a few less effective titles:

  • Heart rate and exercise
  • Graph of E. coli temperature growth
  • Table for experiment 1

Data Tables

The easiest way to organize data is by putting it into a data table. In most data tables, the independent variable (the variable that you are testing or changing on purpose) will be in the column to the left and the dependent variable(s) will be across the top of the table. You should use a data table while you are collecting your data and to display your data when the actual numerical values of the data are more important than the trends.

Be sure to:

  • Label each row and column so that the table can be interpreted
  • Include the units that are being used
  • Add a descriptive title for the table

Example

You are evaluating the effect of different types of fertilizers on plant growth. You plant 12 tomato plants in pots that are the same size and which contain the same type of soil. You divide the plants into three groups, where each group contains four plants. To the first group, you do not add fertilizer and the plants are watered with plain water. The second and third groups are watered with two different brands of fertilizer. After three weeks, you measure the growth of each plant in centimeters and calculate the average growth for each type of fertilizer.

The effect of different brands of fertilizer on tomato plant growth over three weeks
Treatment Plant Height (cm)
Plant 1 Plant 2 Plant 3 Plant 4 Average
No treatment 10 12 8 9 9.75
Brand A 15 16 14 12 14.25
Brand B 22 25 21 27 23.75

Graphing data

Graphs are used to display data because it is easier to see trends in the data when it is displayed visually compared to when it is displayed numerically in a table. Complicated data can often be displayed and interpreted more easily in a graph format than in a data table.

In a graph, the X-axis runs horizontally (side to side) and the Y-axis runs vertically (up and down). Typically, the independent variable will be shown on the X axis and the dependent variable will be shown on the Y axis (just like you learned in math class!).

Line Graph

Line graphs are the best type of graph to use when you are displaying a change in something over a continuous range. For example, you could use a line graph to display a change in temperature over time. Time is a continuous variable because it can have any value between two given measurements. It is measured along a continuum. Between 1 minute and 2 minutes are an infinite number of values, such as 1.1 minute or 1.93456 minutes.

line graph on left labeled right type of graph. bar graph on right labeled wrong type of graph
Figure 1.14. Change in something over time is being graphed. This data is continuous: there are points between the tested values – there is a data points that could have been tested between5 sec and 10 sec, for example. Because the data is continuous, a line graph is the correct type of graph to use.

When trying to decide if you should use a line graph, consider whether the experiment could have tested additional values that are between the values that were tested. Other than time, some other common continuous variables are temperature, pH, and concentration or amount (such as mL or concentration).

Since the variable on the X axis is continuous, you must label the values continuously rather than evenly spacing the data points provided (Figure 1). For example, if you collected data at minutes 0, 1, 5, 10, and 30, there should be 1 units-worth of space between 0 and 1 and 5 units worth of space between 5 and 10. You would not evenly space the values on the axis.

Fig. 1.15 The values on the X axis of the left-hand graph are spaced correctly. The values on the X axis of the right-hand graph are spaced incorrectly. Notice how much it changes the shape of the graph and therefore your interpretation of the data!

Changes in several different samples can be shown on the same graph by using lines that differ in color, symbol, etc.

Figure 1.16 Change in bubble height in centimeters over 120 seconds for three samples containing different amounts of enzyme. Sample A contained no enzyme, sample B contained 1mL of enzyme, sample C contained 2 mL of enzyme. This data should be shown with a line graph because there are values between the tested values. For example, data could have been collected at 20 seconds even though it wasn’t.

Bar Graph

Bar graphs are used to compare measurements between different groups. Bar graphs should be used when your data is not continuous, but rather is divided into different categories (Figure 4). If you counted the number of birds of different species, each species of bird would be its own category. There is no value between “robin” and “eagle”, so this data is not continuous.

Bar graph on left labeled right type of graph. line graph on right labeled wrong type of graph.
Figure 1.17 The effect of different substances on paramecium movement was tested. This data is categorical: there is nothing halfway between coffee and orange juice (coffee orange juice sounds disgusting). Because the data is categorical, a bar graph is the correct type of graph to use.

Scatter Plot

Scatter Plots are used to evaluate the relationship between two different continuous variables. These graphs compare changes in two different variables at once (Figure 5). For example, you could look at the relationship between height and weight. Both height and weight are continuous variables. You could not use a scatter plot to look at the relationship between number of children in a family and weight of each child because the number of children in a family is not a continuous variable: you can’t have 2.3 children in a family.

scatterplot of weight vs height
Fig. 1.18 The relationship between height (in meters) and weight (in kilograms) of members of the girls softball team. “OLS example weight vs height scatterplot” by Stpasha is in the Public Domain

How to make a graph

  1. Identify your independent and dependent variables.
  2. Choose the correct type of graph by determining whether each variable is continuous or not.
  3. Determine the values that are going to go on the X and Y axis. If the values are continuous, they need to be evenly spaced based on the value.
  4. Label the X and Y axis, including units.
  5. Graph your data.
  6. Add a descriptive caption to your graph. Note that data tables are titled above the figure and graphs are captioned below the figure.

Example

Let’s go back to the data from our fertilizer experiment and use it to make a graph. I’ve decided to graph only the average growth for the four plants because that is the most important piece of data. Including every single data point would make the graph very confusing.

  1. The independent variable is type of treatment and the dependent variable is plant growth (in cm).
  2. Type of treatment is not a continuous variable. There is no midpoint value between fertilizer brands (Brand A 1/2 doesn’t make sense). Plant growth is a continuous variable. It makes sense to sub-divide centimeters into smaller values. Since the independent variable is categorical and the dependent variable is continuous, this graph should be a bar graph.
  3. Plant growth (the dependent variable) should go on the Y axis and type of treatment (the independent variable) should go on the X axis.
  4. Notice that the values on the Y axis are continuous and evenly spaced. Each line represents an increase of 5cm.
  5. Notice that both the X and the Y axis have labels that include units (when required).
  6. Notice that the graph has a descriptive caption that allows the figure to stand alone without additional information given from the procedure: you know that this graph shows the average of the measurements taken from four tomato plants.
bar graph showing effect of fertilizer
Fig. 1.19 Average growth (in cm) of tomato plants when treated with different brands of fertilizer. There were four tomato plants in each group (n = 4).

Glossary

objective data: fact-based, measurable, and observable. This means that if two people made the same measurement with the same tool, they would get the same answer.

Qualitative measurements: describe a quality, rather than a numerical value.

Quantitative measurements: gather numerical data.

subjective data: based on opinions, points of view, or emotional judgment. Subjective data might give two different answers when collected by two different people.

falsifiable: able to be disproven by experimental results

hypothesis: a suggested explanation for an event, which can be tested

hypothesis-based science: a form of science that begins with a specific explanation that is then tested

scientific method: a method of research with defined steps that include experiments and careful observation

variable: a part of an experiment that can vary or change

References

Mt Hood Community College Biology 101 Copyright © 2016 by Lisa Bartee and Christine Anderson is licensed under a Creative Commons Attribution 4.0 International License, except where otherwise noted.

Media Attributions

  • nature5

License

Icon for the Creative Commons Attribution 4.0 International License

ABE 074: Biology Copyright © 2024 by Tacoma Community College is licensed under a Creative Commons Attribution 4.0 International License, except where otherwise noted.