Teaching Datasets

Teaching dataset 1 - Jornada ANPP data

This dataset contains annual net primary production (NPP) data, measured in grams biomass per square meter, from the Jornada NPP study sites. There are 15 NPP study sites, in 5 different vegetation zones, across the Jornada Basin. At each site there are 49 1x1 meter quadrats where repeated measures of plant volume by species take place several each year. These volume data have been converted to biomass using an allometric method, and then to net primary production using the increment in biomass from one measurement to the next.

The annual data are in this EDI dataset:

Peters, D. and L. Huenneke. 2022. Annual mean estimates of aboveground net primary production (NPP) at 15 sites at Jornada Basin LTER, 1989-ongoing ver 105. Environmental Data Initiative. https://doi.org/10.6073/pasta/925c86b094c123fae4ddabf31cf92a30

First we will need to load the tidyverse library.

library(tidyverse)

Then, load a comma-separated value file from the EDI repository. At the EDI repository, each file in a dataset is assigned a download URL. This can be passed to the tidyverse::read_csv() function to read that file into a dataframe.

# Assign the address for the csv file hosted on EDI to a variable
infile <- "https://pasta.lternet.edu/package/data/eml/knb-lter-jrn/210011003/105/127124b0f04a1c71f34148e3d40a5c72"

# Read this file into a tibble with `tidyverse::read_csv()`
anpp.annual <- read_csv(infile, na=c('NA', '', '.'))

## Rows: 465 Columns: 4
## ── Column specification ─────────────────────────────────────────────────────────────────────
## Delimiter: ","
## chr (2): zone, site
## dbl (2): year, npp_g_m2
## 
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

R tells us that we have two character data columns (zone and site), and two floating point (dbl) columns that contain the year of measurement - year - and estimated annual NPP values (npp_g_m2). Lets make a simple plot to look at the data. To do this we are using the ggplot() function, which is part of the ggplot2 library that was attached when we loaded the tidyverse package`

ggplot(anpp.annual, aes(x = year, y = npp_g_m2, col = site, group = site)) +
  geom_line() +
  theme(axis.text.x = element_text(angle = 90, vjust = 0.5))

Now, lets output this into our data/ directory so that we can load it easily next time.

write_csv(anpp.annual, '../data/td01_anpp.annual.csv')

Teaching Datasets

Greg Maurer, Darren James

2022-06-19

Introduction

Teaching dataset 1 - Jornada ANPP data