n.b. the lecture notes for the Scripting QC portion contain most of the piecewise code to build this exercise
Say you are setting up a network of bird surveyors from around the state of Minnesota. As data coordinator, you will be receiving files from all over the state, and be expected to produce a clean, consistent dataset from a multitude of submitted observations.
Using the cleaned ‘WMA-bird’ dataset as a model, you decide that the following column names and types should be standard:
Column name | Type |
---|---|
WMA | character |
date_sampled | date (YYYY-MM-DD) |
latin_name | character |
count_observed | integer |
The script should output a list of column names for the file, whether they match the standard names, and the type of data according to your input procedure.
count_observed
column in the standard data. The function should return: