"Tell me and I forget. Teach me and I remember. Involve me and I learn."
― Benjamin Franklin
"Tell me and I forget. Teach me and I remember. Involve me and I learn."
― Benjamin Franklin
With the internet, we are in a new age of data:
Jenny Bryan said: "Classroom data are like teddy bears and real data are like a grizzly bear with salmon blood dripping out its mouth."
Traditional Classroom Data | Real Data |
---|---|
Some attributes of real data:
Inconsistent formatting is a real pain:
dplyr
PackageTo take this, we now officially introduce the dplyr
package: a grammar of data manipulation
Were it not for this package, I probably wouldn't be taking a data-centric view to this course.
Why do I have a dplyr
sticker on my laptop? Why is dplyr
so good IMO?
function()
you use.Say hello to the FMV: the five main verbs
select()
columns by variable namefilter()
rows matching criteriamutate()
existing variables to create new onesarrange()
rowssummarise()
numerical variables that are group_by()
categorical variables_join()
two separate data frames by
corresponding variablesKeep looking back and forth between book and cheatsheet!
Re-read Chapter 5 and create five of your own problems using the gap
data frame you downloaded for Exam 1. Save this file as ps9.Rmd
in your LastnameFirstname
folder.
We will work through many of these problems in class on Monday.