Introduction to Data Analysis in R
2023-10-10
1 Welcome!
This site hosts the course curriculum for Colorado State University’s WR 696 course: Introduction to Data Analysis in R
Set up Instructions
As your first step, please follow the instructions on the R Setup page to make sure you have all the necessary software installed on your computer before the class starts.
Goals
The broad goal of this course is to introduce you to the skills necessary for a career that involves data science - namely manipulating data sets and performing statistical tests in R. The more specific goals of this course are to teach students to:
- Navigate the RStudio interface and create R Markdown documents for reproducible reporting;
- Utilize R packages and functions to manipulate and analyze data effectively and apply data wrangling techniques using the {tidyverse} framework;
- Differentiate between various data types and structures within R; and
- Explore comparative analyses, linear regression, and trend analysis techniques to reveal patterns in data.
Approach and Expectations
This class is flipped, meaning all materials for the week ahead must be reviewed before class. To encourage this we will have weekly quizzes on pre-class content each Monday before we dive into the assignment.
So without lectures in class what do we do together? We code! This class has almost four hours of contact time per week, and we design lessons so that you should be able to finish your assignments in class. The flipped class allows for deeper discussion about the common pitfalls of coding and allows for collaborative work time with your fellow classmates.
Generally we will do a quick live-code review of concepts from the assignment and the pre-class materials, but more than 1.5 hours per day will be dedicated to you coding and working on the assignment in class.
As such, coming to class is a vital part of how you can be successful and we fully expect you to be there every day.
We also will actively encourage a collaborative coding environment where students help each other and discuss the best approach to solving various coding problems. We also hope that outside of class, you will use our Teams channel to discuss code issues!
We will always send announcements with assignments, web links, and other updates through Canvas. The course syllabus will also be posted on Canvas.
Class Schedule
Week | Date (Monday) | Content |
1 | 8/22 | Before class: Primers Basics and R Markdown; download R and RStudio Introduction to R, RStudio, RMarkdown |
2 | 8/28 | Before class: Work with Data Exploratory data analysis |
3 | 9/4 | No Class Monday 9/4! Before class Wednesday: Visualize Data Data visualization |
4 | 9/11 | Before class: Tidy Your Data T-tests, ANOVA |
5 | 9/18 | Before class: Iterate Linear modelling pt. 1 |
6 | 9/25 | Before class: Write Functions Linear modelling pt. 2 |
7 | 10/2 | Modelling assumptions, power |
8 | 10/9 | PCA |
Additional introductory resources
If you are looking to learn even more outside of what this class offers, below are some great introductory R resources:
Stat 158 - Vectors, data frames, installing R, etc…
RStudio Materials - A series of videos, books, and more to get you started in R.
1.0.0.1 Tidyverse Introduction
R for Data Science - Covers all of the basic intro material, from a tidyverse perspective. As discussed, this is one way to find solutions in R, it happens to be my preferred way, but there are lots of Base R ways that work just fine!
Stat 159 - A CSU specific course for an intro to the tidyverse
R Markdown - The primary book for learning more about R Markdown and all of its quirks.
Cheatsheets - Short, clear documents that cover so much material from dplyr to shiny apps. Great for quick references. We find the
rmarkdown
andggplot2
ones particularly useful!