Before we Start


  • Use Positron to write and execute R scripts
  • Use install.packages() to install packages (libraries)

Introduction to R


  • Access individual values by location using []
  • Access arbitrary sets of data using [c(...)]
  • Use logical operations and logical vectors to access subsets of data

Starting with Data


  • Use read_csv to read tabular data in R

Data Wrangling with dplyr and tidyr


  • Use the dplyr package to manipulate data frames
  • Use select() to choose variables from a data frame
  • Use filter() to choose data based on values
  • Use group_by() and summarize() to work with subsets of data
  • Use mutate() to create new variables
  • Use the tidyr package to change the layout of data frames
  • Use pivot_wider() to go from long to wide format
  • Use pivot_longer() to go from wide to long format

A couple of plots. And making our own functions


  • Boxplots are useful for visualizing the distribution of a continuous variable
  • Barplots are useful for visualizing categorical data
  • Functions allows you to repeat the same set of operations again and again
  • Loops allows you to apply the same function to lots of data
  • Logical tests allow you to apply different calculations on different sets of data

What is the next step?What should I do next?


  • Practice is important
  • Working on data that YOU find interesting is a really good idea
  • The amount of ressources online is immense
  • KUB Datalab is there for you