Data scientists extract useful knowledge from a deluge of data, and they develop new tools to manage this task daily. These tools frequently rely on a programming language. Unfortunately, learning a programming language can be daunting for both new and experienced researchers. It relies on detailed knowledge of new software, grammatical and syntactical conventions, along with new data organization practices. It is easy to get lost in the sea of information on each of these topics. Fortunately, a popular new suite of tools has emerged within R called the tidyverse. Packages within the tidyverse share a similar design philosophy and data structure, which allows for the communication of complex statistical processes with only a handful of core functions. A common way to integrate data manipulation, visualization, and modeling with the tidyverse has led to an explosion of new R users. This workshop will introduce attendees to basic computer processes and file structures, useful core tidyverse packages called dplyr and ggplot2, and a new package for modeling called tidymodels. Integrating R and the tidyverse into research projects enhances reproducibility, supports long-term project management, and allows users to access up-to-the-minute statistical tools free of charge.
Participants must bring a laptop with R, RStudio, and specific packages downloaded and working correctly. Download instructions will be emailed to each participant before the workshop; please follow the instructions carefully to ensure you are prepared and can get the most out of our time together. Lunch is provided, and the deadline to place your lunch order for this workshop is BLANK. You may still register after this date, if there is space, but we will not be able to provide lunch for late registrants.