These are the course notes for the Monash Data Fluency’s
“Programming and Tidy data analysis in R” course. This is an
intermediate level R workshop, for people wishing to increase their
fluency in R. It covers tasks that while not specifically involved in
statistical analysis are necessary when working with data: loading data
and getting it into a form that is easy to work with, automating
repetitive tasks, identifying problems such as the need for
normalization and transformation, and properly understanding the story
the data wants to tell.
This workshop is designed to work Posit Cloud. Monash users can log in to Posit Cloud with their Monash google account. You can also use R and RStudio on your own laptop.
Create a new project, either in Posit Cloud or RStudio on your own laptop, then run the R code below. This will download files and install packages used in this workshop. If using R on your own laptop, then to use FastQC you will need to have installed Java, and the FastQC steps will probably need some adjustments for Windows computers (this isn’t an essential part of the workshop).
# Download workshop files
download.file(
"https://monashdatafluency.github.io/r-progtidy/r-progtidy-files.zip",
"r-progtidy-files.zip")
unzip("r-progtidy-files.zip")
# Download FastQC software
download.file(
"https://www.bioinformatics.babraham.ac.uk/projects/fastqc/fastqc_v0.11.8.zip",
"fastqc_v0.11.8.zip")
unzip("fastqc_v0.11.8.zip")
system("chmod u+x FastQC/fastqc")
# Install tidyverse packages and rmarkdown
install.packages(c("tidyverse", "rmarkdown"))
# Optional: install some further packages
# (these are mentioned but not essential to the workshop)
install.packages(c("devtools", "usethis", "BiocManager"))
BiocManager::install("edgeR")
The workshop files downloaded by the setup instructions above are also available here: