Quick start with R: aggregate() command (Part 10)

In Part 10, let’s look at the aggregate() command for creating summary tables using R. You may have a complex dataset that includes categorical variables of several levels, and you may wish to create summary tables for each level of the categorical variable. For example,…

Quick start with R: Sub-setting (Part 9)

In Part 9, let’s look at sub-setting in R. Let’s provide summary tables on the following data set of tourists from different countries, the numbers of their children, and the amount of money they spent while on vacation. Copy and paste the following array into…

Quick start with R: Basic commands (Part 8)

In Part 8, let’s look at some basic commands in R. Set up the following vectors by cutting and pasting from this document: a <- c(3,-7,-3,-9,3,-1,2,-12, -14) b <- c(3,7,-5, 1, 5,-6,-9,16, -8) d <- c(1,2,3,4,5,6,7,8,9) Now figure out what each of the following commands…

Using Sample Size Calculator application

The following video tutorial gives a brief overview of the Sample Size Calculator application. The Sample Size Calculator is an interactive Shiny application which allows you to calculate sample size when estimating population mean value or population proportion.   [su_youtube_advanced url=”https://www.youtube.com/watch?v=7juKRR-Kahs” rel=”no”]

Quick start with R: Basic plotting (Part 6)

In Part 6, let’s look at basic plotting in R. Try entering the following three commands together (the semi-colon allows you to place several commands on the same line). x <- seq(-4, 4, 0.2) ;  y <- 2*x^2 + 4*x – 7 plot(x, y) This…

Quick start with R: Exponential models (Part 5)

In Parts 3 and 4 we used the lm() command to perform least squares regressions. We saw how to check for non-linearity in our data by fitting polynomial models and checking whether they fit the data better than a linear model. Now let’s see how…

Quick start with R: More about regression (Part 3)

In Part 2 we created two variables and used the lm() command to perform a least squares regression on them, treating one of them as the dependent variable and the other as the independent variable. Here they are again. height = c(186, 165, 149, 206,…