List the datasets in dplyr.
Load the built-in dataset starwars and use glimpse() to see an overview.
Convert the built-in base R mtcars dataset to a tibble (you will need to find the function for this; it isn’t in the chapter), and store it in the object mt.
mt <- NULL
Using the data directory created by reprores::getdata() (or download the zip file, read “disgust_scores.csv” into a table.
disgust <- NULL
Override the default column specifications to skip the id column.
disgust_skip <- NULL
How many rows and columns are in the disgust dataset?
disgust_rows <- NULL
disgust_cols <- NULL
Load the data in “data/stroop.csv” as stroop1 and “data/stroop.xlsx” as stroop2.
stroop1 <- NULL
stroop2 <- NULL
Use glimpse() to figure out the difference between the two data tables and fix the problem.
stroop2b <- NULL
Create a tibble with the columns name, age, and country of origin for 2 people you know.
people <- NULL
Create a tibble that has the structure of the table below, using the minimum typing possible. (Hint: rep()). Store it in the variable my_tbl.
| ID | A | B | C |
|---|---|---|---|
| 1 | A1 | B1 | C1 |
| 2 | A1 | B2 | C1 |
| 3 | A1 | B1 | C1 |
| 4 | A1 | B2 | C1 |
| 5 | A2 | B1 | C1 |
| 6 | A2 | B2 | C1 |
| 7 | A2 | B1 | C1 |
| 8 | A2 | B2 | C1 |
my_tbl <- NULL
Set the following objects to the number 1 with the indicated data type:
one_int (integer)one_dbl (double)one_chr (character)one_int <- NULL
one_dbl <- NULL
one_chr <- NULL
Set the objects T_log, T_chr, T_int and T_dbl to logical, character, integer and double values that will all be equal to TRUE.
T_log <- NULL
T_chr <- NULL
T_int <- NULL
T_dbl <- NULL
Check your answers with this code:
# these should all evaluate to TRUE
tests <- list(
T_log_is_TRUE = T_log == TRUE,
T_chr_is_TRUE = T_chr == TRUE,
T_int_is_TRUE = T_int == TRUE,
T_dbl_is_TRUE = T_dbl == TRUE,
T_log_is_log = is.logical(T_log),
T_chr_is_chr = is.character(T_chr),
T_int_is_int = is.integer(T_int),
T_dbl_is_dbl = is.double(T_dbl)
)
str(tests) # this shows a condensed version of the list
## List of 8
## $ T_log_is_TRUE: logi(0)
## $ T_chr_is_TRUE: logi(0)
## $ T_int_is_TRUE: logi(0)
## $ T_dbl_is_TRUE: logi(0)
## $ T_log_is_log : logi FALSE
## $ T_chr_is_chr : logi FALSE
## $ T_int_is_int : logi FALSE
## $ T_dbl_is_dbl : logi FALSE
Create a vector of the numbers 3, 6, and 9.
threes <- NULL
The built-in vector letters contains the letters of the English alphabet. Use an indexing vector of integers to extract the letters that spell ‘cat’.
cat <- NULL
The function colors() returns all of the color names that R is aware of. What is the length of the vector returned by this function? (Use code to find the answer.)
col_length <- NULL
Create a named list called col_types where the name is each column in the built-in dataset table1 and the value is the column data type (e.g., “double”, “character”, “integer”, “logical”).
col_types <- NULL
Set the object x to the integers 1 to 100. Use vectorised operations to set y to x squared. Use plot(x, y) to visualise the relationship between these two numbers.
x <- NULL
y <- NULL
Set t to the numbers 0 to 100 in increments of 0.1. Set x to the sine of t and y to the cosine of t (you will need to find the functions for sine and cosine). Plot x against y.
t <- NULL
x <- NULL
y <- NULL
The function call runif(n, min, max) will draw n numbers from a uniform distribution from min to max. If you set n to 10000, min to 0 and max to 1, this simulates the p-values that you would get from 10000 experiments where the null hypothesis is true. Create the following objects:
pvals: 10000 simulated p-values using runif()is_sig: a logical vector that is TRUE if the corresponding element of pvals is less than .05, FALSE otherwisesig_vals: a vector of just the significant p-valuesprop_sig: the proportion of those p-values that were significantset.seed(8675309) # ensures you get the same random numbers each time you run this code chunk
pvals <- NULL
is_sig <- NULL
sig_vals <- NULL
prop_sig <- NULL