Consider the following string. Which command would you use to replace the x with blank (whitespace)?
string <- c("169 millimeters x 117 millimeters x 9.1 millimeters")
A. chartr(string, x)B. chartr(string, "x", "~")C. chartr(string, old = "x", new=" ")D. chartr(string, "x", " - ")Consider the following string. Which command would you use to replace the x with blank (whitespace)?
string <- c("169 millimeters x 117 millimeters x 9.1 millimeters")
A. chartr(string, x)B. chartr(string, "x", "~")C. chartr(string, old = "x", new=" ")D. chartr(string, "x", " - ")
CORRECT ANSWER: C
What is the result of the following R code?
df1 <- c("VIC", "NSW", "TAS", "WA", "SA")df2 <- c("WA", "SA", "NSW", "TAS", "VIC")identical(df1, df2)
A. TRUEB. FALSEC. "WA", "SA", "NSW"D. "TAS", "VIC"What is the result of the following R code?
df1 <- c("VIC", "NSW", "TAS", "WA", "SA")df2 <- c("WA", "SA", "NSW", "TAS", "VIC")identical(df1, df2)
A. TRUEB. FALSEC. "WA", "SA", "NSW"D. "TAS", "VIC"
CORRECT ANSWER: B
Which one of the following is NOT one of the print functions?
A. cat()B. print()C. noquoteD. quoteWhich one of the following is NOT one of the print functions?
A. cat()B. print()C. noquoteD. quote
CORRECT ANSWER: D
Which one of the following removes all punctuations in the vector x?
x <- c("hello!", "good-day.", "hi 5:)")
A. str_subset(x, "[:alnum:]")B. str_extract(x, "[:alnum:]")C. str_remove(x, "[:punct:]")D. str_replace_all(x, "[:punct:]", "")Which one of the following removes all punctuations in the vector x?
x <- c("hello!", "good-day.", "hi 5:)")
A. str_subset(x, "[:alnum:]")B. str_extract(x, "[:alnum:]")C. str_remove(x, "[:punct:]")D. str_replace_all(x, "[:punct:]", "")
CORRECT ANSWER: D
According to the following code, what will be the result of y?
x <- "Now, I am HAPPY"y <- length(x)y
A. 4B. 1C. 2D. 5According to the following code, what will be the result of y?
x <- "Now, I am HAPPY"y <- length(x)y
A. 4B. 1C. 2D. 5
CORRECT ANSWER: B
Which one of the following functions from lubridate package will convert z into a date format?
z <- c("08.06.2018", "29062018", "23/03/2018", "30-01-2018")
A. ymd(z)B. dmy(z)C. ydm(z)D. hms(z)Which one of the following functions from lubridate package will convert z into a date format?
z <- c("08.06.2018", "29062018", "23/03/2018", "30-01-2018")
A. ymd(z)B. dmy(z)C. ydm(z)D. hms(z)
CORRECT ANSWER: B
In which one of the following, values are divided by their standard deviation (or root mean square)?
A. Box-Cox transformationB. logarithmic transformationC. z-score standardisationD. square root transformationIn which one of the following, values are divided by their standard deviation (or root mean square)?
A. Box-Cox transformationB. logarithmic transformationC. z-score standardisationD. square root transformation
CORRECT ANSWER: C
According to the following code, what will be the result of y?
minmaxnormalise <- function(x) {(x - min(x)) / (max(x) - min(x))}x <- c(5, 4, NA, 2, 5)y <- minmaxnormalise(x)y
A. 1.00 1.00 NA 1.00 1.00B. 1.00 0.67 NA 0.00 1.00C. NA NA NA NA NAD. 0.00 0.00 NA 1.00 1.00According to the following code, what will be the result of y?
minmaxnormalise <- function(x) {(x - min(x)) / (max(x) - min(x))}x <- c(5, 4, NA, 2, 5)y <- minmaxnormalise(x)y
A. 1.00 1.00 NA 1.00 1.00B. 1.00 0.67 NA 0.00 1.00C. NA NA NA NA NAD. 0.00 0.00 NA 1.00 1.00
CORRECT ANSWER: C
Which one of the following packages has a function to detect multivariate outliers?
A. library(dplyr)B. library(MVN)C. library(tidyr)D. library(validate)Which one of the following packages has a function to detect multivariate outliers?
A. library(dplyr)B. library(MVN)C. library(tidyr)D. library(validate)
Which of the following can be used to deal with outliers?
A. CappingB. TransformingC. ImputingD. All of themWhich of the following can be used to deal with outliers?
A. CappingB. TransformingC. ImputingD. All of them
Which one of the following is the reason for the error given below?
df <- data.frame(col1 = c(2, 0 / 0, NA, 1 / 0,-Inf, Inf), col2 = c(NA, Inf / 0, 2 / 0, NaN,-Inf, 4))is.infinite(df)
A. is.infinite() function accepts only vectorial input.B. there is no infinite value in the data frame.C. data frame has missing values.D. there is a division by zero problem in the data frame.Which one of the following is the reason for the error given below?
df <- data.frame(col1 = c(2, 0 / 0, NA, 1 / 0,-Inf, Inf), col2 = c(NA, Inf / 0, 2 / 0, NaN,-Inf, 4))is.infinite(df)
A. is.infinite() function accepts only vectorial input.B. there is no infinite value in the data frame.C. data frame has missing values.D. there is a division by zero problem in the data frame.
Consider the following data frame. What command would you use to find the total missing values in each column?
df <- data.frame(col1 = c(1:3, NA), col2 = c("this", NaN, "is", "text"), col3 = c(TRUE, FALSE, TRUE, TRUE), col4 = c(2.5, 4.2, 3.2, NA))
A. sum(is.na(df))B. is.na(df)C. is.nan(df)D. colSums(is.na(df))Consider the following data frame. What command would you use to find the total missing values in each column?
df <- data.frame(col1 = c(1:3, NA), col2 = c("this", NaN, "is", "text"), col3 = c(TRUE, FALSE, TRUE, TRUE), col4 = c(2.5, 4.2, 3.2, NA))
A. sum(is.na(df))B. is.na(df)C. is.nan(df)D. colSums(is.na(df))
According to the following code, what will be the result of y?
x <- c(1:3, NA, 5, NA)y <- which(is.na(x))y
A. 4 6B. TRUEC. FALSE FALSE FALSE TRUE FALSE TRUED. NAAccording to the following code, what will be the result of y?
x <- c(1:3, NA, 5, NA)y <- which(is.na(x))y
A. 4 6B. TRUEC. FALSE FALSE FALSE TRUE FALSE TRUED. NA
A relational database contains 2 data sets namely sales and employees.
The sales data set gives information about the each sale with an id followed by customer id and salesperson id with quantity of the item and payment type. Here is the sales data set:
sales
## # A tibble: 4 x 6## sales_id sales_person_id customer_id product_id quantity payment_type## <dbl> <chr> <dbl> <dbl> <dbl> <chr> ## 1 201 A1 1 102 2 Debit ## 2 202 B3 2 101 3 Credit ## 3 203 A1 3 101 1 Cash ## 4 204 A2 1 103 5 DebitThe employees data set allows you to look up the name and surname of the sales person using the sales person id. Here is the employees data set:
employees
## # A tibble: 6 x 3## sales_person_id first_name last_name## <chr> <chr> <chr> ## 1 A1 John Doe ## 2 A2 Jane Smith ## 3 A3 Micheal Brown ## 4 B1 Jim Johnson ## 5 B2 Karen Wilson ## 6 B3 Kate Tayloremployees connects to sales via the sales_person_id variable.sales
## # A tibble: 4 x 6## sales_id sales_person_id customer_id product_id quantity payment_type## <dbl> <chr> <dbl> <dbl> <dbl> <chr> ## 1 201 A1 1 102 2 Debit ## 2 202 B3 2 101 3 Credit ## 3 203 A1 3 101 1 Cash ## 4 204 A2 1 103 5 Debitemployees
## # A tibble: 6 x 3## sales_person_id first_name last_name## <chr> <chr> <chr> ## 1 A1 John Doe ## 2 A2 Jane Smith ## 3 A3 Micheal Brown ## 4 B1 Jim Johnson ## 5 B2 Karen Wilson ## 6 B3 Kate Taylor# Q16: How would you find the names of sales people who made a sale while dropping all the information in `sales` data set?employees %>% semi_join(sales)
## Joining, by = "sales_person_id"## # A tibble: 3 x 3## sales_person_id first_name last_name## <chr> <chr> <chr> ## 1 A1 John Doe ## 2 A2 Jane Smith ## 3 B3 Kate Taylor# Q17: How would you find the names of sales people who didn't make a sale?employees %>% anti_join(sales)
## Joining, by = "sales_person_id"## # A tibble: 3 x 3## sales_person_id first_name last_name## <chr> <chr> <chr> ## 1 A3 Micheal Brown ## 2 B1 Jim Johnson ## 3 B2 Karen WilsonAccording to the given information, how would you find the names of sales people (employees) who made a sale while dropping all the information in the sales data set?
A. anti_join(employees, sales)B. semi_join(employees, sales)C. union(employees, sales)D. bind_cols(employees, sales)According to the given information, how would you find the names of sales people (employees) who made a sale while dropping all the information in the sales data set?
A. anti_join(employees, sales)B. semi_join(employees, sales)C. union(employees, sales)D. bind_cols(employees, sales)
According to the given information, how would you find the names of sales people who didn't make a sale?
A. anti_join(employees, sales)B. semi_join(employees, sales)C. union(employees, sales)D. bind_cols(employees,sales)According to the given information, how would you find the names of sales people who didn't make a sale?
A. anti_join(employees, sales)B. semi_join(employees, sales)C. union(employees, sales)D. bind_cols(employees,sales)





Consider the id_lookup and ratings data sets, what would be the result of:
ratings %>% left_join(id_lookup)#ORleft_join(ratings, id_lookup)
A. Picture 1B. Picture 2C. Picture 3D. Picture 4Consider the id_lookup and ratings data sets, what would be the result of:
ratings %>% left_join(id_lookup)#ORleft_join(ratings, id_lookup)
A. Picture 1B. Picture 2C. Picture 3D. Picture 4
Consider the id_lookup and ratings data sets, what would be the result of:
id_lookup %>% anti_join(ratings)#ORanti_join(id_lookup, ratings)
A. Picture 1B. Picture 2C. Picture 3D. Picture 4Consider the id_lookup and ratings data sets, what would be the result of:
id_lookup %>% anti_join(ratings)#ORanti_join(id_lookup, ratings)
A. Picture 1B. Picture 2C. Picture 3D. Picture 4Which one of the following will order this data frame in an ascending order using col2 , col3 and col1 , respectively?
df <- data.frame(col1 = c(4, 3, 1), col2 = c(81, 12, 4), col3 = c(54, 22, 66))
A. df %>% select(col1, col2, col3)B. df %>% filter(col1, col2, col3)C. df %>% arrange(col1, col2, col3)D. df %>% arrange(col2, col3, col1)Which one of the following will order this data frame in an ascending order using col2 , col3 and col1 , respectively?
df <- data.frame(col1 = c(4, 3, 1), col2 = c(81, 12, 4), col3 = c(54, 22, 66))
A. df %>% select(col1, col2, col3)B. df %>% filter(col1, col2, col3)C. df %>% arrange(col1, col2, col3)D. df %>% arrange(col2, col3, col1)
According to the following code, what will be the class of df?
df <- data.frame(col1 = 1:3, col2 = c("this", "is", "text"), col3 = c(TRUE, FALSE, TRUE), col4 = c(25.5, 44.2, 54.9))df <- as.matrix(df)class(df)
A. listB. vectorC. matrixD. data.frameAccording to the following code, what will be the class of df?
df <- data.frame(col1 = 1:3, col2 = c("this", "is", "text"), col3 = c(TRUE, FALSE, TRUE), col4 = c(25.5, 44.2, 54.9))df <- as.matrix(df)class(df)
A. listB. vectorC. matrixD. data.frame
According to the following code, what will be the ordering of the levels for y?
y <- factor(c("low", "moderate", "low", "severe", "low", "high", "moderate", "severe"), levels = c("low" , "moderate", "high" , "severe"), ordered = TRUE) y
A. moderate < high < severe < lowB. low < severe < high < moderateC. low < moderate < high < severeD. severe < high < moderate < lowAccording to the following code, what will be the ordering of the levels for y?
y <- factor(c("low", "moderate", "low", "severe", "low", "high", "moderate", "severe"), levels = c("low" , "moderate", "high" , "severe"), ordered = TRUE) y
A. moderate < high < severe < lowB. low < severe < high < moderateC. low < moderate < high < severeD. severe < high < moderate < low
Keyboard shortcuts
| ↑, ←, Pg Up, k | Go to previous slide |
| ↓, →, Pg Dn, Space, j | Go to next slide |
| Home | Go to first slide |
| End | Go to last slide |
| Number + Return | Go to specific slide |
| b / m / f | Toggle blackout / mirrored / fullscreen mode |
| c | Clone slideshow |
| p | Toggle presenter mode |
| t | Restart the presentation timer |
| ?, h | Toggle this help |
| Esc | Back to slideshow |