Exam (elaborations)

SOA PA EXAM QUESTIONS AND CORRECT ANSWERS

Pages
25
Grade
A+
Uploaded on
21-03-2025
Written in
2024/2025

Institution
SOA
Course
SOA










Document information

Type
Exam (elaborations)
Contains
Questions & answers

Content preview

SOA PA EXAM QUESTIONS AND
CORRECT ANSWERS
What to examine when assessing the bivariate relationship between a Continuous predictor variable
and a Continuous target variable? ANSW✅✅Scatter plots. Correlation between the two variables
[cor() in R].
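
As a minimal sketch of that check, assuming the ggplot2 package is installed and using the built-in mtcars data as a stand-in (wt as the predictor and mpg as the target are illustrative choices, not part of the exam material), the scatter plot and correlation could be produced as:

```r
library(ggplot2)

# mtcars is a stand-in data set; wt and mpg are illustrative choices
df <- mtcars

# Scatter plot of the continuous predictor against the continuous target
p <- ggplot(df, aes(x = wt, y = mpg)) +
  geom_point() +
  labs(x = "wt (predictor)", y = "mpg (target)")
print(p)

# Pearson correlation between the two variables (cor() default)
r <- cor(df$wt, df$mpg)
print(r)
```

A value of r near -1 or 1 suggests a strong linear relationship, while the scatter plot helps reveal non-linear patterns that the correlation alone would miss.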



What to examine when assessing (univariate analysis) a Continuous predictor variable?
ANSW✅✅Assess the histogram of the distribution. Check the skewness (does the variable need a log
transformation?).

- Check for extreme (unreasonable) outliers

- Check for obvious errors in data

- Check for obvious duplicates
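
The checks above can be sketched in base R. This is only an illustration: the data are simulated, the names x and df are hypothetical, and the 1.5 * IQR fence is one common convention for flagging candidate outliers rather than a rule taken from the exam material.

```r
# Hypothetical right-skewed variable, simulated purely for illustration
set.seed(1)
x <- rexp(1000, rate = 1 / 100)

# Histogram of the distribution
hist(x, breaks = 30, main = "Distribution of x", xlab = "x")

# Sample skewness: third central moment divided by the cubed standard deviation
skewness <- mean((x - mean(x))^3) / sd(x)^3

# Flag points above the upper quartile + 1.5 * IQR as candidate outliers
upper_fence <- quantile(x, 0.75) + 1.5 * IQR(x)
n_high <- sum(x > upper_fence)

# Count exact duplicate rows in a small hypothetical data frame
df <- data.frame(id = c(1, 2, 2, 3), value = c(10, 20, 20, 30))
n_dupes <- sum(duplicated(df))
```

A clearly positive skewness supports considering a log transformation; flagged points still need a judgment call on whether they are unreasonable or simply large.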



What to examine when assessing (univariate analysis) a Factor predictor variable?
ANSW✅✅Assess a bar chart (count of observations per factor level).



What data questions should be considered while reading the project statement? ANSW✅✅Does the
project statement favor interpretable models or more accurate but more complex models?

What type of variable is the target variable?

What type of variable are the predictor variables?

Are there any outliers that need to be removed?

Are there any Factor variables that could be combined?



R-Code; Histogram Continuous Variable ANSW✅✅library(ggplot2)
# Histogram of a continuous variable with 30 bins
ggplot(df, aes(x = variable)) +
  geom_histogram(bins = 30) +
  labs(x = "variable")



R-Code; Bar chart for a factor variable ANSW✅✅library(ggplot2)
# Bar chart of the number of observations in each factor level
ggplot(df, aes(x = variable)) +
  geom_bar() +
  labs(x = "variable")

What to examine when assessing the bivariate relationship between a Factor predictor variable and
a binary target variable? ANSW✅✅A table (with rows as factor levels) showing the mean of the
target (the proportion of ones), the count of observations in each factor level, and the counts of
zeros and ones of the binary target within each level.



What to examine when assessing the bivariate relationship between a Continuous predictor variable
and a binary target variable? ANSW✅✅- Separate histograms of the continuous variable, one for
observations with binary target = 0 and one for observations with binary target = 1;

- Box plots summarized based on binary target;

- Tables summarizing the mean, median, and count of the predictor based on each binary target
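
A minimal base R sketch of the summary table and box plot, assuming the built-in mtcars data as a stand-in (am plays the binary target and mpg the continuous predictor; both are illustrative choices):

```r
# Stand-in data: am (0/1) plays the binary target, mpg the continuous predictor
df <- mtcars

# Mean, median, and count of the predictor within each target class
summary_tbl <- do.call(rbind, lapply(split(df$mpg, df$am), function(v) {
  data.frame(mean = mean(v), median = median(v), n = length(v))
}))
print(summary_tbl)

# Box plot of the predictor split by the binary target
boxplot(mpg ~ am, data = df, xlab = "am (target)", ylab = "mpg (predictor)")
```

If the means or medians differ noticeably between the two classes, the predictor is a plausible candidate for the model.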



What to examine when assessing the bivariate relationship between a Factor predictor variable and
a Continuous target variable? ANSW✅✅Box plots and tables summarizing the mean, median, and
count of the target for each factor level



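R-Code; Table for binary target and factor variable ANSW✅✅library(dplyr)  # provides %>%, group_by(), summarise()
data %>%
  group_by(variable) %>%
  summarise(
    zeros = sum(Target == 0),   # observations with target = 0 in this level
    ones = sum(Target == 1),    # observations with target = 1 in this level
    n = n(),                    # total observations in this level
    proportion = mean(Target)   # share of ones (Target must be numeric 0/1)
  )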



R-Code; Separate histograms for a continuous variable and a binary target ANSW✅✅library(ggplot2)
ggplot(
  data,
  aes(
    x = variable,
    group = Target,
    fill = as.factor(Target),
    y = after_stat(density)  # newer replacement for ..density.. (ggplot2 3.3.0 or later)
  )
) +
  geom_histogram(position = "dodge", bins = 30)



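R-Code; Relevel Factor variables ANSW✅✅# Make the most frequent level the reference (base) level
level.counts <- as.data.frame(table(df$variable))      # one row per level with its count
max.row <- which.max(level.counts[, 2])                # row of the most frequent level
level.name <- as.character(level.counts[max.row, 1])
df$variable <- relevel(df$variable, ref = level.name)  # df$variable must be a factor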



R-Code; Remove all observations in entire data set of a variable greater than or equal to 50
ANSW✅✅data <- data[data$variable < 50, ]



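R-Code; Remove all observations of a factor variable = "value" ANSW✅✅toBeRemoved <-
which(data$factor == "value")
# Guard: with no matches, data[-toBeRemoved, ] would return zero rows
if (length(toBeRemoved) > 0) data <- data[-toBeRemoved, ]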



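R-Code; Combine factor levels into new factors. ANSW✅✅var.levels <- levels(df$variable)
# mapvalues() comes from the plyr package; the new names must be listed in the same order as var.levels
df$variable_comb <- plyr::mapvalues(df$variable, var.levels, c("Group12", ... , "GroupNA"))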



R-Code; remove a variable from the dataframe. ANSW✅✅df$variable <- NULL



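R-Code; Create training and testing sets. ANSW✅✅library(caret)  # provides createDataPartition()
set.seed(n)  # n is any fixed integer, so the split is reproducible
train_ind <- createDataPartition(df$Target, p = 0.7, list = FALSE)  # 70% of rows, stratified on the target
data.train <- df[train_ind, ]
data.test <- df[-train_ind, ]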



What type of data calls for a log transformation? ANSW✅✅Right-skewed, positive data (common with
variables of time, distance, or money, which have a lower bound of 0)
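
To illustrate the effect, here is a small sketch on simulated data. The lognormal sample and the helper skew() are assumptions made for the example, not part of the exam material.

```r
set.seed(42)

# Simulated right-skewed, strictly positive values (e.g. monetary amounts)
x <- rlnorm(1000, meanlog = 8, sdlog = 1)

# Hypothetical helper: sample skewness
skew <- function(v) mean((v - mean(v))^3) / sd(v)^3

skew_raw <- skew(x)       # strongly positive for the raw values
skew_log <- skew(log(x))  # close to 0 after the transformation

# log() needs x > 0; log1p(x), i.e. log(1 + x), is one option when zeros occur
print(c(raw = skew_raw, logged = skew_log))
```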



What type of data calls for a Logit transformation? ANSW✅✅A binary (Boolean) target variable; the logit is applied to the probability that the target equals 1
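
As a brief sketch of what the logit does, the example below maps a few illustrative probabilities to log-odds and back using base R's qlogis() and plogis(); the values in p are arbitrary.

```r
# Illustrative probabilities that a binary target equals 1
p <- c(0.1, 0.5, 0.9)

# Logit by hand: the log of the odds p / (1 - p)
log_odds <- log(p / (1 - p))

# Base R provides the same transformation as qlogis()
stopifnot(isTRUE(all.equal(log_odds, qlogis(p))))

# The inverse logit, plogis(), maps log-odds back to probabilities
p_back <- plogis(log_odds)
```

The logit stretches the interval (0, 1) onto the whole real line, which is why a linear predictor can be fitted on that scale.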



Define Principal Component Analysis ANSW✅✅- An unsupervised learning technique which
linearly combines the initial variables in a data set to create new, mutually orthogonal (uncorrelated)
principal components, ordered by the share of variance they explain, which then can be used to assess
the correlation between the initial variables.
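
A minimal sketch with base R's prcomp(), assuming the four numeric columns of the built-in iris data as stand-in variables (the data set is an illustrative choice, not taken from the exam material):

```r
# Stand-in data: the four numeric columns of the built-in iris data set
X <- iris[, 1:4]

# Center and scale so that no variable dominates because of its units
pca <- prcomp(X, center = TRUE, scale. = TRUE)

# Proportion of total variance explained by each component
var_explained <- pca$sdev^2 / sum(pca$sdev^2)

# Loadings: the weights of the linear combinations that define each component
loadings <- pca$rotation

# Scores on different components are uncorrelated with one another
score_cor <- cor(pca$x)

print(round(var_explained, 3))
```

Variables with large loadings of the same sign on a component tend to move together, which is how the components help in assessing correlation among the original variables.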
