Summary

Summary Week 11

Pages
4
Uploaded on
15-07-2023
Written in
2022/2023

Continue to use the code in Chapter 4 to look at the relationship between predictors and the response variable. Answer these questions:
1. Which religious group has the highest proportion of unemployed?
2. What does the 'se' column stand for?
3. What does the function regexp_extract() do?
4. Which ethnicity has the highest proportion of unemployed?

Use the code provided to create a box plot of religion vs. unemployed. Answer the following questions:
5. What does the box plot show? Is there an outlier?

Using the code in the text, examine the relationship between drinking, drugs, and the response variable. Answer the following question:
6. Explain what the contingency table is showing.

Using the code in the text, create a mosaic chart from the contingency table. Answer the following question:
7. What drinking status correlates most strongly with what drug use status?
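As a rough illustration of what a contingency table of drinking vs. drug use contains, here is a minimal Python sketch. The responses, category labels, and the helper name `contingency` are invented for illustration only; they are not the assignment's data or the textbook's code.

```python
from collections import Counter

# Hypothetical toy responses as (drinking status, drug use status) pairs.
# These values are made up and do not come from the course dataset.
responses = [
    ("socially", "never"), ("socially", "never"), ("socially", "sometimes"),
    ("often", "sometimes"), ("often", "often"), ("not at all", "never"),
    ("not at all", "never"), ("socially", "never"),
]

def contingency(pairs):
    """Cross-tabulate pairs: one row per first value, one column per second."""
    counts = Counter(pairs)
    rows = sorted({r for r, _ in pairs})
    cols = sorted({c for _, c in pairs})
    table = [[counts[(r, c)] for c in cols] for r in rows]
    return rows, cols, table

rows, cols, table = contingency(responses)

# Print the table: each cell counts respondents with that combination.
print("drinks \\ drugs", cols)
for name, row in zip(rows, table):
    print(name, row)
```

Each cell is the number of respondents with that combination of answers. A mosaic chart draws the same counts as tiles whose areas are proportional to the cells, so large tiles mark combinations that occur together often.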

Institution
Big Data Tools & Architecture
Course
Big Data Tools & Architecture










Document information

File latest updated on
February 22, 2024

Content preview

Chapter 4: Exploratory Data Analysis
Check the relationship between predictors and the response variable.

1. Which religious group has the highest proportion of unemployed?
Islam has the highest proportion of individuals who are unemployed.

2. What does the 'se' column stand for?
The 'se' column stands for standard error: the standard error of the estimated proportion of unemployed in each religious group.

3. What does the function regexp_extract() do?
The function extracts the part of a string that matches a regular expression (a specified capture group). Here it pulls the religion group out of the religion field.
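The regexp_extract() and 'se' answers can be illustrated with a minimal Python sketch. It assumes that 'se' is the standard error of a proportion, sqrt(p(1 - p)/n), and that the regular expression keeps the leading word of the religion string. The records and the helper names `first_word` and `proportion_table` are hypothetical, and Python's `re` module stands in for the Spark function, so this is not the textbook's code.

```python
import math
import re
from collections import defaultdict

# Hypothetical toy records: (raw religion string, unemployed flag 0/1).
# Invented for illustration; not the course dataset.
records = [
    ("agnosticism and very serious about it", 0),
    ("agnosticism but not too serious about it", 1),
    ("atheism", 0),
    ("atheism and laughing about it", 0),
    ("christianity", 1),
    ("christianity and somewhat serious about it", 1),
    ("christianity but not too serious about it", 0),
    ("christianity", 0),
]

def first_word(text):
    """Return the leading run of word characters, or '' when nothing matches."""
    match = re.match(r"\w+", text)
    return match.group(0) if match else ""

def proportion_table(rows):
    """Group by extracted religion; compute count, proportion, standard error."""
    counts = defaultdict(lambda: [0, 0])  # group -> [n, number unemployed]
    for raw, flag in rows:
        group = first_word(raw)
        counts[group][0] += 1
        counts[group][1] += flag
    table = {}
    for group, (n, k) in counts.items():
        prop = k / n
        se = math.sqrt(prop * (1 - prop) / n)  # assumed meaning of 'se'
        table[group] = {"count": n, "prop": prop, "se": se}
    return table

table = proportion_table(records)
for group, row in sorted(table.items()):
    print(group, row["count"], round(row["prop"], 3), round(row["se"], 3))
```

A smaller group gives a larger standard error for the same proportion, which is why the 'se' column matters when comparing groups of very different sizes.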
