100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Exam (elaborations)

cse 6250 Questions with 100% Actual correct answer

Rating
-
Sold
-
Pages
6
Grade
A
Uploaded on
26-06-2024
Written in
2023/2024

cse 6250 Questions with 100% Actual correct answer

Institution
CSE
Module
CSE









Whoops! We can’t load your doc right now. Try again or contact support.

Document information

Uploaded on
June 26, 2024
Number of pages
6
Written in
2023/2024
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

Content preview

cse 6250
The 4 V's of Data - ANS-Volume (amount of data)
Variety
Velocity (real time data)
Veracity (noise, missing data, errors)

Predictive Modeling Pipeline - ANS-1. Prediction Target
2. Cohort Construction
3. Feature Construction
4. Feature Selection
5. Predictive Model
6. Performance Evaluation

New cases of heart failure that occurs each year in the US - ANS-550,000

Prospective vs Retrospective Studies - ANS-Prospective: Identify cohort -> collect data
Retrospective: Collect data -> identify cohort

Case patients - ANS-have the condition you're trying to predict

Mapreduce - ANS-It is:
- a programming model where the developer can specify parallel computation algorithms
- an execution environment (hadoop is the Java implementation of MapReduce and HDFS)
- a software package

It provides:
- Distributed storage
- Distributed computation
- Fault tolerance

Mapreduce system - ANS-has 2 components - mappers, and reducers

all the data with be partitioned and processed by multiple mappers (and it pre-aggregates the
data)

shuffle stage - mapper results are sent to the reducers

the reducers process the intermediate (mapper) results (ex. one reducer for heart disease,
another for cancer, etc.)

, Mapreduce fault recovery - ANS-if mapper 2 fails during execution of the mapreduce program,
then the mapreduce system will restart mapper 2 and go through the same workload again to
make sure it doesn't fail

(this same process happens for reducers)

Mapreduce KNN - ANS-Map()
Input:
- all points
- query point p

Output:
- k nearest neighbors

Emit the k closest points to p


Reduce() - goes through all the local nearest neighbors to identify the global nearest neighbors
to p
Input:
- key: null
- values: local neighbors
- query point p

Output:
- k nearest neighbots

Emit the k closest points to p among all local neighbors

Mapreduce linear regression - ANS-see notes

Limitations of MapReduce - ANS-MapReduce is not optimized for iteration and multi-stage
computation

- Logistic regression is hard to implement

Iterative batch gradient descent is hard to implement in MapReduce (it's not efficient)

MapReduce optimal setup - ANS-Single Pass (ex. computing histograms)

Uniformly-distributed keys (if it is skewed, then one reducer has to do almost all the jobs)

No synchronization needed (the only synchronization MapReduce has is between map and
reduce phase)

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
DoctorHkane Havard School
View profile
Follow You need to be logged in order to follow users or courses
Sold
732
Member since
4 year
Number of followers
168
Documents
22476
Last sold
1 week ago

Explore my Stuvia collection for essential study aids: test banks, exams, summaries, and cases. With five years of expertise as an academic writer, I have honed my skills in crafting top-notch essays, exams, and research dissertations. My proficiency lies in producing well-structured and thoroughly researched content that meets academic standards. I am adept at handling various subjects and ensuring a seamless flow of ideas. Whether it's delivering compelling arguments in essays, creating challenging yet fair exam questions, or delving into in-depth research for dissertations, my experience equips me to excel in diverse academic writing tasks. I pride myself on meeting deadlines and maintaining the highest quality in every piece I produce. REACH ON iamnjokikelvin1@gmail

Read more Read less
4.6

386 reviews

5
308
4
29
3
21
2
10
1
18

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their exams and reviewed by others who've used these revision notes.

Didn't get what you expected? Choose another document

No problem! You can straightaway pick a different document that better suits what you're after.

Pay as you like, start learning straight away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and smashed it. It really can be that simple.”

Alisha Student

Frequently asked questions