100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Exam (elaborations)

Georgia Tech ISYE 6501: Introduction to Analytics Modeling Professor: Dr. Joel Sokol Homework 3. 100% pass rate.

Rating
-
Sold
-
Pages
11
Grade
A+
Uploaded on
25-04-2023
Written in
2022/2023

Georgia Tech ISYE 6501: Introduction to Analytics Modeling Professor: Dr. Joel Sokol Homework 3. 100% pass rate. Document Content and Description Below ISYE 6501: Introduction to Analytics Modeling Professor: Dr. Joel Sokol Homework 3 31 January 2018Overview This week’s lesson involves data preparation, including outlier identification, handlin g outliers, and an introduction to change detection. Data preparation involves inspecting data visually for outliers and using a statistical test, Grubbs Test, to detect outliers in a univariate data set assumed to come from a normally distributed population. The null and alternative hypotheses are two mutually exclusive statements about a population. A hypothesis test uses sample data to determine whether to reject the null hypothesis. The null hypothesis states that all the data values come from the same normal distribution. The alternative hypothesis states that either the smallest or largest data value is an outlier.1 The CUMSUM test is used for change detection. CUSUM: St = max{0, St-1 + (xt – mu - C)} Is St >= T? Calculate metric St and declare an observed change when St goes above some threshold (T). At each time period, observe xt and see how far above the expectation it is (xt – mu) and add it to the previous period’s metric (St-1). Take the max of 0 and that value (essentially keep the value if it’s > 0), else reset running total to zero. Sometimes there are random values (up to 50% of time), so we include a value C to pull the running total down a little bit. The bigger the C, the harder it is for to St to get large and the LESS SENSITIVE the model is. The smaller the C, the more sensitive the model is since St can get larger faster. How do you choose good values for C and t so the model is finds changes quickly but isn’t too sensitive? Use data! Evaluate how costly the C and T boundaries are to your situation. Higher T = slower detection but less false detection changes. Lower T = faster detection but more likely to falsely detect changes. Question 5.1 – Crime Data Analysis Using crime data from

Show more Read less
Institution
ISYE 6501
Course
ISYE 6501









Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
ISYE 6501
Course
ISYE 6501

Document information

Uploaded on
April 25, 2023
Number of pages
11
Written in
2022/2023
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
Savior NCSU
View profile
Follow You need to be logged in order to follow users or courses
Sold
93
Member since
2 year
Number of followers
70
Documents
3434
Last sold
1 month ago

3.5

25 reviews

5
9
4
7
3
3
2
0
1
6

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions