100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.6 TrustPilot
logo-home
Exam (elaborations)

Machine Learning & Data Mining

Rating
-
Sold
-
Pages
7
Grade
A+
Uploaded on
18-06-2024
Written in
2023/2024

Machine Learning & Data Mining

Institution
Course









Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
June 18, 2024
Number of pages
7
Written in
2023/2024
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

Content preview

Machine Learning & Data Mining
What is Machine Learning? - correct answer-"Field of study that gives
computers the ability to learn without being explicitly programmed"
- Samuel, 1959

"Learning is changing behavior in a way that makes performance better in the
future"
- Witten & Frank 1999

"Improvement with experience at some task" and "A well-defined ML problem:
- improve over task T
- w/ regards to performance measure p
- based on experience E"
...Mitchell, 1997

Data Mining - - correct answer-Extract interesting knowledge from large
unstructured data-sets
* non-obvious, comprehensible, meaningful, useful

3 V's - correct answer-1.) Volume: terabytes and up.
2.) Velocity: from streaming data
3.) Variety: numeric, video, sensor, unstructured text...

Curse of Dimensionality - correct answer-The curse of dimensionality refers to
how certain learning algorithms may perform poorly in high-dimensional data.

First, it's very easy to overfit the the training data, since we can have a lot of
assumptions that describe the target label (in case of supervised learning). In
other words we can easily express the target using the dimensions that we
have.

Second,we may need to increase the number of training data exponentially, to
overcome the curse of dimensionality and that may not be feasible.

Third, in ML learning algorithms that depends on the distance, like k-means
for clustering or k nearest neighbors, everything can become far from each
others and it's difficult to interpret the distance between the data points.

, Entropy - correct answer-measure of uncertainty of a random variable
(acquisition of information corresponds to a reduction of entropy)

Information Gain - correct answer-of an attribute in Entropy from partitioning
the data according to that attribute

Noise - correct answer-Imprecise or incorrect attribute values or labels
- Can't always quantify it, but should know from situation if it is present
- E.g. labels may require subjective judgement or values may come from
imprecise measurements

Main symptom of over fitting - correct answer-Much better performance on the
training data than on independent test data

Key insights to kNN - correct answer-- Each sample can be considered to be a
point in sample space
- if two samples are close to each other in space, they should be close to each
other in their target values

lazy learning - correct answer-

Eager learning - correct answer-When given training data, construct model for
future use in prediction that summarises the data

- Analogy: compilation in programming language
- Slow in model construction, quicker in subsequent use
- Model itself may be useful/informative

Lazy Learning - correct answer-No explicit model constructed
- Calculations deferred until new case to be classified

Training Set Quality - MNAR - correct answer-When the missing values are
neither MCAR nor MAR. People w/ depression not reporting it.

Training Set Quality - MAR - correct answer-When missing data is not random
but can be totally related to a variable where there is complete information
Example - Men not reporting depression

Training Set Quality - MCAR - correct answer-The presence/absence of data
is completely independent of observable variables

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
topgradesdr Jackson State University
Follow You need to be logged in order to follow users or courses
Sold
1541
Member since
2 year
Number of followers
9
Documents
16577
Last sold
2 weeks ago
TOPGRADES DOCTOR

Hi there! I'm an experienced academic professional specializing in exam preparation, test banks, and assignments. Whether you're gearing up for a big test, looking for top-notch study guides, or need expertly crafted assignments, I've got you covered. My materials are: Accurate and Comprehensive: Designed to help you excel in your studies. Tailored to Your Needs: Covering various subjects with real exam-style questions and solutions. Time-Saving: Concise, easy-to-understand resources to help you study smarter. Let me help you achieve your academic goals with confidence!

Read more Read less
4.8

288 reviews

5
251
4
22
3
7
2
2
1
6

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions