Exam (elaborations)

Machine Learning & Data Mining

Rating

Sold

Pages

Grade

A+

Uploaded on

18-06-2024

Written in

2023/2024

Machine Learning & Data Mining

Institution

Module

Whoops! We can’t load your doc right now. Try again or contact support.

Report Copyright Violation

Written for

Institution: Data Science
Study: Data Science
Module: Data Science

All documents for this subject (201)

Document information

Uploaded on: June 18, 2024
Number of pages: 7
Written in: 2023/2024
Type: Exam (elaborations)
Contains: Questions & answers

Subjects

machine learning data mining

Content preview

Machine Learning & Data Mining
What is Machine Learning? - correct answer-"Field of study that gives
computers the ability to learn without being explicitly programmed"
- Samuel, 1959

"Learning is changing behavior in a way that makes performance better in the
future"
- Witten & Frank 1999

"Improvement with experience at some task" and "A well-defined ML problem:
- improve over task T
- w/ regards to performance measure p
- based on experience E"
...Mitchell, 1997

Data Mining - - correct answer-Extract interesting knowledge from large
unstructured data-sets
* non-obvious, comprehensible, meaningful, useful

3 V's - correct answer-1.) Volume: terabytes and up.
2.) Velocity: from streaming data
3.) Variety: numeric, video, sensor, unstructured text...

Curse of Dimensionality - correct answer-The curse of dimensionality refers to
how certain learning algorithms may perform poorly in high-dimensional data.

First, it's very easy to overfit the the training data, since we can have a lot of
assumptions that describe the target label (in case of supervised learning). In
other words we can easily express the target using the dimensions that we
have.

Second,we may need to increase the number of training data exponentially, to
overcome the curse of dimensionality and that may not be feasible.

Third, in ML learning algorithms that depends on the distance, like k-means
for clustering or k nearest neighbors, everything can become far from each
others and it's difficult to interpret the distance between the data points.

, Entropy - correct answer-measure of uncertainty of a random variable
(acquisition of information corresponds to a reduction of entropy)

Information Gain - correct answer-of an attribute in Entropy from partitioning
the data according to that attribute

Noise - correct answer-Imprecise or incorrect attribute values or labels
- Can't always quantify it, but should know from situation if it is present
- E.g. labels may require subjective judgement or values may come from
imprecise measurements

Main symptom of over fitting - correct answer-Much better performance on the
training data than on independent test data

Key insights to kNN - correct answer-- Each sample can be considered to be a
point in sample space
- if two samples are close to each other in space, they should be close to each
other in their target values

lazy learning - correct answer-

Eager learning - correct answer-When given training data, construct model for
future use in prediction that summarises the data

- Analogy: compilation in programming language
- Slow in model construction, quicker in subsequent use
- Model itself may be useful/informative

Lazy Learning - correct answer-No explicit model constructed
- Calculations deferred until new case to be classified

Training Set Quality - MNAR - correct answer-When the missing values are
neither MCAR nor MAR. People w/ depression not reporting it.

Training Set Quality - MAR - correct answer-When missing data is not random
but can be totally related to a variable where there is complete information
Example - Men not reporting depression

Training Set Quality - MCAR - correct answer-The presence/absence of data
is completely independent of observable variables

£6.53

Get access to the full document:

100% satisfaction guarantee

Immediately available after payment

Both online and in PDF

No strings attached

Get to know the seller

topgradesdr

4.8

(288)

Get to know the seller

topgradesdr Jackson State University

View profile

Sold

1541

Member since

2 year

Number of followers

Documents

16577

Last sold

2 weeks ago

TOPGRADES DOCTOR

Hi there! I'm an experienced academic professional specializing in exam preparation, test banks, and assignments. Whether you're gearing up for a big test, looking for top-notch study guides, or need expertly crafted assignments, I've got you covered. My materials are: Accurate and Comprehensive: Designed to help you excel in your studies. Tailored to Your Needs: Covering various subjects with real exam-style questions and solutions. Time-Saving: Concise, easy-to-understand resources to help you study smarter. Let me help you achieve your academic goals with confidence!

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their exams and reviewed by others who've used these revision notes.

Didn't get what you expected? Choose another document

No problem! You can straightaway pick a different document that better suits what you're after.

Pay as you like, start learning straight away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

“Bought, downloaded, and smashed it. It really can be that simple.”

Alisha Student

Frequently asked questions

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

Satisfaction guarantee: how does it work?

Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.

Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller topgradesdr. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy these notes for £6.53. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews) 48586 documents were sold in the last 30 days Founded in 2010, the go-to place to buy revision notes and other study material for 16 years now

Machine Learning & Data Mining

Written for

Document information

Subjects

Content preview

Get to know the seller

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Didn't get what you expected? Choose another document

Pay as you like, start learning straight away

Frequently asked questions

What do I get when I buy this document?

Satisfaction guarantee: how does it work?

Who am I buying these notes from?

Will I be stuck with a subscription?

Can Stuvia be trusted?