100% tevredenheidsgarantie Direct beschikbaar na je betaling Lees online óf als PDF Geen vaste maandelijkse kosten 4,6 TrustPilot
logo-home
Tentamen (uitwerkingen)

Data Mining Exam 2 Study Guide

Beoordeling
-
Verkocht
-
Pagina's
8
Cijfer
A+
Geüpload op
18-06-2024
Geschreven in
2023/2024

Data Mining Exam 2 Study Guide

Instelling
Vak









Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Geschreven voor

Instelling
Studie
Vak

Documentinformatie

Geüpload op
18 juni 2024
Aantal pagina's
8
Geschreven in
2023/2024
Type
Tentamen (uitwerkingen)
Bevat
Vragen en antwoorden

Onderwerpen

Voorbeeld van de inhoud

Data Mining Exam 2 Study Guide
X - correct answer-attribute, predictor, independent variable, input

y - correct answer-class, response, dependent variable, output

Classification - correct answer-predicts categorical labels

Prediction - correct answer-predicts continuous values

Decision Tree - correct answer-a non-parametric supervised learning
algorithm, which is utilized for both classification and regression tasks. It has a
hierarchical, tree structure, which consists of a root node, branches, internal
nodes and leaf nodes.

K-Nearest Neighbors - correct answer-A data mining method that predicts
(classifies or estimates) an observation i's outcome value based on the k
observations most similar to observation i with respect to the input variables.

Naive Bayes Classifier - correct answer-an algorithm that predicts the
probability of a certain outcome based on prior occurrences of related events

Support Vector Machine - correct answer-Supervised learning classification
tool that seeks a dividing hyperplane for any number of dimensions can be
used for regression or classification

Nueral Networks - correct answer-a method in artificial intelligence that
teaches computers to process data in a way that is inspired by the human
brain.

Decision Tree Hyperparameters - correct answer-Many. Includes
min_samples_leaf , min_samples_split , max_leaf_nodes , or
min_impurity_decrease

K-Nearest Neighbor Hyperparameters - correct answer-K-value and distance
function

Decision tree disadvantages - correct answer--Prone to outliers
-tree can grow to be very complex while training complex datasets

, K-Nearest Neighbor disadvantages - correct answer--K has to be wisely
selected
-Large computation cost during runtime if sample size is large

What are two variable selection criteria? - correct answer--Entropy and
Information Gain
-Gini Index

Pure when Entropy = - correct answer-0

Impure when Entropy = - correct answer-1

Entropy - correct answer-a measure of the disorder of a system or energy
unavailable to do work.

Why the minus in the Entropy formula - correct answer-Probabilities are
always between 0 and 1.
log(x) where x < 1 is negative
Each term in the sum is negative, so the result of the sum negative meaning
that the minus makes the result positive

Information Gain - correct answer-the amount of knowledge acquired during a
certain decision or action

Random forests - correct answer--for supervised machine learning, where
there is a labeled target variable
-used for solving regression (numeric target variable) and classification
(categorical target variable) problems
-an ensemble method, meaning they combine predictions from other models
-Each of the smaller models in the random forest ensemble is a decision tree

What is the best hyperplane? - correct answer-The one that maximizes
distance from the hyperplane to data points

Margin - correct answer-the distance between hyperplane and data points

What is the name for the points closest to the hyperplane - correct
answer-Support Vectors

Maak kennis met de verkoper

Seller avatar
De reputatie van een verkoper is gebaseerd op het aantal documenten dat iemand tegen betaling verkocht heeft en de beoordelingen die voor die items ontvangen zijn. Er zijn drie niveau’s te onderscheiden: brons, zilver en goud. Hoe beter de reputatie, hoe meer de kwaliteit van zijn of haar werk te vertrouwen is.
topgradesdr Jackson State University
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
1541
Lid sinds
2 jaar
Aantal volgers
9
Documenten
16577
Laatst verkocht
2 weken geleden
TOPGRADES DOCTOR

Hi there! I'm an experienced academic professional specializing in exam preparation, test banks, and assignments. Whether you're gearing up for a big test, looking for top-notch study guides, or need expertly crafted assignments, I've got you covered. My materials are: Accurate and Comprehensive: Designed to help you excel in your studies. Tailored to Your Needs: Covering various subjects with real exam-style questions and solutions. Time-Saving: Concise, easy-to-understand resources to help you study smarter. Let me help you achieve your academic goals with confidence!

Lees meer Lees minder
4,8

288 beoordelingen

5
251
4
22
3
7
2
2
1
6

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen