Examen

ISYE 6501 –ACTUAL Midterm 1 and 2 EXAM- and Fina exam QUESTIONS ANSWERS & RATIONALES.

Puntuación

Vendido

Páginas

Grado

A+

Subido en

10-12-2025

Escrito en

2025/2026

ISYE 6501 –ACTUAL Midterm 1 and 2 EXAM- and Fina exam QUESTIONS ANSWERS & RATIONALES.

Institución

ISYE 6501

Grado

ISYE 6501

Ups! No podemos cargar tu documento ahora. Inténtalo de nuevo o contacta con soporte.

Informar violación de derechos de autor

Escuela, estudio y materia

Institución: ISYE 6501
Grado: ISYE 6501

Información del documento

Subido en: 10 de diciembre de 2025
Número de páginas: 64
Escrito en: 2025/2026
Tipo: Examen
Contiene: Preguntas y respuestas

Temas

isye 6501 actual midterm 1 and 2 exam and
isye 6501 actual midterm 1 and 2 exam and fina e
isye 6501 actual midterm 1 and 2 exam and fina e

Vista previa del contenido

ISYE 6501 –ACTUAL Midterm 1 and 2 EXAM- and
Fina exam QUESTIONS ANSWERS & RATIONALES.

ISYE 6501 - Midterm 1

A large value of K will lead to

a large variance in predictios

Setting a large value of k will ...

lead to a large model bias.

What are real effects?

Real relationships between attributes and responses. They are the same in all data sets,

What are random effects?

They are random but look like real effects. They are different in all data sets.

Why can't we measure a model's effectiveness on data it was trained on?

The model's performance on its training data is usually too optimistic, the model is fit to both real and
random pattenrs in the data, so it becomes overly specialized to the specific randomness in the training
set, that doesn't exist in other data.

If we use the same data to fit a model as we do to estimate how good it is, what is likely to happen?

The model will appear to be better than it really is.

The model will be fit to both real and random patterns in the data. The model's effectiveness on this
data set will include both types of patterns, but its true effectiveness on other data sets (with different
random patterns) will only include the real patterns

When comparing models, if we use the same data to pick the best model as we do to estimate how
good the best one is, what is likely to happen?

The model will appear to be better than it really is.

,The model with the highest measured performance is likely to be both good and lucky in its fit to
random patterns.

What is a training set used for

used to fit the models

What is a validation set used for?

used to choose best model

Why would we use two sets?

Reason to use two different sets is because if the first set, the training set, had unique random effects
that the classifer was designed for, we wouldn't be counting those benefits when we measure
effectiveness on the validation set.

What effects does randomness have on training /validation performance?

sometimes the randomness will make the performance look worse than it really is, and sometimes the
randomness will make the performance look better than it really is

how are high-performing models affected by randomness?

They are often boosted by above average random effects making it look better

what is a test data set used for?

to estimate performance of chosen model

When do we need a validation set?

When we are choosing between multiple models.

What are the data splits when working with one model?

70-90% training, 10-30% test

What are the data splits when comparing models?

50-70% training, split the rest between validation and test

What are two methods of splitting data?

random and roation

What is the rotation method of splitting data?

You take turns selecting points.
5 data point rotation sequence: (Training - Validation - Training - Test - Training

What is the advantage of rotation over randomness?

We make sure each part of the data is equally separated.

,What is the disadvantage of using rotation?

We have to make sure we aren't creating some other type of bias when we assign points.

what is k-fold cross validation?

split the training/validation data into k-parts; we train on k-1 parts and validate on the remaining part.

What metric do you use for k-fold cross validation when comparing models?

The average of all k evaluations.

What do we use when important data only appears in the validation or test sets?

cross-validation

What do we do after we've performed cross-validation?

We train the model again using all the data.

what are the benefits of k-fold cross validation?

better use of data, better estimate of model quality, and chooses model more effectively

What can clustering be used for?

grouping data points (e.g., market segmentation) and discovering groups in data points (e.g.,
personalized medicine

Which should we use most of the data for: training, validation, or test?

training

In k-fold cross-validation, how many times is each part of the data used for training, and for
validation?

k-1 times for training, and 1 time for validation

what is rectangular distance useful for?

calculating driving distance when the city is mapped in a grid

what is the value of p for euclidean distance

2

what is the general equation for p-norm distance

2-norm

Straight-line distance corresponds to which distance metric?

How do you find the distance of an infinity norm?

You find the largest | x_i - y_i |

, What is a centroid

the center of a cluster

What are the steps of k means?

0. Pick k clusters within range of data.
1. Assign each data point to nearest cluster center
2. Recalculate cluster centers (centroids)
3. Repeat 1 and 2 until no changes

How do we find the cluster centers?

We take the mean of all the data points in cluster.

Why is k-means an expectation-maximization

finding the mean of all the points in cluster is similar to finding an expectation.

Assigning data points to cluster centers is the maximization step. Really we are minimizing, but we could
think of it as maximizing the negative of the distance to a cluster center

What are some of the consequences of outliers in k-means?

It will drag the cluster center artificially to one side.

Because k-means is a heuristic and thus fast what can we do?

run it several times choosing different clusters centers and choose the best one and we can choose
different values of k

how does bias/variance change as k changes in KNN

the higher the k the higher the bias the lower the k the higher the variance. when K = 1 that is the most
complex model and thus likely to overfit the data.

How do we find the best value of k in k means?

Elbow method: we calculate the total distance of each data point to its cluster center and plot it in two
dimensions. We look for the kik in the graph.

When clustering for prediction how do we choose the prediction?

When we see a new point, we just choose whichever cluster center is closest.

What is the difference between classification and clustering?

With classification mdoels, we know each data point's attributes and we already know the right
classification for the data points (supervised). In clustering (unsupervised) we know the attributes but
we don't know what group any of these data points are in.

What is the difference between supervised learning and unsupervised learning?

$22.49

Accede al documento completo:

100% de satisfacción garantizada

Inmediatamente disponible después del pago

Tanto en línea como en PDF

No estas atado a nada

Conoce al vendedor

NursingTotur2

2.9

(12)

Documento también disponible en un lote

Conoce al vendedor

NursingTotur2 Walden University

Ver perfil

Seguir

Vendido

121

Miembro desde

8 meses

Número de seguidores

Documentos

2445

Última venta

23 horas hace

Teachme2 TUTOR

Hi! ,I'm Nurse Simeon , a certified TeachMe2 Totur with over 5 helping University and college students succeed. I am a Verified Nursing Tutor specializing in Ihuman Case Studies, Advanced pharmacology (NR565), HESI, TEAS 7, Pediatrics and More, creating HIGH QUALITY,EXAM FOCUSED STUDY GUIDES. Every document is crafted to be clear, accurate, and easy understanding saving you study time and improving your grades. Whatever you are preparing for Hesi A2, NCLEX or University coursework ,my notes are trusted by hundreds of students like you. ✅ Backed by toturing experience. ✅ Organized by topic and exam need. ✅ Instant access and affordable pricing. Let's help you pass smarter ,not harder. Browse my store now !

Lee mas Leer menos

2.9

12 reseñas

Recientemente visto por ti

Por qué los estudiantes eligen Stuvia

Creado por compañeros estudiantes, verificado por reseñas

Calidad en la que puedes confiar: escrito por estudiantes que aprobaron y evaluado por otros que han usado estos resúmenes.

¿No estás satisfecho? Elige otro documento

¡No te preocupes! Puedes elegir directamente otro documento que se ajuste mejor a lo que buscas.

Paga como quieras, empieza a estudiar al instante

Sin suscripción, sin compromisos. Paga como estés acostumbrado con tarjeta de crédito y descarga tu documento PDF inmediatamente.

“Comprado, descargado y aprobado. Así de fácil puede ser.”

Alisha Student

Preguntas frecuentes

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

100% de satisfacción garantizada: ¿Cómo funciona?

Nuestra garantía de satisfacción le asegura que siempre encontrará un documento de estudio a tu medida. Tu rellenas un formulario y nuestro equipo de atención al cliente se encarga del resto.

Who am I buying this summary from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller NursingTotur2. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy this summary for $22.49. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews) 45,681 summaries were sold in the last 30 days Founded in 2010, the go-to place to buy summaries for 15 years now