Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4,6 TrustPilot
logo-home
Exam (elaborations)

CS-7643 Quiz 4 Exam – Deep Learning Optimization & Regularization Study Guide

Rating
5,0
(1)
Sold
4
Pages
9
Grade
A+
Uploaded on
06-11-2025
Written in
2025/2026

CS-7643 Quiz 4 Exam – Deep Learning Optimization & Regularization Study Guide

Institution
CS-7643
Course
CS-7643

Content preview

CS-7643 Quiz 4 Exam – Deep Learning Optimization & Regularization Study Guide




Embedding - (ANSWER)A learned map from entities to vectors that encodes similarity



Graph Embedding - (ANSWER)Optimize the objective that connected nodes have more similar
embeddings than unconnected nodes.



Task: convert nodes to vectors



- effectively unsupervised learning where nearest neighbors are similar

- these learned vectors are useful for downstream tasks



Multi-layer Perceptron (MLP) pain points for NLP - (ANSWER)- Cannot easily support variable-sized
sequences as inputs or outputs

- No inherent temporal structure

- No practical way of holding state

- The size of the network grows with the maximum allowed size of the input or output sequences



Truncated Backpropagation through time - (ANSWER)- Only backpropagate a RNN through T time steps



Recurrent Neural Networks (RNN) - (ANSWER)h(t) = activation(U*input + V*h(t-1) + bias)

y(t) = activation(W*h(t) + bias)



- activation is typically the logistic function or tanh

- outputs can also simply be h(t)

- family of NN architectures for modeling sequences



Training Vanilla RNN's difficulties - (ANSWER)- Vanishing gradients

- Since dx(t)/dx(t-1) = w^t

- if w > 1: exploding gradients

, CS-7643 Quiz 4 Exam – Deep Learning Optimization & Regularization Study Guide




- if w < 1: vanishing gradients



Long Short-Term Memory Network Gates and States - (ANSWER)- f(t) = forget gate

- i(t) = input gate

- u(t) = candidate update gate

- o(t) = output gate



- c(t) = cell state

- c(t) = f(t) * c(t - 1) + i(t) * u(t)



- h(t) = hidden state

- h(t) = o(t) * tanh(c(t))



Perplexity(s) - (ANSWER)= product( 1 / P(w(i) | w(i-1), ...) ) ^ (1 / N)

= b ^ (-1/N sum( log(b) (P(w(i) | w(i-1), ...) ) )

- note exponent of b is per word CE loss

- perplexity of a discrete uniform distribution over k events is k



Language Model Goal - (ANSWER)- estimate the probability of sequences of words

- p(s) = p(w1, w2, ..., wn)



Masked Language Modeling - (ANSWER)- pre-training task - an auxiliary task different from the final task
we're really interested in, but which can help us achieve better performance finding good initial
parameters for the model



- By pre-training on masked language modeling before training on our final task, it is usually possible to
obtain higher performance than by simply training on the final task



Knowledge Distillation to Reduce Model Sizes - (ANSWER)- Have fully parameterized teacher model

Written for

Institution
CS-7643
Course
CS-7643

Document information

Uploaded on
November 6, 2025
Number of pages
9
Written in
2025/2026
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

R375,38
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF


Document also available in package deal

Reviews from verified buyers

Showing all reviews
2 months ago

5,0

1 reviews

5
1
4
0
3
0
2
0
1
0
Trustworthy reviews on Stuvia

All reviews are made by real Stuvia users after verified purchases.

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
Belaire Teachme2-tutor
Follow You need to be logged in order to follow users or courses
Sold
2779
Member since
1 year
Number of followers
1
Documents
1353
Last sold
1 day ago

5,0

502 reviews

5
487
4
12
3
3
2
0
1
0

Trending documents

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their exams and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can immediately select a different document that better matches what you need.

Pay how you prefer, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card or EFT and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions