CS-7643 Quiz 4 Exam – Deep Learning Optimization & Regularization Study Guide
Embedding - (ANSWER)A learned map from entities to vectors that encodes similarity
Graph Embedding - (ANSWER)Optimize an objective so that connected nodes have more similar
embeddings than unconnected nodes.
Task: convert nodes to vectors
- effectively unsupervised learning where nearest neighbors are similar
- these learned vectors are useful for downstream tasks
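The objective above can be sketched with a toy gradient loop (an assumed setup for illustration, not a method from the card): pull the embeddings of connected node pairs together, and push sampled unconnected pairs apart up to a margin.

```python
import numpy as np

# Toy graph-embedding sketch: 4 nodes, 2-d embeddings.
# Edge lists and margin are illustrative assumptions.
rng = np.random.default_rng(0)
edges = [(0, 1), (1, 2)]               # connected pairs
non_edges = [(0, 3), (2, 3)]           # sampled unconnected pairs
emb = rng.normal(scale=0.1, size=(4, 2))

lr = 0.1
for _ in range(200):
    for a, b in edges:                 # pull connected nodes together
        diff = emb[a] - emb[b]
        emb[a] -= lr * diff
        emb[b] += lr * diff
    for a, b in non_edges:             # push unconnected nodes apart
        diff = emb[a] - emb[b]
        dist = np.linalg.norm(diff) + 1e-8
        if dist < 1.0:                 # margin: only push while too close
            emb[a] += lr * diff / dist
            emb[b] -= lr * diff / dist
```

After training, connected nodes end up closer together than unconnected ones, which is exactly the "nearest neighbors are similar" property the card describes.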
Multi-layer Perceptron (MLP) pain points for NLP - (ANSWER)- Cannot easily support variable-sized
sequences as inputs or outputs
- No inherent temporal structure
- No practical way of holding state
- The size of the network grows with the maximum allowed size of the input or output sequences
Truncated Backpropagation through time - (ANSWER)- Only backpropagate an RNN through the most recent T time steps, to bound memory and computation on long sequences
Recurrent Neural Networks (RNN) - (ANSWER)h(t) = activation(U*input + V*h(t-1) + bias)
y(t) = activation(W*h(t) + bias)
- activation is typically the logistic function or tanh
- outputs can also simply be h(t)
- family of NN architectures for modeling sequences
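The recurrence above can be written as a minimal numpy sketch (toy dimensions and random weights are assumptions; names U, V, W follow the card, with tanh as the activation and a linear readout for y):

```python
import numpy as np

# Minimal vanilla RNN step: h(t) = tanh(U x(t) + V h(t-1) + b_h),
# y(t) = W h(t) + b_y (outputs can also simply be h(t)).
rng = np.random.default_rng(0)
d_in, d_h, d_out = 3, 4, 2          # toy sizes (assumed)
U = rng.normal(size=(d_h, d_in))
V = rng.normal(size=(d_h, d_h))
W = rng.normal(size=(d_out, d_h))
b_h = np.zeros(d_h)
b_y = np.zeros(d_out)

def rnn_step(x_t, h_prev):
    h_t = np.tanh(U @ x_t + V @ h_prev + b_h)
    y_t = W @ h_t + b_y
    return h_t, y_t

h = np.zeros(d_h)                   # initial hidden state
for x_t in rng.normal(size=(5, d_in)):   # a length-5 toy sequence
    h, y = rnn_step(x_t, h)
```

Note how the same weights are reused at every time step, which is what lets the network handle variable-length sequences that an MLP cannot.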
Training Vanilla RNNs difficulties - (ANSWER)- Vanishing and exploding gradients
- Since the gradient through t steps scales like w^t (repeated multiplication by the recurrent weight w)
- if w > 1: exploding gradients
- if w < 1: vanishing gradients
Long Short-Term Memory Network Gates and States - (ANSWER)- f(t) = forget gate
- i(t) = input gate
- u(t) = candidate update gate
- o(t) = output gate
- c(t) = cell state
- c(t) = f(t) * c(t - 1) + i(t) * u(t)
- h(t) = hidden state
- h(t) = o(t) * tanh(c(t))
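The gate and state equations above can be sketched directly (the gate pre-activations are passed in as toy values here; a real cell computes them from x(t) and h(t-1) with learned weights, which are omitted):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One LSTM cell update, matching the card's equations:
#   c(t) = f(t) * c(t-1) + i(t) * u(t)
#   h(t) = o(t) * tanh(c(t))
def lstm_step(c_prev, f_pre, i_pre, u_pre, o_pre):
    f = sigmoid(f_pre)        # forget gate
    i = sigmoid(i_pre)        # input gate
    u = np.tanh(u_pre)        # candidate update
    o = sigmoid(o_pre)        # output gate
    c = f * c_prev + i * u    # new cell state
    h = o * np.tanh(c)        # new hidden state
    return c, h
```

With the forget gate saturated open (large f_pre) and the input gate closed (very negative i_pre), the cell state is carried through almost unchanged, which is how the LSTM avoids the vanishing-gradient problem of the vanilla RNN.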
Perplexity(s) - (ANSWER)= ( product over i of 1 / P(w(i) | w(i-1), ...) ) ^ (1/N)
= b ^ ( -(1/N) * sum over i of log_b P(w(i) | w(i-1), ...) )
- note: the exponent of b is the per-word cross-entropy loss
- perplexity of a discrete uniform distribution over k events is k
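The formula can be computed from per-word probabilities in a few lines (the probabilities here are a made-up toy sequence; natural log is used, so b = e):

```python
import math

# Perplexity = exp( -(1/N) * sum(log P(w_i | context)) );
# the exponent is the average per-word cross-entropy (in nats).
def perplexity(word_probs):
    n = len(word_probs)
    avg_nll = -sum(math.log(p) for p in word_probs) / n
    return math.exp(avg_nll)

# A uniform distribution over k = 5 events has perplexity 5:
print(perplexity([1 / 5] * 10))   # ≈ 5.0
```

This also makes the card's last point concrete: if the model assigns probability 1/k to every word, perplexity equals k regardless of sequence length.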
Language Model Goal - (ANSWER)- estimate the probability of sequences of words
- p(s) = p(w1, w2, ..., wn)
Masked Language Modeling - (ANSWER)- a pre-training task: an auxiliary task different from the final task
we're really interested in, but which helps us achieve better performance by finding good initial
parameters for the model
- By pre-training on masked language modeling before training on our final task, it is usually possible to
obtain higher performance than by simply training on the final task
Knowledge Distillation to Reduce Model Sizes - (ANSWER)- Train a fully parameterized teacher model, then train a smaller student model to match the teacher's (soft) output distribution