
Optimization Algorithms – SGD, Momentum and RMSprop Study Guide, Machine Learning Class Notes, Exam Preparation Material

Pages
3
Uploaded on
25-12-2025
Written in
2025/2026

This document is a structured study guide on optimization algorithms used in machine learning, focusing on Standard SGD, SGD with Momentum, and RMSprop. It explains core concepts, mechanisms, common pitfalls, and exam-style questions using clear analogies and mathematical intuition. The material is suitable for coursework revision, conceptual understanding, and exam preparation in optimization and deep learning topics.


Document information

Type
Class notes
Professor(s)
Fei-Fei Li, Ehsan Adeli
Contains
Optimization algorithms: SGD to RMSprop

Content preview

Optimization Algorithms: SGD,
Momentum & RMSprop Study Guide
Topic Overview
This study guide compares three major optimization algorithms (Standard SGD, SGD with
Momentum, and RMSprop) using a "hiking down a ravine" analogy to explain their
mechanics. It focuses on how RMSprop uses adaptive learning rates to solve the "ravine
problem" by adjusting each parameter's step size according to how volatile its recent
gradients have been.




Core Concepts
● Standard SGD (Stochastic Gradient Descent): A basic optimization method that
applies one fixed learning rate to every parameter regardless of the terrain, so each
step is the learning rate times the current mini-batch gradient. It is analogous to hiking
blindfolded; on steep slopes, you risk tumbling (overshooting), while on flat ground,
progress is painfully slow.

● SGD with Momentum: An enhancement to SGD that accumulates velocity (a decayed
running sum of past gradients) to move faster, similar to running down a hill. While it
helps gain speed, the momentum can cause the algorithm to overshoot the track during
sharp turns.

● RMSprop: An algorithm that uses adaptive learning rates to adjust the step
size independently for each parameter. It acts like "smart shoes" that analyze the
"bumpiness" of recent terrain to automatically adjust your stride.

● Ravine: A loss-landscape shape in machine learning characterized by steep
walls and a nearly flat bottom. Without adaptive methods, algorithms tend to bounce back
and forth between the walls rather than moving down the center.
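
The SGD and momentum updates described above can be sketched in a few lines of Python. This is a minimal illustration, not the guide's own code: the quadratic "ravine" loss, the function names (ravine_grad, sgd_step, momentum_step, run_sgd) and all hyperparameter values are assumptions chosen for the example, and the momentum form shown is one common convention.

```python
import numpy as np

def ravine_grad(w):
    """Gradient of the toy ravine loss f(w) = 0.5 * (100 * w[0]**2 + w[1]**2)."""
    # w[0] runs across the ravine (steep walls); w[1] runs along the flat bottom.
    return np.array([100.0 * w[0], 1.0 * w[1]])

def sgd_step(w, grad, lr):
    # Standard SGD: one fixed learning rate shared by every parameter.
    return w - lr * grad

def momentum_step(w, v, grad, lr, beta=0.9):
    # Velocity is a decayed running sum of past gradients (assumed convention).
    v = beta * v + grad
    return w - lr * v, v

def run_sgd(lr, steps=50):
    # Start on the ravine wall at (1, 1) and take `steps` plain SGD updates.
    w = np.array([1.0, 1.0])
    for _ in range(steps):
        w = sgd_step(w, ravine_grad(w), lr)
    return w
```

On this toy loss, a learning rate just below 0.02 makes the steep coordinate flip sign on every step (the "bouncing between the walls" behaviour) while the flat coordinate shrinks by only about 2% per step; a learning rate just above 0.02 makes the steep coordinate diverge. That trade-off is the ravine problem in miniature.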




Important: The RMSprop Mechanism
RMSprop works by calculating a "Volatility Meter" and then normalizing the step size.
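
As a rough sketch of these two stages, the commonly used RMSprop formulation keeps a running average of squared gradients and divides each step by its square root. The guide's own formulas are cut off in this preview, so the function name rmsprop_step, the variable names and the default values below are assumptions rather than the author's notation.

```python
import numpy as np

def rmsprop_step(w, s, grad, lr=0.01, decay=0.9, eps=1e-8):
    # "Volatility meter": exponentially decayed running average of squared gradients,
    # tracked separately for each parameter.
    s = decay * s + (1.0 - decay) * grad ** 2
    # Normalize: divide each parameter's step by the root of its own running average.
    # eps only guards against division by zero.
    w = w - lr * grad / (np.sqrt(s) + eps)
    return w, s
```

For example, starting from s = 0 with gradient (100, 1), the first update moves both parameters by nearly the same amount even though one gradient is 100 times larger, which is how the steep direction is damped relative to the flat one.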

Step 1: The "Volatility" Meter
This step calculates how shaky or volatile the recent steps have been by keeping a running

Seller: know-how, National University of Sciences and Technology, Islamabad