100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

Summary of paper Masked Autoencoders Are Scalable Vision Learners

Rating
-
Sold
-
Pages
4
Uploaded on
05-07-2024
Written in
2023/2024

This is a summary of the paper Masked Autoencoders Are Scalable Vision Learners for the course Seminar of Computer Vision by Deep Learning in TU Delft

Institution
Course








Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
July 5, 2024
Number of pages
4
Written in
2023/2024
Type
Summary

Subjects

Content preview

Masked Autoencoders Are
Scalable Vision Learners




During pre-training, a large random subset of image patches (75%) is masked
out. The encoder is applied to the small subset of visible patches. Mask tokens
are introduced after the encoder and the full set of encoded pathes and mask
tokens is processed by a small decoder that reconstructs the original image in
pixels.


Introduction
Solutions based on autoregressive language modeling in GPT and masked
autoencoding BERT are conceptually simple, they remove a portion of the data
and learn to predict removed content.
The idea of masked autoencoders, a form of more general denoising
autoencoder, is natural and applicable in computer vision as well.
What makes autoencoding different between vision and language?
Languages are human-generated signals that are highly semantic and
information-dense. When training a model to predict only a few missing words
per sentence, this task appears to induce sophisticated language
understanding. Images on the contrary are natural signals with heavy spatial
redundancy — e.g. a missing patch can be covered from neighboring patches
with little high-level understanding of parts, objects and scenes.




Masked Autoencoders Are Scalable Vision Learners 1
$8.67
Get access to the full document:

100% satisfaction guarantee
Immediately available after payment
Both online and in PDF
No strings attached

Get to know the seller
Seller avatar
guillemribes

Also available in package deal

Get to know the seller

Seller avatar
guillemribes Technische Universiteit Delft
Follow You need to be logged in order to follow users or courses
Sold
0
Member since
1 year
Number of followers
0
Documents
11
Last sold
-

0.0

0 reviews

5
0
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions