
Lecture notes Multi-Agent Systems part 2 (XM_0052), Master VU AI

Pages
42
Uploaded on
23-02-2022
Written in
2020/2021

Lecture notes from lectures 7-9 of the MAS course. With these notes you no longer have to watch the lectures and can mainly practice.

Document information

Type
Class notes
Professor(s)
Eric Pauwels
Contains
7-9

Content preview

Lecture 7 (17 & 19 & 24 Nov)
Exploration versus Exploitation

Context

sequential decision making

recurring themes:

States, actions, transitions, policy (how the agent translates the state it is in into the action it should
take), value functions (functions that assign a value to each state, which the agent can use to choose its next action);

Back-up, optimization (planning and searching)

In sequential decision making an agent tries to solve a sequential control problem by directly interacting with an
unknown environment

learning by trial and error, agent tries actions to learn their consequences

not supervised: no examples of correct or incorrect behavior; instead only rewards for actions tried

active learning: agent interacts with environment, agent has partial control over what data it will obtain for
learning

on-line learning: it must maximize performance during learning, not afterwards

In game theory we only dealt with selfish agents without a mental state, each looking for the most rational action. From
now on we look at agents that use mental states to move forward. We first focus on a single agent.
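
The trial-and-error loop described above can be sketched with a small simulation. The notes do not prescribe an algorithm at this point, so the epsilon-greedy rule, the Gaussian rewards and all names below (`run_epsilon_greedy`, `true_means`, `epsilon`) are assumptions chosen for illustration only. The agent only sees rewards for the actions it tries (not supervised), chooses which data it collects (active learning), and its total reward is counted while it learns (on-line learning).

```python
import random


def run_epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Interact with an unknown environment by trial and error.

    true_means is hidden from the agent; it is only used by the
    (hypothetical) environment to generate noisy rewards.
    """
    rng = random.Random(seed)
    n_actions = len(true_means)
    counts = [0] * n_actions        # how often each action was tried
    estimates = [0.0] * n_actions   # running average reward per action
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random action to learn its consequences.
            action = rng.randrange(n_actions)
        else:
            # Exploit: take the action that currently looks best.
            action = max(range(n_actions), key=lambda a: estimates[a])

        # The environment answers with a reward only, never with the "correct" action.
        reward = rng.gauss(true_means[action], 1.0)

        counts[action] += 1
        # Incremental update of the sample mean for this action.
        estimates[action] += (reward - estimates[action]) / counts[action]
        total_reward += reward

    return estimates, counts, total_reward


estimates, counts, total_reward = run_epsilon_greedy([0.2, 0.5, 1.0])
```

With `epsilon = 0` the agent never explores and can get stuck on the first action that happens to look good; with `epsilon = 1` it never uses what it has learned. That tension is the exploration versus exploitation trade-off named in the lecture title.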

Preliminaries: Recap of Probability Theory
Stochastic (or random) variables: an abstract model of the idea of a randomly determined numerical outcome drawn from a set of
outcomes.

X: Ω (the set of outcomes) → ℝ
P(X = x): the probability that X takes the value x is ... → plotting this for every x gives the density function
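
To make the mapping X: Ω → ℝ concrete, here is a minimal sketch under an assumed example that is not in the notes: Ω is the set of outcomes of two fair dice and X is their sum. The helper name `distribution` is hypothetical. For a discrete variable like this one, the values P(X = x) form a probability mass function rather than a density.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product

# Sample space Omega: all ordered outcomes of two fair dice (assumed example).
omega = list(product(range(1, 7), repeat=2))


def X(outcome):
    """Random variable: maps an outcome in Omega to a number (here the sum)."""
    return outcome[0] + outcome[1]


def distribution(random_variable, sample_space):
    """Return P(X = x) for every value x, assuming equally likely outcomes."""
    p_outcome = Fraction(1, len(sample_space))
    pmf = defaultdict(Fraction)  # Fraction() is 0, so missing values start at 0
    for w in sample_space:
        pmf[random_variable(w)] += p_outcome
    return dict(pmf)


pmf = distribution(X, omega)

# Plotting these pairs (x, P(X = x)) gives the distribution of X.
for x in sorted(pmf):
    print(x, pmf[x])
```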





By combining the values in each row and taking the mean of that row, the last column gives the sample mean.




As more and more samples of X are added to each row, the distribution of the sample mean approaches a normal distribution.
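
This behaviour (the central limit theorem) can be checked numerically. The sketch below is an illustration under assumptions that are not in the notes: X is drawn from an exponential distribution, which is strongly skewed, and skewness is used as a rough measure of how far the distribution of the row means is from the symmetric normal shape. The function names are hypothetical.

```python
import random
import statistics


def sample_means(n, trials=5000, seed=1):
    """Build `trials` rows of n draws of X and return the mean of each row."""
    rng = random.Random(seed)
    return [
        statistics.fmean(rng.expovariate(1.0) for _ in range(n))
        for _ in range(trials)
    ]


def skewness(values):
    """Third standardized moment: 0 for a symmetric (e.g. normal) distribution."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return statistics.fmean(((v - m) / s) ** 3 for v in values)


# Skewness of the row means for rows of increasing length.
for n in (1, 5, 30):
    print(n, round(skewness(sample_means(n)), 2))
```

The printed skewness should move toward 0 as n grows, which is what "going to a normal form" looks like in numbers.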





The larger the sample, the more peaked the distribution of the sample mean becomes around the actual mean, and the closer it gets to a normal distribution.
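
The narrowing around the actual mean can also be sketched numerically. The example below assumes X is uniform on [0, 1] (actual mean 0.5, standard deviation about 0.289), which is an assumption for illustration and not taken from the notes. For independent draws, the standard deviation of the sample mean is the standard deviation of X divided by the square root of n.

```python
import math
import random
import statistics


def spread_of_sample_mean(n, trials=4000, seed=2):
    """Return (mean, standard deviation) of the sample mean over many samples of size n."""
    rng = random.Random(seed)
    means = [
        statistics.fmean(rng.uniform(0.0, 1.0) for _ in range(n))
        for _ in range(trials)
    ]
    return statistics.fmean(means), statistics.stdev(means)


sigma = math.sqrt(1 / 12)  # standard deviation of a uniform [0, 1] variable

for n in (4, 25, 100):
    centre, spread = spread_of_sample_mean(n)
    # Compare the simulated spread with the theoretical value sigma / sqrt(n).
    print(n, round(centre, 3), round(spread, 4), round(sigma / math.sqrt(n), 4))
```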




Seller: MeldaMalkoc, Vrije Universiteit Amsterdam