CS 234 assignment 2-ALL ANSWERS 100% CORRECT
Consider the following grid environment. Starting from any unshaded square, you can move up, down, left, or right. Actions are deterministic and always succeed (e.g., going left from state 16 goes to state 15) unless they would cause the agent to run into a wall. The thicker edges indicate walls, and attempting to move in the direction of a wall results in staying in the same square (e.g., going in any direction other than left from state 16 stays in 16). Taking any action from the green target square (no. 12) earns a reward of rg (so r(12, a) = rg ∀a) and ends the episode. Taking any action from the red square of death (no. 5) earns a reward of rr (so r(5, a) = rr ∀a) and ends the episode. From every other square, taking any action earns a reward rs ∈ {−1, 0, +1} (even if the action results in the agent staying in the same square). Assume the discount factor γ = 1, rg = +5, and rr = −5 unless otherwise specified.
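The rules above can be tried out with a small value-iteration sketch. The grid figure is not reproduced here, so the layout below is purely an assumption: a 4x4 grid numbered 1 to 16 row by row, with a single hypothetical wall between squares 12 and 16 (chosen only so that left is the one move that leaves square 16, as the text states). The names `step`, `value_iteration`, `WALLS`, and `N` are illustrative and not part of the assignment.

```python
# Hypothetical layout (NOT the assignment's figure): 4x4 grid, numbered
# 1..16 row by row, with one assumed wall between squares 12 and 16.
N = 4
GOAL, DEATH = 12, 5
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
WALLS = {frozenset((12, 16))}  # assumed interior walls, as unordered pairs


def step(state, action):
    """Deterministic transition: hitting a wall or the border keeps the agent in place."""
    row, col = divmod(state - 1, N)
    d_row, d_col = MOVES[action]
    new_row, new_col = row + d_row, col + d_col
    if not (0 <= new_row < N and 0 <= new_col < N):
        return state  # outer border acts as a wall
    nxt = new_row * N + new_col + 1
    if frozenset((state, nxt)) in WALLS:
        return state  # interior wall blocks the move
    return nxt


def value_iteration(rs, rg=5.0, rr=-5.0, gamma=1.0, max_iters=1000, tol=1e-9):
    """Synchronous value iteration; squares 12 and 5 pay rg / rr and end the episode."""
    V = {s: 0.0 for s in range(1, N * N + 1)}
    for _ in range(max_iters):
        new_V = {}
        for s in V:
            if s == GOAL:
                new_V[s] = rg  # any action earns rg, then the episode ends
            elif s == DEATH:
                new_V[s] = rr  # any action earns rr, then the episode ends
            else:
                # every action earns rs, even if the agent stays in place
                new_V[s] = max(rs + gamma * V[step(s, a)] for a in MOVES)
        if max(abs(new_V[s] - V[s]) for s in V) < tol:
            return new_V
        V = new_V
    return V  # reached the iteration cap without converging
```

For example, `value_iteration(rs=-1)` gives the optimal value of each square in this assumed layout. With rs = +1 and γ = 1 the values grow without bound, because the agent can collect +1 forever, so the `max_iters` cap is what stops the loop in that case.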
School, study and subject
- Institution: Chamberlain College Nursing
- Course: CS 234 assignment 1-ALL ANSWERS 100% CORRECT (CS234)
Document information
- Uploaded on: December 17, 2021
- Number of pages: 8
- Written in: 2021/2022
- Type: Exam
- Contains: Questions and answers
Topics
- cs 234 assignment 2 all answers 100 correct