NAGIOS: RODERIC FUNCIONANDO

Using Inverse Reinforcement Learning with Real Trajectories to Get More Trustworthy Pedestrian Simulation

Repositori DSpace/Manakin

Valencià Castellano

IMPORTANT: Aquest repositori està en una versió antiga des del 3/12/2023. La nova instal.lació está en https://roderic.uv.es/

Using Inverse Reinforcement Learning with Real Trajectories to Get More Trustworthy Pedestrian Simulation

Mostra el registre complet de l'element

Visualització (2.520Mb)

Martinez Gil, Francisco Antonio; Lozano Ibáñez, Miguel; García-Fernández, Ignacio; Romero, Pau; Serra, Dolors; Sebastián Aguilar, Rafael

Aquest document és un/a article, creat/da en: 2020

Reinforcement learning is one of the most promising machine learning techniques to get intelligent behaviors for embodied agents in simulations. The output of the classic Temporal Difference family of Reinforcement Learning algorithms adopts the form of a value function expressed as a numeric table or a function approximator. The learned behavior is then derived using a greedy policy with respect to this value function. Nevertheless, sometimes the learned policy does not meet expectations, and the task of authoring is difficult and unsafe because the modification of one value or parameter in the learned value function has unpredictable consequences in the space of the policies it represents. This invalidates direct manipulation of the learned value function as a method to modify the derived behaviors. In this paper, we propose the use of Inverse Reinforcement Learning to incorporate rea... [Llegir més ...]

Veure al catàleg Trobes