dc.contributor.author | Mannion, Patrick | |
dc.contributor.author | Duggan, Jim | |
dc.contributor.author | Howley, Enda | |
dc.date.accessioned | 2018-12-13T13:02:51Z | |
dc.date.available | 2018-12-13T13:02:51Z | |
dc.date.copyright | 2017-05 | |
dc.date.issued | 2017 | |
dc.identifier.uri | https://research.thea.ie/handle/20.500.12065/2388 | |
dc.description.abstract | Reward shaping has been proposed as a means to address the
credit assignment problem in Multi-Agent Systems (MAS).
Two popular shaping methods are Potential-Based Reward
Shaping and di erence rewards, and both have been shown
to improve learning speed and the quality of joint policies
learned by agents in single-objective MAS. In this work we
discuss the theoretical implications of applying these approaches
to multi-objective MAS, and evaluate their e -
cacy using a new multi-objective benchmark domain where
the true set of Pareto optimal system utilities is known. | en_US |
dc.format | Pdf | en_US |
dc.language.iso | en | en_US |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Ireland | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/ie/ | * |
dc.subject | Multi-Objective | en_US |
dc.subject | Stochastic Game | en_US |
dc.subject | Reinforcement Learning | en_US |
dc.subject | Reward Shaping | en_US |
dc.subject | Multi-Agent Systems | en_US |
dc.subject | Credit Assignment | en_US |
dc.title | A Theoretical and Empirical Analysis of Reward Transformations in Multi-Objective Stochastic Games | en_US |
dc.type | Presentation | en_US |
dc.description.peerreview | no | en_US |
dc.rights.access | Copyright | en_US |
dc.subject.department | Department of Computer Science & Applied Physics | en_US |