Show simple item record

dc.contributor.authorMannion, Patrick
dc.contributor.authorDuggan, Jim
dc.contributor.authorHowley, Enda
dc.date.accessioned2018-12-13T13:02:51Z
dc.date.available2018-12-13T13:02:51Z
dc.date.copyright2017-05
dc.date.issued2017
dc.identifier.urihttps://research.thea.ie/handle/20.500.12065/2388
dc.description.abstractReward shaping has been proposed as a means to address the credit assignment problem in Multi-Agent Systems (MAS). Two popular shaping methods are Potential-Based Reward Shaping and di erence rewards, and both have been shown to improve learning speed and the quality of joint policies learned by agents in single-objective MAS. In this work we discuss the theoretical implications of applying these approaches to multi-objective MAS, and evaluate their e - cacy using a new multi-objective benchmark domain where the true set of Pareto optimal system utilities is known.en_US
dc.formatPdfen_US
dc.language.isoenen_US
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/ie/*
dc.subjectMulti-Objectiveen_US
dc.subjectStochastic Gameen_US
dc.subjectReinforcement Learningen_US
dc.subjectReward Shapingen_US
dc.subjectMulti-Agent Systemsen_US
dc.subjectCredit Assignmenten_US
dc.titleA Theoretical and Empirical Analysis of Reward Transformations in Multi-Objective Stochastic Gamesen_US
dc.typePresentationen_US
dc.description.peerreviewnoen_US
dc.rights.accessCopyrighten_US
dc.subject.departmentDepartment of Computer Science & Applied Physicsen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland