dc.contributor.author | Mannion, Patrick | |
dc.contributor.author | Devlin, Sam | |
dc.contributor.author | Duggan, Jim | |
dc.contributor.author | Howley, Enda | |
dc.date.accessioned | 2018-12-19T16:50:30Z | |
dc.date.available | 2018-12-19T16:50:30Z | |
dc.date.copyright | 2017-08 | |
dc.date.issued | 2017-08 | |
dc.identifier.uri | https://research.thea.ie/handle/20.500.12065/2392 | |
dc.description.abstract | Multi-Agent Systems (MAS) are a form of distributed intelligence, where multiple autonomous
agents act in a common environment. Numerous complex, real world systems have been
successfully optimised using Multi-Agent Reinforcement Learning (MARL) in conjunction with
the MAS framework. In MARL agents learn by maximising a scalar reward signal from the
environment, and thus the design of the reward function directly a ects the policies learned. In
this work, we address the issue of appropriate multi-agent credit assignment in stochastic resource
management games. We propose two new Stochastic Games to serve as testbeds for MARL
research into resource management problems: the Tragic Commons Domain and the Shepherd
Problem Domain. Our empirical work evaluates the performance of two commonly used reward
shaping techniques: Potential-Based Reward Shaping and difference rewards. Experimental
results demonstrate that systems using appropriate reward shaping techniques for multi-agent
credit assignment can achieve near optimal performance in stochastic resource management
games, outperforming systems learning using unshaped local or global evaluations. We also
present the first empirical investigations into the effect of expressing the same heuristic knowledge
in state- or action-based formats, therefore developing insights into the design of multi-agent
potential functions that will inform future work. | en_US |
dc.format | Pdf | en_US |
dc.language.iso | en | en_US |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Ireland | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/ie/ | * |
dc.subject | Multi-Agent Systems | en_US |
dc.subject | Environment | en_US |
dc.subject | Stochastic Resource Management Games | en_US |
dc.subject | Potential-Based Reward Shaping | en_US |
dc.title | Multi-Agent Credit Assignment in Stochastic Resource Management Games | en_US |
dc.type | Article | en_US |
dc.description.peerreview | yes | en_US |
dc.rights.access | Copyright | en_US |
dc.subject.department | Department of Computer Science & Applied Physics | en_US |