Dynamics generalisation in reinforcement learning through the use of adaptive policies

dc.contributor.authorBeukman, Michael
dc.date.accessioned2024-01-26T09:49:46Z
dc.date.available2024-01-26T09:49:46Z
dc.date.issued2024
dc.descriptionA research report submitted in partial fulfilment of the requirements for the degree Master of Science to the Faculty of Science, School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg, 2023
dc.description.abstractReinforcement learning (RL) is a widely-used method for training agents to interact with an external environment, and is commonly used in fields such as robotics. While RL has achieved success in several domains, many methods fail to generalise well to scenarios different from those encountered during training. This is a significant limitation that hinders RL’s real-world applicability. In this work, we consider the problem of generalising to new transition dynamics, corresponding to cases in which the effects of the agent’s actions differ; for instance, walking on a slippery vs. rough floor. To address this problem, we introduce a neural network architecture, the Decision Adapter, which leverages contextual information to modulate the behaviour of an agent, depending on the setting it is in. In particular, our method uses the context – information about the current environment, such as the floor’s friction – to generate the weights of an adapter module which influences the agent’s actions. This, for instance, allows an agent to act differently when walking on ice compared to gravel. We theoretically show that our approach generalises a prior network architecture and empirically demonstrate that it results in superior generalisation performance compared to previous approaches in several environments. Furthermore, we show that our method can be applied to multiple RL algorithms, making it a widely-applicable approach to improve generalisation
dc.description.librarianTL (2024)
dc.facultyFaculty of Science
dc.identifier.urihttps://hdl.handle.net/10539/37446
dc.language.isoen
dc.schoolComputer Science and Applied Mathematics
dc.subjectReinforcement learning
dc.subjectRobotics
dc.titleDynamics generalisation in reinforcement learning through the use of adaptive policies
dc.typeDissertation
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Michael Beukman 1825748 MSc Dissertation.pdf
Size:
3.31 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.43 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections