Learning Operators with NEAT for Boolean Composition in Reinforcement Learning
| dc.contributor.author | Esterhuysen, Amir | |
| dc.contributor.supervisor | Rosman, Benjamin | |
| dc.contributor.supervisor | James, Steven | |
| dc.contributor.supervisor | Tasse, Geraud Nangue | |
| dc.date.accessioned | 2025-11-11T09:54:53Z | |
| dc.date.issued | 2025-06 | |
| dc.description | A research report submitted in partial fulfilment of the requirements for the degree of Master of Science in the field of Artificial Intelligence, to the Faculty of Science, School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg, 2025 | |
| dc.description.abstract | The idea of skill composition has been gaining traction within reinforcement learning research. This compositional approach promotes efficient use of knowledge and represents a realistic, human-like style of learning. Existing work has demonstrated how simple skills can be composed using Boolean operators to solve new, unseen tasks without the need for further learning. However, this approach assumes that the learned value functions for each atomic skill are optimal, an assumption that is violated in most practical cases. We thus propose a method that instead learns operators for composition using evolutionary strategies. Our approach is empirically verified first in a tabular setting and then in a high-dimensional function approximation environment. Results show that it outperforms existing composition methods when faced with learned, suboptimal behaviours, while also promoting the development of robust agents and allowing for fluid transfer between domains. | |
| dc.description.submitter | MMM2025 | |
| dc.faculty | Faculty of Science | |
| dc.identifier | 0000-0003-0446-4737 | |
| dc.identifier.citation | Esterhuysen, Amir. (2025). Learning Operators with NEAT for Boolean Composition in Reinforcement Learning. [Master's dissertation, University of the Witwatersrand, Johannesburg]. WIReDSpace. https://hdl.handle.net/10539/47485 | |
| dc.identifier.uri | https://hdl.handle.net/10539/47485 | |
| dc.language.iso | en | |
| dc.publisher | University of the Witwatersrand, Johannesburg | |
| dc.rights | ©2025 University of the Witwatersrand, Johannesburg. All rights reserved. The copyright in this work vests in the University of the Witwatersrand, Johannesburg. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of University of the Witwatersrand, Johannesburg. | |
| dc.rights.holder | University of the Witwatersrand, Johannesburg | |
| dc.school | School of Computer Science and Applied Mathematics | |
| dc.subject | Reinforcement Learning | |
| dc.subject | NEAT | |
| dc.subject | Neuroevolution | |
| dc.subject | Genetic Algorithm | |
| dc.subject | Composition | |
| dc.subject | UCTD | |
| dc.subject.primarysdg | SDG-9: Industry, innovation and infrastructure | |
| dc.subject.secondarysdg | SDG-4: Quality education | |
| dc.title | Learning Operators with NEAT for Boolean Composition in Reinforcement Learning | |
| dc.type | Dissertation | |