Learning Operators with NEAT for Boolean Composition in Reinforcement Learning

dc.contributor.authorEsterhuysen, Amir
dc.contributor.supervisorRosman, Benjamin
dc.contributor.supervisorJames, Steven
dc.contributor.supervisorTasse, Geraud Nangue
dc.date.accessioned2025-11-11T09:54:53Z
dc.date.issued2025-06
dc.descriptionA research report submitted in partial fulfilment of the requirements for the degree of Masters of Science in the field of Artificial Intelligence, to the Faculty of Science, School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg, 2025
dc.description.abstractThe idea of skill composition has been gaining traction within reinforcement learning research. This compositional approach promotes efficient use of knowledge and represents a realistic, human-like style of learning. Existing work has demonstrated how simple skills can be composed using Boolean operators to solve new, unseen tasks without the need for further learning. However, this approach assumes that the learned value functions for each atomic skill are optimal—an assumption that is violated in most practical cases. We thus propose a method that instead learns operators for composition using evolutionary strategies. Our approach is empirically verified first in a tabular setting and then in a high dimensional function approximation environment. Results demonstrate outperformance of existing composition methods when faced with learned, suboptimal behaviours, while also promoting the development of robust agents and allowing for fluid transfer between domains.
dc.description.submitterMMM2025
dc.facultyFaculty of Science
dc.identifier0000-0003-0446-4737
dc.identifier.citationEsterhuysen, Amir. (2025). Learning Operators with NEAT for Boolean Composition in Reinforcement Learning. [Master's dissertation, University of the Witwatersrand, Johannesburg]. WIReDSpace. https://hdl.handle.net/10539/47485
dc.identifier.urihttps://hdl.handle.net/10539/47485
dc.language.isoen
dc.publisherUniversity of the Witwatersrand, Johannesburg
dc.rights©2025 University of the Witwatersrand, Johannesburg. All rights reserved. The copyright in this work vests in the University of the Witwatersrand, Johannesburg. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of University of the Witwatersrand, Johannesburg.
dc.rights.holderUniversity of the Witwatersrand, Johannesburg
dc.schoolSchool of Computer Science and Applied Mathematics
dc.subjectReinforcement Learning
dc.subjectNEAT
dc.subjectNeuroevolution
dc.subjectGenetic Algorithm
dc.subjectComposition
dc.subjectUCTD
dc.subject.primarysdgSDG-9: Industry, innovation and infrastructure
dc.subject.secondarysdgSDG-4: Quality education
dc.titleLearning Operators with NEAT for Boolean Composition in Reinforcement Learning
dc.typeDissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Esterhuysen_Learning_2025.pdf
Size:
2.33 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.43 KB
Format:
Item-specific license agreed upon to submission
Description: