Generalized Task Learning for Robots: Unifying Task Hierarchies through Contrastive Learning
| dc.contributor.author | Alexander, Ryan Austin | |
| dc.contributor.co-supervisor | James, Steven | |
| dc.contributor.supervisor | Klein, Richard | |
| dc.date.accessioned | 2025-11-07T12:27:31Z | |
| dc.date.issued | 2025-06 | |
| dc.description | A dissertation submitted in partial fulfilment of the requirements for the degree of Master of Science, to the Faculty of Science, School of Computer Science & Applied Mathematics, University of the Witwatersrand, Johannesburg, | |
| dc.description.abstract | This dissertation addresses the challenge of enabling robots to generalize across unseen household tasks by learning abstract task structures from demonstration data. We develop a three-stage pipeline that translates natural language instructions and demonstrations into hierarchical task representations using large language models, clustering, and parameterized generalization. Our approach is tested and evaluated on the ALFRED benchmark [Shridhar et al. 2020]. ALFRED acts as a standardized measure used for training models to comprehend and follow instructions in natural language. It leverages first-person perspective visual input to carry out a series of actions for various household tasks. While this approach doesn’t represent the state-of-the-art, it establishes a foundation for future research to build upon. | |
| dc.description.submitter | MMM2025 | |
| dc.faculty | Faculty of Science | |
| dc.identifier | 0000-0002-5785-6411 | |
| dc.identifier.citation | Alexander, Ryan Austin. (2025). Generalized Task Learning for Robots: Unifying Task Hierarchies through Contrastive Learning. [Master's dissertation, University of the Witwatersrand, Johannesburg]. WIReDSpace. https://hdl.handle.net/10539/47450 | |
| dc.identifier.uri | https://hdl.handle.net/10539/47450 | |
| dc.language.iso | en | |
| dc.publisher | University of the Witwatersrand, Johannesburg | |
| dc.rights | ©2025 University of the Witwatersrand, Johannesburg. All rights reserved. The copyright in this work vests in the University of the Witwatersrand, Johannesburg. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of University of the Witwatersrand, Johannesburg. | |
| dc.rights.holder | University of the Witwatersrand, Johannesburg | |
| dc.school | School of Computer Science and Applied Mathematics | |
| dc.subject | Natural Language Instuctions | |
| dc.subject | Clustering Methods | |
| dc.subject | Task Generalisation | |
| dc.subject | Symbolic Planning | |
| dc.subject | Embodied Instruction Following | |
| dc.subject | UCTD | |
| dc.subject.primarysdg | SDG-9: Industry, innovation and infrastructure | |
| dc.subject.secondarysdg | SDG-4: Quality education | |
| dc.title | Generalized Task Learning for Robots: Unifying Task Hierarchies through Contrastive Learning | |
| dc.type | Dissertation |