A Continuous Reinforcement Learning Approach to Self-Adaptive Particle Swarm Optimisation

Tilley, Duncan

A Continuous Reinforcement Learning Approach to Self-Adaptive Particle Swarm Optimisation

Files

Tilley_Continuous_2023.pdf (3.44 MB)

Date

2023-08

Authors

Tilley, Duncan

Publisher

University of the Witwatersrand, Johannesburg

Abstract

Particle Swarm Optimisation (PSO) is a popular black-box optimisation technique due to its simple implementation and surprising ability to perform well on various problems. Unfortunately, PSO is fairly sensitive to the choice of hyper-parameters. For this reason, many self-adaptive techniques have been proposed that attempt to both simplify hyper-parameter selection and improve the performance of PSO. Surveys however show that many self-adaptive techniques are still outperformed by time-varying techniques where the value of coefficients are simply increased or decreased over time. More recent works have shown the successful application of Reinforcement Learning (RL) to learn self-adaptive control policies for optimisers such as differential evolution, genetic algorithms, and PSO. However, many of these applications were limited to only discrete state and action spaces, which severely limits the choices available to a control policy, given that the PSO coefficients are continuous variables. This dissertation therefore investigates the application of continuous RL techniques to learn a self-adaptive control policy that can make full use of the continuous nature of the PSO coefficients. The dissertation first introduces the RL framework used to learn a continuous control policy by defining the environment, action-space, state-space, and a number of possible reward functions. An effective learning environment that is able to overcome the difficulties of continuous RL is then derived through a series of experiments, culminating in a successfully learned continuous control policy. The policy is then shown to perform well on the benchmark problems used during training when compared to other self-adaptive PSO algorithms. Further testing on benchmark problems not seen during training suggest that the learned policy may however not generalise well to other functions, but this is shown to also be a problem in other PSO algorithms. Finally, the dissertation performs a number of experiments to provide insights into the behaviours learned by the continuous control policy.

Description

A dissertation submitted in fulfilment of the requirements for the degree of Master of Science (in Computer Science), to the Faculty of Science, School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg, 2023.

Keywords

Particle Swarm Optimisation, Self-Adaptive PSO, Reinforcement Learning, UCTD

Citation

Tilley, Duncan. (2023). A Continuous Reinforcement Learning Approach to Self-Adaptive Particle Swarm Optimisation. [Master's dissertation, University of the Witwatersrand, Johannesburg]. https://hdl.handle.net/10539/42617

URI

https://hdl.handle.net/10539/42617

Collections

Electronic Theses and Dissertations (Masters)

Full item page

A Continuous Reinforcement Learning Approach to Self-Adaptive Particle Swarm Optimisation

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By