Automatic speech feature extraction using a convolutional restricted boltzmann machine

dc.contributor.authorAnderson, David John
dc.date.accessioned2018-12-04T11:39:37Z
dc.date.available2018-12-04T11:39:37Z
dc.date.issued2017
dc.descriptionA dissertation submitted to the Faculty of Science, University of the Witwatersrand, in fulfillment of the requirements for the degree of Master of Science, 2017en_ZA
dc.description.abstractRestricted Boltzmann Machines (RBMs) are a statistical learning concept that can be interpreted as Arti cial Neural Networks. They are capable of learning, in an unsupervised fashion, a set of features with which to describe a data set. Connected in series RBMs form a model called a Deep Belief Network (DBN), learning abstract feature combinations from lower layers. Convolutional RBMs (CRBMs) are a variation on the RBM architecture in which the learned features are kernels that are convolved across spatial portions of the input data to generate feature maps identifying if a feature is detected in a portion of the input data. Features extracted from speech audio data by a trained CRBM have recently been shown to compete with the state of the art for a number of speaker identi cation tasks. This project implements a similar CRBM architecture in order to verify previous work, as well as gain insight into Digital Signal Processing (DSP), Generative Graphical Models, unsupervised pre-training of Arti cial Neural Networks, and Machine Learning classi cation tasks. The CRBM architecture is trained on the TIMIT speech corpus and the learned features veri ed by using them to train a linear classi er on tasks such as speaker genetic sex classi cation and speaker identi cation. The implementation is quantitatively proven to successfully learn and extract a useful feature representation for the given classi cation tasksen_ZA
dc.description.librarianMT2018en_ZA
dc.format.extentOnline resource (78 leaves)
dc.identifier.citationAnderson, David John (2017) Automatic speech feature extraction using a convolutional restricted boltzmann machine, University of the Witwatersrand, Johannesburg, <http://hdl.handle.net/10539/26165>
dc.identifier.urihttps://hdl.handle.net/10539/26165
dc.language.isoenen_ZA
dc.subject.lcshNeural networks (Computer science)
dc.subject.lcshMachine learning
dc.titleAutomatic speech feature extraction using a convolutional restricted boltzmann machineen_ZA
dc.typeThesisen_ZA

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
David Anderson - 0706537A - Final MSc Submission.pdf
Size:
1.96 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections