Adaptive threshold optimisation for colour-based lip segmentation in automatic lip-reading systems

dc.contributor.authorGritzman, Ashley Daniel
dc.date.accessioned2017-05-18T12:47:29Z
dc.date.available2017-05-18T12:47:29Z
dc.date.issued2016
dc.descriptionA thesis submitted to the Faculty of Engineering and the Built Environment, University of the Witwatersrand, Johannesburg, in ful lment of the requirements for the degree of Doctor of Philosophy. Johannesburg, September 2016en_ZA
dc.description.abstractHaving survived the ordeal of a laryngectomy, the patient must come to terms with the resulting loss of speech. With recent advances in portable computing power, automatic lip-reading (ALR) may become a viable approach to voice restoration. This thesis addresses the image processing aspect of ALR, and focuses three contributions to colour-based lip segmentation. The rst contribution concerns the colour transform to enhance the contrast between the lips and skin. This thesis presents the most comprehensive study to date by measuring the overlap between lip and skin histograms for 33 di erent colour transforms. The hue component of HSV obtains the lowest overlap of 6:15%, and results show that selecting the correct transform can increase the segmentation accuracy by up to three times. The second contribution is the development of a new lip segmentation algorithm that utilises the best colour transforms from the comparative study. The algorithm is tested on 895 images and achieves percentage overlap (OL) of 92:23% and segmentation error (SE) of 7:39 %. The third contribution focuses on the impact of the histogram threshold on the segmentation accuracy, and introduces a novel technique called Adaptive Threshold Optimisation (ATO) to select a better threshold value. The rst stage of ATO incorporates -SVR to train the lip shape model. ATO then uses feedback of shape information to validate and optimise the threshold. After applying ATO, the SE decreases from 7:65% to 6:50%, corresponding to an absolute improvement of 1:15 pp or relative improvement of 15:1%. While this thesis concerns lip segmentation in particular, ATO is a threshold selection technique that can be used in various segmentation applications.en_ZA
dc.description.librarianMT2017en_ZA
dc.format.extentOnline resource (xix, 171 leaves)
dc.identifier.citationGritzman, Ashley Daniel (2016) Adaptive threshold optimisation for colour-based lip segmentation in automatic lip-reading systems, University of the Witwatersrand, Johannesburg, <http://hdl.handle.net/10539/22664>
dc.identifier.urihttp://hdl.handle.net/10539/22664
dc.language.isoenen_ZA
dc.subject.lcshAutomatic speech recognition
dc.subject.lcshSpeech processing systems
dc.subject.lcshLipreading--Computer simulation
dc.subject.lcshSpeech synthesis
dc.titleAdaptive threshold optimisation for colour-based lip segmentation in automatic lip-reading systemsen_ZA
dc.typeThesisen_ZA

Files

Original bundle

Now showing 1 - 3 of 3
No Thumbnail Available
Name:
title_page.pdf
Size:
51.33 KB
Format:
Adobe Portable Document Format
Description:
No Thumbnail Available
Name:
abstract.pdf
Size:
170.62 KB
Format:
Adobe Portable Document Format
Description:
No Thumbnail Available
Name:
thesis.pdf
Size:
22.36 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections