Adaptive threshold optimisation for colour-based lip segmentation in automatic lip-reading systems

Show simple item record

dc.contributor.author Gritzman, Ashley Daniel
dc.date.accessioned 2017-05-18T12:47:29Z
dc.date.available 2017-05-18T12:47:29Z
dc.date.issued 2016
dc.identifier.uri http://hdl.handle.net/10539/22664
dc.description A thesis submitted to the Faculty of Engineering and the Built Environment, University of the Witwatersrand, Johannesburg, in ful lment of the requirements for the degree of Doctor of Philosophy. Johannesburg, September 2016 en_ZA
dc.description.abstract Having survived the ordeal of a laryngectomy, the patient must come to terms with the resulting loss of speech. With recent advances in portable computing power, automatic lip-reading (ALR) may become a viable approach to voice restoration. This thesis addresses the image processing aspect of ALR, and focuses three contributions to colour-based lip segmentation. The rst contribution concerns the colour transform to enhance the contrast between the lips and skin. This thesis presents the most comprehensive study to date by measuring the overlap between lip and skin histograms for 33 di erent colour transforms. The hue component of HSV obtains the lowest overlap of 6:15%, and results show that selecting the correct transform can increase the segmentation accuracy by up to three times. The second contribution is the development of a new lip segmentation algorithm that utilises the best colour transforms from the comparative study. The algorithm is tested on 895 images and achieves percentage overlap (OL) of 92:23% and segmentation error (SE) of 7:39 %. The third contribution focuses on the impact of the histogram threshold on the segmentation accuracy, and introduces a novel technique called Adaptive Threshold Optimisation (ATO) to select a better threshold value. The rst stage of ATO incorporates -SVR to train the lip shape model. ATO then uses feedback of shape information to validate and optimise the threshold. After applying ATO, the SE decreases from 7:65% to 6:50%, corresponding to an absolute improvement of 1:15 pp or relative improvement of 15:1%. While this thesis concerns lip segmentation in particular, ATO is a threshold selection technique that can be used in various segmentation applications. en_ZA
dc.language.iso en en_ZA
dc.title Adaptive threshold optimisation for colour-based lip segmentation in automatic lip-reading systems en_ZA
dc.type Thesis en_ZA
dc.description.librarian MT2017 en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account