A comparative analysis of two Image synthesis networks repurposed for video synthesis

dc.contributor.authorWeiher, Kyle
dc.date.accessioned2022-07-19T12:04:29Z
dc.date.available2022-07-19T12:04:29Z
dc.date.issued2021
dc.descriptionA dissertation submitted to the Faculty of Science, University of the Witwatersrand, Johannesburg, in fulfilment of the requirements for the degree of Master of Science, 2021en_ZA
dc.description.abstractModern, state-of-the-art video synthesis networks are large, complex, and difficult to train due to steep hardware requirements. Image synthesis networks are, however, far easier to train, and thus present an opportunity. We present a comparative analysis of two image synthesis networks when repurposed for video synthesis, namely the Cascaded Refinement Network (CRN), and pix2pixHD. We are primarily interested in the temporal consistency between output frames, however we analyse aspects such as image quality as well. The networks are not equal in neither features nor performance, with pix2pixHD superior in both regards, and thus we compare both base network configurations, and configurations where we port components from pix2pixHD to the CRN. Furthermore, we test video specific features, namely the use of prior frame information in the synthesis and training processes, and the use of optical flow estimation to warp prior frame content directly to the next frame. We find that while the CRN performed far worse than pix2pixHD in their base configurations, the CRN with pix2pixHD features closed the gap. Next, the addition of prior frame information provides the largest improvement to temporal consistency, leading to acceptably smooth synthetic video. Finally, optical flow estimation further improves the performance of pix2pixHD, while hindering the CRNen_ZA
dc.description.librarianCK2022en_ZA
dc.facultyFaculty of Scienceen_ZA
dc.identifier.urihttps://hdl.handle.net/10539/33039
dc.language.isoenen_ZA
dc.schoolSchool of Computer Science and Applied Mathematicsen_ZA
dc.titleA comparative analysis of two Image synthesis networks repurposed for video synthesisen_ZA
dc.typeThesisen_ZA

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
MSc - Kyle Brandon Weiher - 1116087 - Dissertation.pdf
Size:
23.76 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections