Chunked extendible arrays and its integration with the global array toolkit for parallel image processing

dc.contributor.author: Nimako, Gideon
dc.date.accessioned: 2017-04-10T12:33:42Z
dc.date.available: 2017-04-10T12:33:42Z
dc.date.issued: 2016
dc.description: A thesis submitted to the Faculty of Engineering and the Built Environment in fulfilment of the requirements for the degree of Doctor of Philosophy, 2016 [en_ZA]
dc.description: Online resource (xii, 151 leaves)
dc.description.abstract: Several meetings of the Extremely Large Databases Community for large scale scientific applications have advocated the use of multidimensional arrays as the appropriate model for representing scientific databases. Scientific databases gradually grow to massive sizes of the order of terabytes and petabytes. Storing such databases therefore requires efficient dynamic storage schemes in which the bounds of the array's dimensions can be extended arbitrarily. Conventional multidimensional array representations in today’s programming environments do not extend or shrink their bounds without relocating elements of the data set. In general, extendibility of the bounds is limited to only one dimension. This thesis presents a technique for storing dense multidimensional arrays by chunks such that the array can be extended along any dimension without compromising the access time of an element. This is done with a computed access mapping function that maps the k-dimensional index onto a linear index of the storage locations. This concept forms the basis for the implementation of an array file of any number of dimensions, where the bounds of the array dimensions can be extended arbitrarily. Such a feature currently exists in the Hierarchical Data Format version 5 (HDF5). However, extending the bound of a dimension in an HDF5 array file can be unusually expensive in time. In our storage scheme for dense array files, such extensions can be performed while elements of the array are still accessed orders of magnitude faster than in HDF5 or conventional array files. We also present Parallel Chunked Extendible Dense Array (PEXTA), a new parallel I/O model for the Global Array Toolkit. PEXTA not only provides the necessary Application Programming Interface (API) for explicit data transfer between the memory-resident global array and its secondary storage counterpart, but also allows the persistent array to be extended on any dimension without compromising the access time of an element or of sub-array elements. Such APIs provide a platform for high-speed, parallel hyperspectral image processing without performance degradation, even when the imagery files undergo extensions. [en_ZA]
dc.description.librarian: MT2017 [en_ZA]
dc.identifier.citation: Nimako, Gideon (2016) Chunked extendible arrays and its integration with the global array toolkit for parallel image processing, University of the Witwatersrand, <http://hdl.handle.net/10539/22332>
dc.identifier.uri: http://hdl.handle.net/10539/22332
dc.language.iso: en [en_ZA]
dc.subject.lcsh: Parallel processing (Electronic computers)
dc.subject.lcsh: Electronic data processing--Distributed processing
dc.subject.lcsh: Science--Databases
dc.subject.lcsh: Application software--Programming
dc.subject.lcsh: Database management
dc.title: Chunked extendible arrays and its integration with the global array toolkit for parallel image processing [en_ZA]
dc.type: Thesis [en_ZA]
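
The abstract describes a computed access mapping function that takes a k-dimensional index to a linear storage address while allowing any dimension to grow without relocating data. The sketch below is one way such a mapping can work; it is not taken from the thesis. It assumes that every extension is appended at the end of storage as a slab, and that a per-dimension history of extensions is kept and searched on each access. The names `ExtendibleChunkMap`, `extend`, `address` and `locate` are hypothetical.

```python
from bisect import bisect_right
from math import prod


class ExtendibleChunkMap:
    """Illustrative map from k-dimensional chunk coordinates to linear addresses."""

    def __init__(self, initial_bounds):
        self.bounds = list(initial_bounds)  # current bounds, counted in chunks
        k = len(self.bounds)
        # One history list per dimension. Each record is
        # (start_index, start_address, bounds_at_extension).
        # Address -1 marks a placeholder that is never selected.
        self.history = [[(0, -1, None)] for _ in range(k)]
        # Assumption: the initial allocation counts as an extension of dimension 0.
        self.history[0] = [(0, 0, tuple(self.bounds))]
        self.size = prod(self.bounds)

    def extend(self, dim, amount):
        """Append `amount` (> 0) slabs of chunks along `dim` at the end of storage."""
        if amount <= 0:
            raise ValueError("amount must be positive")
        self.history[dim].append((self.bounds[dim], self.size, tuple(self.bounds)))
        slab = prod(b for j, b in enumerate(self.bounds) if j != dim)
        self.size += amount * slab
        self.bounds[dim] += amount

    def address(self, index):
        """Return the linear address of the chunk at `index`."""
        if any(not 0 <= i < b for i, b in zip(index, self.bounds)):
            raise IndexError(index)
        best_dim, best = 0, None
        for d, i in enumerate(index):
            # Latest extension along d whose start index does not exceed i.
            starts = [r[0] for r in self.history[d]]
            rec = self.history[d][bisect_right(starts, i) - 1]
            # The chunk lives in the most recently allocated candidate slab.
            if best is None or rec[1] > best[1]:
                best_dim, best = d, rec
        start, base, bounds = best
        # Row-major offset inside the slab, with the extended dimension leading.
        offset = index[best_dim] - start
        for j, i in enumerate(index):
            if j != best_dim:
                offset = offset * bounds[j] + i
        return base + offset


def locate(cmap, chunk_shape, element):
    """Map an element index to (chunk address, row-major offset inside the chunk)."""
    chunk = [e // c for e, c in zip(element, chunk_shape)]
    within = 0
    for e, c in zip(element, chunk_shape):
        within = within * c + e % c
    return cmap.address(chunk), within
```

With initial bounds of 2 x 3 chunks, extending dimension 1 by one chunk and then dimension 0 by two chunks leaves the chunk at (1, 1) at address 4 and places the new chunk at (3, 3) at address 15. Existing chunks are never moved, which is the property the abstract attributes to extendible storage. A real array file would also need to persist the history records alongside the data.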
Files

Original bundle (5 files)
Thesis_TitlePage.pdf (183.5 KB, Adobe Portable Document Format)
ThesisAbstract.pdf (55.27 KB, Adobe Portable Document Format)
GideonNimako_PhDThesis.pdf (4.89 MB, Adobe Portable Document Format)
GideonNimako_PhDFinalSubmissionForm.pdf (11.63 MB, Adobe Portable Document Format)
GideonNimako_PhDCorrectionLIst.pdf (162.63 KB, Adobe Portable Document Format)

License bundle (1 file)
license.txt (1.71 KB, Item-specific license agreed upon to submission)