Digital Preservation: Handling Large Collections. Case Study: Digitizing the Egyptian Press Archive at the Centre for Economic, Judicial, and Social Study and Documentation (CEDEJ)
Alexandria University, Alexandria, Egypt
Managing the digitization of large collections is a challenge not only in terms of quantity, but also in terms of text and material quality, the design of a workflow system to organize the operations, and the handling of metadata. This has been the focus of the Bibliotheca Alexandrina (BA) during its partnership with the Centre for Economic, Judicial, and Social Study and Documentation (CEDEJ) to digitize more than 800,000 pages of press articles dating back to 1976. A collection of this size required a workflow designed to manage it efficiently, which in turn required simultaneous work on four main aspects: data analysis, developing a digitization workflow, implementing and installing the necessary software tools for metadata entry, and publishing the digital archive. This paper presents the workflow system implemented to manage this massive press collection, which has yielded more than 400,000 items to date. It describes the BA’s Digital Assets Factory (DAF), the nucleus of the digitization process, and the tools and stages implemented for ingesting data into the system. The outflow is also discussed in terms of organizing and grouping multipart press clips, as well as reviewing and validating the output. The paper also discusses the challenges of associating the accessible online archive with a powerful search engine supporting multidimensional search.
Digital Assets Factory (DAF), BA