10.25375/uct.12333893.v2 Kerry Jones Kerry Jones Sanjin Muftić Sanjin Muftic Endangered African Languages Featured in a Digital Collection: The Case of the ǂKhomani San | Hugh Brody Collection University of Cape Town 2020 endangered African languages N|uu Kora Khoekhoe digital curation online showcasing heritage knowledge ethics of repositories ǂKhomani San Hugh Brody OpenRefine R data cleanup linked data Python jupyter notebooks Omeka S Heritage and Cultural Conservation Library and Information Studies Language in Culture and Society (Sociolinguistics) Language in Time and Space (incl. Historical Linguistics, Dialectology) 2020-05-28 10:53:10 Presentation https://zivahub.uct.ac.za/articles/presentation/Endangered_African_Languages_Featured_in_a_Digital_Collection_The_Case_of_the_Khomani_San_Hugh_Brody_Collection/12333893 <div><div>Presentation supporting a paper published as part of the Proceedings of the Language Resources and Evaluation Conference (LREC - organized virtually) 2020 first workshop on Resources for African Indigenous Languages (RAIL) on May 16, 2020.</div><div><br></div>The ǂKhomani San | Hugh Brody Collection features the voices and history of indigenous hunter gatherer descendants in three endangered languages namely, Nǀuu, Kora and Khoekhoe as well as a regional dialect of Afrikaans. A large component of this collection is audio-visual (legacy media) recordings of interviews conducted with members of the community by Hugh Brody and his colleagues between 1997 and 2012, referring as far back as the 1800s. The Digital Library Services team at the University of Cape Town aim to showcase the collection digitally on the UCT-wide Digital Collections platform, Ibali which runs on Omeka S. In this presentation we highlight the importance of such a collection in the context of South Africa, and the steps that were taken to prepare the transcripts which were generated from the audiovisual material for publication. We outline our development process in preparing the collection for a linked data online showcase website, from digitisation to repository publishing as well as present some of the challenges in data clean-up, the curation of legacy media, multi-lingual support, and site organisation.<br><div>TOC:</div><div>00:00 | Welcome and Intro<br>05:30 | Overview of Collection and Transcription Process</div><div>09:38 | Digital Curation</div><div>15:40 | Conclusion</div><div>16:37 | References</div><div><br>As the conference was held online the presentation was pre-recorded. This international conference was scheduled to be held in Marseille, but was moved virtually due to the global pandemic</div></div>