10.25375/uct.12333893.v2
Kerry Jones
Kerry
Jones
Sanjin Muftić
Sanjin
Muftic
Endangered African Languages Featured in a Digital Collection: The Case of the ǂKhomani San | Hugh Brody Collection
University of Cape Town
2020
endangered African languages
N|uu
Kora
Khoekhoe
digital curation
online showcasing
heritage knowledge
ethics of repositories
ǂKhomani San
Hugh Brody
OpenRefine
R
data cleanup
linked data
Python
jupyter notebooks
Omeka S
Heritage and Cultural Conservation
Library and Information Studies
Language in Culture and Society (Sociolinguistics)
Language in Time and Space (incl. Historical Linguistics, Dialectology)
2020-05-28 10:53:10
Presentation
https://zivahub.uct.ac.za/articles/presentation/Endangered_African_Languages_Featured_in_a_Digital_Collection_The_Case_of_the_Khomani_San_Hugh_Brody_Collection/12333893
<div><div>Presentation supporting a paper published as part of the Proceedings of the Language Resources and Evaluation Conference (LREC - organized virtually) 2020 first workshop on Resources for African Indigenous Languages (RAIL) on May 16, 2020.</div><div><br></div>The ǂKhomani San | Hugh Brody Collection features the voices and history of indigenous hunter gatherer descendants in three endangered languages namely, Nǀuu, Kora and Khoekhoe as well as a regional dialect of Afrikaans. A large component of this collection is audio-visual (legacy media) recordings of interviews conducted with members of the community by Hugh Brody and his colleagues between 1997 and 2012, referring as far back as the 1800s. The Digital Library Services team at the University of Cape Town aim to showcase the collection digitally on the UCT-wide Digital Collections platform, Ibali which runs on Omeka S. In this presentation we highlight the importance of such a collection in the context of South Africa, and the steps that were taken to prepare the transcripts which were generated from the audiovisual material for publication. We outline our development process in preparing the collection for a linked data online showcase website, from digitisation to repository publishing as well as present some of the challenges in data clean-up, the curation of legacy media, multi-lingual support, and site organisation.<br><div>TOC:</div><div>00:00 | Welcome and Intro<br>05:30 | Overview of Collection and Transcription Process</div><div>09:38 | Digital Curation</div><div>15:40 | Conclusion</div><div>16:37 | References</div><div><br>As the conference was held online the presentation was pre-recorded. This international conference was scheduled to be held in Marseille, but was moved virtually due to the global pandemic</div></div>