1887

n Lexikos - Semi-automatic retrieval of definitional information : a Northern Sotho case study

Volume 14, Issue 1
  • ISSN : 1684-4904
USD

 

Abstract


Korpus-gebaseerde terminologie is tans besig om veld te wen op die internasionale front. Dit is daarom belangrik dat terminoloë wat binne die Suid-Afrikaanse Bantoetale werk, nie net sal kennis neem van hierdie ontwikkeling nie, maar dat hulle ook hierdie neiging sal volg, selfs al het hulle nie dieselfde mate van toegang tot gesofistikeerde rekenaarprogrammatuur nie. Die doel van hierdie artikel is daarom om vas te stel of dit moontlik is om definisie-inligting oor sleutelkonsepte uit ongemerkte, lopende teks te herwin deur bekostigbare en toeganklike sagteware soos WordSmith Tools te gebruik. Ten einde hierdie vraag te beantwoord, is 'n gevallestudie in Noord-Sotho gedoen, met gebruikmaking van teksmateriaal oor die linguistiek as basis vir 'n gespesialiseerde korpus. Sintaktiese en leksikale patrone wat as tekstuele merkers van definisie-inligting dien, word geïdentifiseer en die suksesratio van rekenaarmatige herwinning van definisie-inligting word ontleed en beoordeel. Aandag word ook gegee aan die herwinning van spesifiek konseptuele inligting, wat 'n onverwagse byproduk van die semi-outomatiese herwinning van definisie-inligting is. Ten slotte word geïllustreer hoe definisie-inligting aangewend kan word by die skryf van 'n formele terminologiese definisie.

Corpus-based terminology is currently gaining ground on the international front. It is therefore important that terminologists working on the South African Bantu languages not only take note of this development, but that they should also follow this trend, even if they do not have the same measure of access to highly sophisticated software. The aim of this article is therefore to establish whether it is possible to retrieve definitional information on key concepts from untagged, running text by making use of affordable and easily accessible software such as WordSmith Tools. In order to answer this question, a case study is done in Northern Sotho, using textual material on linguistics as basis for a special field corpus. Syntactic and lexical patterns serving as textual markers of definitional information are identified and the success rate of the computational retrieval of definitional information is analysed and evaluated. Attention is also paid to the retrieval of specifically conceptual information, which turned out to be a fortunate by-product of semi-automatic retrieval of definitional information. Finally, it is illustrated how definitional information retrieved can be utilised in the writing of a formal terminological definition.

Loading full text...

Full text loading...

Loading

Article metrics loading...

/content/lexikos/14/1/EJC60490
2004-01-01
2016-12-11

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error