1887

n Literator : Journal of Literary Criticism, Comparative Linguistics and Literary Studies - Verbal extension sequencing : an examination from a computational perspective

USD

 

Abstract


Leksikale oorvormers maak gebruik van 'n tweevlak-eindigendetoestandnetwerk om tegelykertyd morfologiese analiserings- en morfologiese genereringsherskrywingsreëls te enkodeer. Veelvuldige werkwoordelike ekstensies kan morfologies as 'n geslote morfeemklas geanaliseer word deur van verskillende rekenaarmatige tegnieke gebruik te maak. Die analise van 'n veelvuldige opeenvolging van ekstensies word verwesenlik deur triviale analise, gebaseer op enige kombinasie van lede van die geslote klas, maar dit gee aanleiding tot onnodige oorgenerering van leksikale items, waarvan baie items nie in 'n woordeboek voorkom nie. Indien ekstensiekombinasies beperk sou word in 'n strewe na die verteenwoordiging van voorbeelde wat werklik bestaan, sowel ten opsigte van die moontlike aantal ekstensies in 'n reeks as die relatiewe ordening van die ekstensies, het dit 'n radikale vermindering in die generering van leksikale items tot gevolg, terwyl die vermoë om te analiseer voldoende gehandhaaf word. Die artikel belig besonderhede van triviale analise en 'n benadering wat dramatiese oorgenerering voorkom. Dit is gebaseer op toetsdata wat die moontlike ekstensievolgordes asook die morfofonemiese alternasies van hierdie ekstensies in Noord-Sotho weerspieël. Dit is bekom deur literatuurnavorsing, leksikografiese ondersoeke en die rekenaarmatige morfologiese analise van tekste.

Lexical transducers utilise a two-level finite-state network to simultaneously code morphological analysis and morphological generation rewrite rules. Multiple extensions following the verb root can be morphologically analysed as a closed morpheme class using different computational techniques. Analysis of a multiple extension sequence is achieved by trivial analysis, based on any combination of the closed class members, but this produces unnecessary over-generation of lexical items, many of which may not occur in a lexicon. Limiting the extension combinations, in an attempt to represent examples that may actually exist - in terms of both the possible number of extensions in a sequence and the relative ordering of the extensions - leads to a radical reduction in the generation of lexical items while the ability to analyse adequately is maintained. The article highlights details of an investigation based on both trivial analysis and an approach that prevents dramatic over-generation. The article is based on test data reflecting possible extension sequences and the morphophonemic alternations of these extensions for Northern Sotho, garnered from literature research, lexicographic investigation and the computational morphological analysis of texts.

Loading

Article metrics loading...

/content/literat/29/1/EJC62007
2008-04-01
2016-12-08
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error