Savremeni jezički korpusi na Zapadnom Balkanu – Istorijat, trenutno stanje i budućnost (Language Corpora in the West Balkans – History, Current State and Future Perspective)

Slavistična revija, No. 60, Vol. 4, pp. 677–692 (2012)

16 Pages Posted: 15 Aug 2013 Last revised: 6 Sep 2013

See all articles by Nikola Dobric

Nikola Dobric

Alpen-Adria-University Klagenfurt - Institut für Anglistik und Amerikanistik

Date Written: 2012

Abstract

Zapadni Balkan ima bogatu istoriju konstrukcije jezičkih korpusa. Prvi elektronski korpus u regionu je konstruisan samo nekoliko godina posle prvog elektronskog korpusa u svetu, dok se ideja razvitka elektronskih jezičkih resursa razvila na ovim prostorima još ranije. Ovakav rani razvitak obrade prirodnog jezika je donekle usporen (negde i skoro zaustavljen) nesretnim događajima devedetih godina prošlog veka. Na sreću, protekle dve dekade bile su obeležene značajnim napretkom u razvoju korpusa zapadno-balkanskih jezika. Ovaj članak prvo daje istorijski pregled razvitka jezičkih korpusa i korpusne lingvistike u regionu u periodu između 1950. i 1990. godine, kao i trenutno stanje i buduću perspektivu.

The paper in essence looks at the current available corpora of the West Balkan languages and what they have to offer to researchers. The current state is put into focus by a detailed outline of the history of the development of corpora in this region, starting with the very first electronic corpora in the 1960s and following their common development until 1990 (being that all of the languages and countries understood as West Balkans belonged to the same country in this period). The paper also follows the beginnings of their individual development in the last two decades and emphasizes the importance of the international projects that helped reform the technological resources necessary for the construction of contemporary corpora. The conclusions that impose themselves say volumes about the amount of work some of the countries involved still need to invest in order to reach both world and regional standards in the construction of corpora and they also point out the need for a renewal of regional cooperation that was so fruitful in early years of corpus linguistics in the West Balkans.

Note: Downloadable document is in Serbian.

Keywords: corpora, West Balkans, history, overview, language resources, natural language processing

Suggested Citation

Dobric, Nikola, Savremeni jezički korpusi na Zapadnom Balkanu – Istorijat, trenutno stanje i budućnost (Language Corpora in the West Balkans – History, Current State and Future Perspective) (2012). Slavistična revija, No. 60, Vol. 4, pp. 677–692 (2012). Available at SSRN: https://ssrn.com/abstract=2309948

Nikola Dobric (Contact Author)

Alpen-Adria-University Klagenfurt - Institut für Anglistik und Amerikanistik ( email )

Universitätsstrasse 65-67
Klagenfurt, Corinthia 9020
Austria

HOME PAGE: http://www.uni-klu.ac.at/iaa/inhalt/2512.htm

Register to save articles to
your library

Register

Paper statistics

Downloads
13
Abstract Views
160
PlumX Metrics