본문 바로가기

research tips

The MULINCO corpus and corpus platform

http://gandalf.aksis.uib.no/lrec2006/pdf/480_pdf.pdf

Bente Maegaard (1), Lene Offersgaard (1), Lina Henriksen (1), Hanne Jansen (2), Xavier
Lepetit (2), Costanza Navarretta (1), Claus Povlsen (1)
(1) Center for Sprogteknologi, University of Copenhagen
(2) Department of English, Germanic and Romance Studies, University of Copenhagen
Njalsgade 80, DK-2300 Copenhagen S, Denmark
{bente, loff, lina, costanza, claus}@cst.dk, {hanjan, lepetit}@hum.ku.dk


Abstract
The MULINCO project (MUltiLINgual Corpus of the University of Copenhagen) started early 2005. The purpose of this crossdisciplinary project is to create a corpus platform for education and research in monolingual and translation studies. The project covers two main types of corpus texts: literary and non-literary. The platform is being developed using available tools as far as possible, and integrating them in a very open architecture. In this paper we describe the current status and future developments of both the text and tool side of the corpus platform, and we show some examples of student exercises and linguistic investigations taking advantage of tagged and aligned texts.