parallel corpus 썸네일형 리스트형 Bitext Alignment Bitext Alignment Synthesis Lectures on Human Language Technologies May 2011, 165 pages, (doi:10.2200/S00367ED1V01Y201106HLT014) Jörg Tiedemann Uppsala University Abstract This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granula.. 더보기 the open parallel corpus ... the open parallel corpus http://opus.lingfil.uu.se/ OPUS is a growing collection of translated texts from the web. In the OPUS project we try to convert and align free online data, to add linguistic annotation, and to provide the community with a publicly available parallel corpus. OPUS is based on open source products and the corpus is also delivered as an open content package. We used seve.. 더보기 Reuters Corpora http://corpus.leeds.ac.uk/ Reuters Corpora (RCV1, RCV2, TRC2) In 2000, Reuters Ltd made available a large collection of Reuters News stories for use in research and development of natural language processing, information retrieval, and machine learning systems. This corpus, known as "Reuters Corpus, Volume 1" or RCV1, is significantly larger than the older, well-known Reuters-21578 collection he.. 더보기 이전 1 다음