LOB语料库

说明:引用此文请注明出处,并务请保留后面的有效链接地址,谢谢!http://www.yygrammar.com/Article/201211/3104.html

LOB语料库

创建时间:1970年代初

创建单位:英国Lancaster大学和挪威Oslo大学以及Bergen大学

规模层级:100万词次

基本情况:研究当代英国英语,与美国英语对比,使用了TAGIT系统,以统计方式建立换算几率矩阵,提高标注正确率。

The Lancaster-Oslo/Bergen Corpus (LOB) was compiled by researchers inLancaster,OsloandBergen. It consists of one million words of British English texts from 1961. The texts for the corpus were sampled from 15 different text categories. Each text is just over 2,000 words long (longer texts have been cut at the first sentence boundary after 2,000 words) and the number of texts in each category varies (see table below). Further information about the texts can be found in the LOB manual (external link).

This corpus is the British counterpart of the Brown Corpus of American English, which contains texts printed in the same year so that comparison between both varieties could be made.

查询地址:http://icame.uib.no/lob/lob-dir.htm

引用地址:http://www.yygrammar.com/Article/201211/3104.html

(0)

相关推荐