Created: early 1970s
Size tier: 1 million tokens
The Lancaster-Oslo/Bergen Corpus (LOB) was compiled by researchers in Lancaster, Oslo and Bergen. It consists of one million words of British English texts from 1961. The texts for the corpus were sampled from 15 different text categories. Each text is just over 2,000 words long (longer texts have been cut at the first sentence boundary after 2,000 words), and the number of texts in each category varies (see table below). Further information about the texts can be found in the LOB manual (external l
This corpus is the British counterpart of the Brown Corpus of American English, which contains texts printed in the same year so that comparison bet