A fundamental question in both books is how. Compare genres, dialects, time periods Search by pos, collocates, synonyms, and much more. 到目前为止,在语言学习方面最广泛使用的语料库是COCA (the Corpus of Contemporary American English)。 COCA是唯一一个庞大、新近且体裁均衡的语料库。 拥有体裁均衡的语料库是极为重要的,因为语言学习者常常不知道一个单词或短语在母语者听来是否过于正式或非正式。 By far, the most widely used corpus for language learning is coca (the corpus of contemporary american english) Academic in the recent book doing linguistics with a corpus (2020), jesse egbert, tove larsson, and douglas biber (hereafter elb 2020) spend about 12% of their book (9 of its 73 pages
And then jesse egbert and douglas biber (this time with bethany gray) follow that book with the 2022 book designing and evaluating language corpora. 而研究人员也可在如COCA(1990-2019)、 在线新闻语料库(NOW)(2010-2020)和冠状病毒语料库(Coronavirus)(2020)几个库中关注近期的语言变化。后两者每晚都会更新数百万词的数据。总的来说,这些语料库有数十亿词, 库容大多是同类历时语料库的50-100倍. Hundreds of thousands of researchers, teachers, and students have found the data from coca to be more reliable and useful than that of any other corpus. The corpora are used by more than 130,000 people each month, from more than 140 countries In addition, hundreds of universities worldwide have academic licenses, which provide their users with expanded access to the corpora. Note that the cost for an academic license for the english corpora is much less than for many other corpora
A number of examples also come from coha (historical), glowbe (dialects), and now (very large and recent) But all of the information in these help.
WATCH