As we can see in fig.1 and fig.2 the concept "altruism" is less popular in Russian language, there are 1047 mentions of the mercy and only 309 mentions of altruism in corpora
. The statistical analysis showed that the differences in use of these concepts have statistical significance: Pearson Chi-Square p < 0,000 (see Table 1).
The research questions posed in the introduction to this study present two major foci, firstly, to measure the influence of resorting to a corpus-based learning platform on the use of legal terminology by ESAP learners and, secondly, to try to access the pragmatic level of two learner corpora
through the analysis of meta-discourse markers.
One reason is that teachers and learners do not adopt the ideology and technique of corpora
From pedagogically relevant corpora
to authentic language learning contents.
The next section examines five corpora
of understudied languages from Africa and Eurasia: firstly, the non-annotated corpora
of Assamese and Ndebele (2.1), and then the annotated corpora
of Ossetic, Bambara and Kalmyk (2.2).
We compare those numbers to the ones obtained on the Croatian, Bosnian and Serbian domains , showing that the second versions of the corpora
(hrWaC and slWaC), which merge two crawls obtained with different tools and were collected three years apart, show a smaller level of reduction (around 30%) at each step of near-duplicate removal, while the first versions of corpora
(bsWaC and srWaC), obtained with SpiderLing only and in one crawl, suffer more data loss in this process (around 35-40%).
In the penultimate chapter, 'Computational Challenges, Innovations, and Future of Scottish Corpora
', David Beavan provides an extremely useful technical overview of recent developments.
and Workplace Discourse", Almut Koester explores the characteristics of workplace discourse occurring in professional and institutional contexts, where the author shows how language use in the workplace exhibits lexico-grammar as well as pragmatic features, which make it distinct from everyday discourse.
COHA and COCA are the flagship corpora
from Davies' BYU project, but there are several other members in its complete corpus cornucopia that are based on different collections of texts.
We have found that not all of the corpora
installable are tagged.
A standalone corpus processing system is required when analyzing corpora
in text formats.
Moskowich thoroughly justifies the importance of the compilation of CETA, arguing that it fills "a gap left by other historical corpora