Corpus Linguistics

Corpus linguistics is a set of data-driven, empirical methods and techniques for the study and analysis of language. This course introduces corpus linguistics, the main types of corpora, and the considerations and methods involved in building and using them. The course balances theory and practice: we put the theoretical guidelines into practice by, for example, building and annotating small corpora and using concordance systems and Corpus Query Language. By surveying the methods, tools, and software of corpus-based linguistics, the course ultimately aims to enable participants to conduct corpus-based investigations.

The course material, including information on assessment and grading, can be downloaded from here.

This page last edited on 19 January 2018.

