This volume provides an up-to-date survey of the field of corpus linguistics, a field whose methodology has revolutionized much of the empirical work done in most fields of linguistic study over the past decade.
Corpus linguistics investigates human language by starting out from large collections of texts - spoken, written, or recorded. These language corpora, which are now regularly available in electronic form, are the basis for quantitative and qualitative research on almost any question of linguistic interest. Many techniques that are in use in corpus linguistics today are rooted in the tradition of the late 18th and 19th century, when linguistics began to make use of mathematical and empirical methods. Modern corpus linguistics has used and developed these methods in close connection with computer science and computational linguistics.