next up previous contents
Next: Recommended reading Up: Introduction Previous: Introduction

Aims of the book

This book has three main aims: familiarity with tools and techniques for handling text corpora, knowledge of the characteristics of some of the available corpora, and a secure grasp of the fundamentals of statistical natural language processing. Specific objectives include:
1.
grounding in the use of UNIX corpus tools.
2.
understanding of probability and information theory as they have been applied to computational linguistics.
3.
knowledge of fundamental techniques of probabilistic language modelling.
4.
experience of implementation techniques for corpus tools.

We believe that practical application of the techniques is essential for a clear understanding of what is going on, so provide exercises which will allow you to test your understanding and abilities.



Chris Brew
8/7/1998