LN4003: Programming for Corpus Linguists

Module leader: Dr. Constantin Orasan
Semester: 1

The module will enable students to acquire basic and intermediate concepts of computer science and programming in Python. The module is intended for linguists and other non-computer-scientists who have no programming experience, but computer scientists interested in learning Python and how it can be used in corpus linguistics can also benefit from it. Its special focus will be on Python and how it can be used to solve problems from corpus linguistics. Topics to be covered include: how to analyse the problem to be solved, fundamental data types, control structures, functions, regular expressions, simple tokenization, arrays, dictionaries, files, and corpora. Laboratory sessions will give participants hands-on experience in writing simple Python programs. Formative assessment involving a mock in-class test will allow students to get feedback on their work before the final summative assessment, as well as gain an insight into the way the assessment will be structured.

At the completion of the module, the student is expected to be able to:

Recommended reading for this module

Electronic resources for this module

(c) 2006 - 2012 Research Group in Computational Linguistics
Last modified: June 22 2010