Corpus Linguistics is the use of digitized text (usually samples of natural language use) for research in linguistics and other disciplines that rely on textual language data. Now that enormous amounts of text are available, along with the computational power to process them easily, Corpus Linguistics has become one of the most widespread methods of linguistic investigation in recent years and is increasingly used beyond linguistics. This area of research requires a diverse range of skills, most notably computer programming and statistical analysis. This course focuses on corpus building, on the formulation of research questions that can be answered with corpora, and on the extraction, summarization, analysis, and visualization of corpus information. This course is dual-listed with LIN 514.