Corpus based linguistic researches

Course Overview

It is known that Corpus linguistics is the study of language based on large collections of "real life" language use stored in corpora (or corpuses)—computerized databases created for linguistic research. It is also known as corpus-based studies.

This course introduces students to language corpora as a resource for linguistic analysis. The course presents a survey of existing language corpora and teaches the use of variety of corpus tools. Students learn the central concepts and methods of corpus linguistics by performing a number of investigations in the field of philology by comparatively different languages (English, Turkish, Uzbek and etc.)

Learning Outcomes

  • display familiarity with a variety of synchronic and diachronic English-language corpora
  • demonstrate an understanding of the considerations most central to the process of corpus compilation
  • demonstrate an understanding of important concepts in corpus-based research
  • use a variety of corpus tools to perform searches of spoken and written language data
  • employ both qualitative and quantitative methods of analysis
  • carry out small-scale corpus-based research projects on the lexis and grammar of English
  • write research reports presenting original findings in correct and genre-appropriate prose, using well-chosen methods of data presentation

Course Content

The course comprises the following topics:

  1. Corpus linguistics and corpus types of the world languages
  2. Corpus based lexicography
  3. Corpus based teaching as a second/foreign language
  4. Corpus based translation studies
  5. Corpus as a tool in applied linguistics (as example media linguistics and sociolinguistics)
  6. Corpus analysis in Computational linguistics
  7. Corpus based learning lexicogrammar (morphology, syntax and semantics)
  8. Corpus based analysis in Comparativistics (Comparative linguistics) ( as example Uzbek, Spanish, English)

Instructional Method

Group sessions. Instruction, and class discussions are in English. All teaching materials are in English. Active participation in course seminars is obligatory. In case of absence, students will be given the opportunity to complete an extra assignment within the framework of the course period.

Required Course Materials

The classroom is provided computer and projector for presentation.

Main textbook:
[1] Corpus-Based Language Studies: An Advanced Resource Book. Tony McEnery et al. Routledge, 2006.
*See References section below for articles covered in the B and C units of this book.

Supplementary books (a couple chapters will be used):
[2] Corpora in Applied Linguistics. Susan Hunston. Cambridge, 2002.
      Ch.6,7,8 Corpora and language teaching
[3] From Corpus to Classroom. O'Keeffe et al, Cambridge, 2007.
[4] Corpus Linguistics. McEnery & Wilson, Edinburgh Univ.
Press, 2001

 

Assessment

The purpose of the evaluation is to provide feedback to students on their learning, as well as to permit the instructor to assign a fair grade at the end of the course. Providing explicit information about assignments and grading procedures will clarify expectations and allay student anxiety, while supporting students in pacing their studies, gauging their progress, and achieving learning outcomes. )

Students are examined in the form of oral presentations, a written exam, and a written home essay. Grades are fail, pass, or pass with distinction. The presentation or essay might be a survey of two corpora of students’ own choice or teacher’s suggestion.