Review:

Corpora Such As The Corpus Of Historical American English (coha)

overall review score: 4.5
score is between 0 and 5
The Corpus of Historical American English (COHA) is a large and comprehensive digital collection of written texts that span from the 1810s to the 2000s. It provides researchers with access to a diachronic corpus of American English, enabling detailed linguistic, cultural, and historical analyses of language change over nearly two centuries.

Key Features

  • Extensive temporal coverage from the 1810s to the present
  • Contains over 400 million words across a wide range of genres including fiction, newspapers, magazines, and academic texts
  • Structured into decade-based sub-corpora for year-by-year or period-specific analysis
  • Facilitates research in diachronic linguistics, lexicography, cultural studies, and more
  • Publicly accessible via online interfaces and data repositories for academic use

Pros

  • Provides invaluable insights into historical language usage and evolution
  • Rich, diverse datasets support detailed linguistic research
  • User-friendly interface for accessing and querying data
  • Supports multiple scholarly disciplines beyond linguistics

Cons

  • Limited to written texts, excluding spoken language or conversational speech
  • Requires some familiarity with corpus linguistics to maximize utility
  • Potential gaps in the earliest periods due to available data sources
  • Access may be constrained by institutional subscriptions or availability constraints

External Links

Related Items

Last updated: Thu, May 7, 2026, 02:58:19 AM UTC