Corpus of Historical American English (COHA)

(Entry based on information on the corpus website and on

The corpus contains more than 400 million words of text in more than 100,000 individual texts. It is 100 times as large as any other structured corpus of historical English, and it is balanced in each decade across fiction, popular magazines, newspapers, and academic texts. As a result, it allows researchers to examine a wide range of changes in English with much more accuracy and detail than any other available corpus permits.

Project leader: Mark Davies

Language: American English

Size: 400 million words

Period: 1810-2009

Released: 2010

Project home page:

Funding: US National Endowment for the Humanities

Reference line and copyright

Davies, Mark. (2010-) The Corpus of Historical American English: 400 million words, 1810-2009.


Available online at



Mark Davies

Associated projects

The Corpus of Contemporary American English (COCA)

Time Corpus