A Corpus of English Life Sciences Texts (CELiST)

The Corpus of English Life Sciences Texts (CELiST) is the third sub-corpus of the Coruña Corpus of English Scientific Writing (CC), currently under construction. It is intended to complement the other sub-corpora in the series: like CETA, it belongs to the Hard Sciences, so it can be used in comparative studies with CEPhiT (in the process of being published) and other sub-corpora containing texts from the Humanities. Like the other corpora in the Coruña Corpus family, it is designed for the description of English scientific writing between 1700 and 1900, focusing on the language of disciplines such as Biology, Zoology, Botany and other fields that did not constitute separate disciplines during the late Modern English period. Each text file is accompanied by a metadata file containing information about the sampled text and its author’s sociolinguistic background. Metadata files are also used to select the texts to work with through the Coruña Corpus Tool (CCT).

Compilers: Members of MUSTE Research Group
Time of compilation: 2009 -
Period: 1700-1900
Size: extracts from 40 works
Language: LModE (scientific)
Project home page: www.udc.es/grupos/muste

Associated projects

CELiST is part of the Coruña Corpus of English Scientific Writing (CC) and is associated with its other sub-corpora.