Corpus Linguistics
Tutorials
Corpora
English Corpora
German Corpora
More Languages
Spoken Corpora
Learner Corpora
ICE
Corpora
Parallel Corpora
Historical Corpora
Treebanks
Text
Archives
Alphabetical List
Software
CL in Applied Linguistics

You are now in section > Corpora > Historical Corpora

The Brooklyn-Geneva-Amsterdam-Helsinki Parsed Corpus of Old English

Org:  Joint project of five linguists
Time: /

Size:

106,210 words
Contents: selection of texts from the Old English Section of the Helsinki Corpus of English Texts

Access:

free access for research purposes possible; fill out request form
Notes: annotated to facilitate searches on lexical items and syntactic structure

CME - Corpus of Middle English Prose and Verse

Org:  HTI - University of Michigan, U.S.
Time: Middle English (16th/17th century)

Size:

61 Texts
Contents: Collection of Middle English texts provided by the University of Michigan and the Oxford Text Archive

Access:

Free; Search possible in individual or groups of books; Conduct simple/boolean/proximity searches
Notes: SGML Markup according to the TEI guidelines

 

Lampeter Corpus of Early Modern English Tracts

Org:  REAL Centre at the University of Chemnitz, Germany
Time: 1640 to 1740

Size:

1.1 mio words; 120 texts
Contents: "The Lampeter Corpus of Early Modern English Tracts is a collection of texts on various subject matter published between 1640 and 1740 - a time that is marked by the rise of mass publication, the development of a public discourse in many areas of everyday life and, last but not least, the standardisation of British English."

Access:

Available for scholarly research free of charge
Notes: SGML Markup according to the TEI guidelines

  

MEMEM - Michigan Early Modern English Materials

Org:  HTI - University of Michigan, U.S.
Time: 1970s

Size:

50,000 records
Contents: "The Materials consist of citations collected for the modal verbs and certain other English words for the Early Modern English Dictionary. "

Access:

Free: download via FTP
Notes: A DTD and a character DTD is also available for download 

 

PPCME1 Penn-Helsinki Parsed Corpus of Middle English

Org:  University of Pennsylvania, U.S.
Time: Middle English

Size:

510,000 mio words 
Contents: "Syntactically annotated corpus of the Middle English prose samples in the Helsinki Corpus of Historical English"

Access:

Free: for download click here 
Notes:

PPCME2 Penn-Helsinki Parsed Corpus of Middle English

Org:  University of Pennsylvania, U.S.
Time: Middle English

Size:

1.3 mio words (55 text samples each as a text file, as a POS tagged file and a parsed file)  
Contents: collection of prose texts in Middle English

Access:

Registered Users only; Corpus and CorpusSearch (tool) available on CD-ROM
Notes:

 

Helsinki Corpus (Diachronic Part)

Org:  University of Helsinki, Finland
Time: 750 to ca. 1700

Size:

ca. 1.5 mio words 
Contents: texts from Old, Middle and Early Modern English

Access:

At the moment the different versions of the Helsinki Corpus are distributed by the HIT Centre, the OTA and on the ICAME CD-ROM 
Notes:

 

You are now in section  > Corpora > Historical Corpora

Data-driven learning
Virtual Resources
Bibliography
Email
About

webmaster@corpus-linguistics.de