Corpus Linguistics
Tutorials
Corpora
English Corpora
German Corpora
More Languages
Spoken Corpora
Learner Corpora
ICE
Corpora
Parallel Corpora
Historical Corpora
Treebanks
Text
Archives
Alphabetical List
Software
CL in Applied Linguistics

You are now in section > Corpora> German Corpora

COSMAS 1 Corpus Storage, Maintenance and Access System

Org:  Hosted at the IDS, Mannheim, Germany
Time: COSMAS is available since 1992 and is under constant development

Size:

ca. 1080 mio words; 653 mio publicly accessible
Contents: COSMAS hosts a great variety of corpora, click here for a list.

Access:

 via public www access
Notes: The public access to COSMAS provides great features and is very flexible. Definitely worth checking out

COSMAS II has been active since March 2000. Check it out

LIMAS - Linguistik und Maschinelle Sprachbearbeitung

Org:  Forschungsgruppe LIMAS (Bonn, Regensburg)
Time: 1970 and 1971

Size:

>1 mio
Contents: various texts from 33 different areas

Access:

free access online
Notes:  

Negr@ A Syntactically Annotated Corpus of German Newspaper Texts

Org:  Saarland University, Germany
Time: 1990s

Size:

176,000 tokens (10,000 sentences) 
Contents: German newspaper text,

Access:

free access for scientific use
Notes: POS tagged and syntactically annotated;

 

You are now in section > Corpora> German Corpora

Data-driven learning
Virtual Resources
Bibliography
Email
About

webmaster@corpus-linguistics.de