Europarl

Full Name
European Parliament Proceedings Parallel Corpus
Composer
Philipp Koehn (University of Edinburgh)
Language
Danish
Dutch
English
French
German
Italian
Portuguese
Spanish
Swedish
Multilingual type
Sentence-aligned parallel
Register
Written
Genre
Legislative
Style
Formal
Period
1900-2000 AD
2000-2100 AD
Period (details)
1996-2011
Number of words
10,000,000 - 100,000,000
Number of words (details)
Around 60 million words per language
Annotation
Tokenization
Format
Download
Availability
Open access