Corpus ÉMA, écrits scolaires

Full Name
Corpus ÉMA, écrits scolaires
Composer
Catherine Boré - Marie-Noëlle Roubaud - Marie-Laure Elalouf
Language
French
Register
Written
Period
2000-2100 AD
Period (details)
2015 - ...
Number of words
< 500,000
Annotation
Other
Annotation remarks

The transcriptions of the written texts are complemented by corrections of the spelling mistakes that these French pupils made when writing texts and answering questions.

Format
Online
Data collection
Elicited
Semi-elicited
Spontaneous
Availability
Open access
Remarks

Corpus ÉMA, écrits scolaires is the first compilation of a large corpus of written school texts. Its purpose is to evaluate the knowledge of written language among pupils in primary school and the first year of secondary school. Collection started in 2015 because of the renewal of the educational programs. The corpus contains two files. The first is devoted to the writing of argumentative texts in a class covering the third and fourth years of primary school; four types of texts were used. The second is devoted to the writing of narrative texts in combination with the reading of a picture book in a class covering the first and second years of primary school; two types of texts were used.

Each file contains, for every pupil in each class:

  • scans of the pupil's writings (anonymized)
  • their transcription (raw data)
  • annotation of the texts (raw data)
  • metadata in PDF (recording all the information about the school, teachers, class, remarks, ...)