PRESEEA

Full Name
Proyecto para el Estudio Sociolingüístico del Español de España y de América [PRESEEA]
Composer
Coordination: F. Moreno Fernández (Universidad de Alcalá)
Language
Spanish
Iberian Spanish
Latin American Spanish
Register
Spoken
Genre
Conversation
Style
Formal and Informal
Period
2000-2100 AD
Number of words
2.000.000 - 10.000.000
Number of words (details)
Goal of 10 million words
Annotation remarks

The corpus includes semi-structured conversations based on thematic modules (the family, the economy, etc.). The transcriptions are orthographic realized in SGML4 system and follow the standards TEI and include oral features (repetitions, silences, hesitations, shift changes, etc.) and extra-linguistic features (laughter, noise, etc.).

Format
Online
Data collection
Semi-elicited
Multimedia
Transcription only
Availability
Open access
In preparation