Val.Es.Co

Full Name
Valencia.EspaƱol.Coloquial
Composer
Grupo Val.Es.Co. Coordination: A. Briz
Language
Spanish
Iberian Spanish
Language (details)
Spanish of Valencia
Register
Spoken
Genre
Speech
Style
Informal
Period
2000-2100 AD
Number of words
< 500.000
Number of words (details)
120,000 words as of 2014
Annotation
Lemmatisation
POS tagging
Prosodic annotation
Annotation remarks

The corpus Val.Es.Co includes free and informal conversations secretly recorded, in addition to other oral genres (telephone recordings, radio, television, etc.). In addition to documenting the spontaneous colloquial speech of Valencia, it aims to facilitate the study of the structure of the conversation and its units. To this end, it applies a uniform system of very detailed transcripts (with indications of overlapping, alternating turns, pauses, intonation, and annotations with extra- and paralinguistic information as laughter, hesitations, coughing, etc.). The current virtual platform, still in beta, gives access to 46 transcribed conversations (over 120,000 words). Other parts of the corpus are in preparation. The website allows conversations read in full, leaving files exported to Word, Excel or XML. You can also search relevant parts of the body through an advanced search engine. Three types of consultation are planned: (a) search for interventions (filtering according to the characteristics of the speaker: gender, age, profession, language); (B)
Search for intonational group; (C) word search. For the latter option there is lemmatization and POS tagging program by Freeling, although labeling is still under review. To access the audio you should contact the research group.

Format
Download
Data collection
Spontaneous
Availability
Open access