Opcije pristupačnosti Pristupačnost

Publication of the C-ORAL-IC corpus...

The Centre for Research on the Linguistic and Cultural Heritage of Istria is pleased to inform the scientific community and the wider public that the C-ORAL-IC corpus (Corpus of Oral Istrovenetian and Croatian) has been successfully completed and published in the international research repository TalkBank, one of the world’s leading infrastructures for the study of spoken language, language development and discourse. The corpus was developed within the framework of the Croatian Science Foundation (HRZZ) project Multidisciplinary Approaches to Linguistic and Cultural Heritage (MULTIDIS).

C-ORAL-IC constitutes the first systematically designed, computationally processed and publicly accessible corpus of spontaneous speech documenting the complex sociolinguistic landscape of the Istrian region, with particular emphasis on contact between the Istrovenetian dialect and the Croatian language. The corpus is the outcome of several years of fieldwork, transcription and analytical work carried out by members and collaborators of the Centre, and is grounded in contemporary methodological principles of corpus linguistics, sociolinguistics and contact linguistics, in line with the research objectives of the MULTIDIS project.

The distinctive value of C-ORAL-IC lies in its foundation on authentic spoken language data, collected in natural communicative settings and encompassing a wide range of discourse genres, age groups and communicative contexts. Transcription was carried out in accordance with the standardized CHAT (Codes for the Human Analysis of Transcripts) system, while data analysis is conducted using the CLAN (Computerized Language Analysis) software package. This methodological framework ensures interoperability with other international corpora and enables advanced quantitative and qualitative analyses. The availability of the corpus opens new avenues for research into (Croatian–Italian) bilingual discourse, code-switching, and (socio)linguistic and pragmalinguistic variation.

The C-ORAL-IC corpus is available in open access on the TalkBank platform:
https://biling.talkbank.org/access/C-ORAL-IC.html

Poropat Jeletić, N., Moscarda Mirković, E.; Hržica, G. – BilingBank C-ORAL-IC Corpus, 2024, doi:10.21415/PAZ2-EP87

Project financed by the Croatian Science Foundation (HRZZ)
Project title: Multidisciplinary Approaches to Linguistic and Cultural Heritage (MULTIDIS) / A multi-level approach to spoken discourse in language development
http://multidis.erf.hr
 

Principal Investigator: Assist. Prof. Gordana Hržica, PhD
Project start date: 01/01/2018
Project end date: 31/12/2022
Project number: UIP-2017-05-6603
Host institution: Faculty of Education and Rehabilitation Sciences, University of Zagreb
Scientific area: Social Sciences
Scientific field: Speech and Language Pathology

News list