Upcoming events
Event Information:
-
Thu13Jun20242:00 pmLokaal 3.30 - Camelot, Blandijn, Campus Boekentoren
Giuseppe Magistro (UGent) - "Creating a corpus of web-data with Pyrlato. A demonstration"
Show contentThe use of corpora in acoustic analyses has become a standard practice in phonetic phonological research, offering high ecological validity (see e.g. Beckman, 1997; Warner, 2012; Tucker & Mukai, 2023 for a discussion on validity). However, compiling corpora and looking for specific phenomena can be time and resource-consuming. In response to this challenge, we developed a program named Pyrlato, which we aim to demonstrate. Pyrlato is a novel tool designed for creating corpora of real-world spoken data from the web. The tool extracts audio files from YouTube, cutting and extracting desired segments such as specific phonemes, syllables, or words found in YouTube videos. This enables the creation of corpora with tens of thousands of tokens within a few computational hours. Pyrlato works across Dutch, English, French, German, Indonesian, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Turkish, Ukrainian, and Vietnamese, i.e. those languages for which YouTube provides automatic subtitles. The software searches for the desired string in the subtitles and, upon finding the match, extracts the relevant audio extract containing the string in .mp3 format (other formats are also possible).
The demonstration will showcase Pyrlato's online version and the application of some case studies.
• Beckman, M.E. (1997).A typology of spontaneous speech. In Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), Computing Prosody: Computational Models for Processing Spontaneous Speech (pp. 7–26). Springer. http://dx.doi.org/10.1007/978-1-4612-2258-3_2.
• Tucker, B.V., & Mukai, Y. (2023). Spontaneous speech. Cambridge University Press. http://doi.org/10.1017/9781108943024.
• Warner, N. (2012). Methods for studying spontaneous speech. In A. Cohn, C. Fougeron, & M. Huffman (Eds.), The Oxford Handbook of Laboratory Phonology (pp. 621–633). Oxford University Press.
Past events
Event Information:
-
Tue09Dec202511:00 amBlandijn, Faculteitszaal
Gina Saviano (Ghent University) – "Mapping the Regional Linguistic Space between Italian and Dialect: A Computational Analysis of Phonetic Features in Neapolitan"
Show contentAbstract
“Continuum con addensamenti”, ‘continuum with agglomerations’: this is how Berruto once described the Italian linguistic repertoire. Much has been discussed about the extremes of this continuum; however, little do we know about those intermediate “agglomerations”. How many are there? How are they organized? Do they share features? To address these questions, we adopt a phonetic perspective and borrow a technique typically used in commercial profiling. Examining established Neapolitan phonetic features alongside new prosodic parameters, we identify possible speaker profiles and features agglomerations, shedding a new light on the intermediate varieties of the Italian standard-dialect continuum. In this talk, I will discuss preliminary studies and findings, applying this innovative methodology to offer new insights into the nature of the concept of variety.