Upcoming events
Event Information:
-
Thu 13 Jun 2024, 2:00 pm, Lokaal 3.30 - Camelot, Blandijn, Campus Boekentoren
Giuseppe Magistro (UGent) - "Creating a corpus of web-data with Pyrlato. A demonstration"
The use of corpora in acoustic analyses has become standard practice in phonetic and phonological research, offering high ecological validity (see e.g. Beckman, 1997; Warner, 2012; Tucker & Mukai, 2023 for a discussion of validity). However, compiling corpora and searching them for specific phenomena can be time- and resource-consuming. In response to this challenge, we developed a program named Pyrlato, which we will demonstrate. Pyrlato is a novel tool for creating corpora of real-world spoken data from the web. It extracts audio from YouTube videos and cuts out the desired segments, such as specific phonemes, syllables, or words. This enables the creation of corpora with tens of thousands of tokens within a few computational hours. Pyrlato works across Dutch, English, French, German, Indonesian, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Turkish, Ukrainian, and Vietnamese, i.e. those languages for which YouTube provides automatic subtitles. The software searches the subtitles for the desired string and, upon finding a match, extracts the audio segment containing the string in .mp3 format (other formats are also possible).
The demonstration will showcase Pyrlato's online version and its application in some case studies.
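The subtitle-matching step described in the abstract can be illustrated with a minimal sketch. This is not Pyrlato's actual code: the `Cue` class, the `find_spans` function, and the `padding` parameter are hypothetical names introduced here, and the sketch assumes the subtitles are already available as timed cues. It only shows how a target string might be mapped to time spans that a separate audio-cutting step could then extract.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One subtitle cue (hypothetical structure): times in seconds plus text."""
    start: float
    end: float
    text: str


def find_spans(cues, target, padding=0.25):
    """Return (start, end) spans of the cues whose text contains `target`.

    Matching is case-insensitive. `padding` (an assumption, in seconds)
    widens each span slightly so the target is not clipped at the edges.
    """
    needle = target.casefold()
    spans = []
    for cue in cues:
        if needle in cue.text.casefold():
            spans.append((max(0.0, cue.start - padding), cue.end + padding))
    return spans


# Toy subtitle track, invented for illustration only.
cues = [
    Cue(0.0, 2.5, "Welcome back to the channel"),
    Cue(2.5, 5.0, "Today we talk about phonetics"),
    Cue(5.0, 8.0, "Phonetics is fun"),
]
spans = find_spans(cues, "phonetics")
print(spans)  # [(2.25, 5.25), (4.75, 8.25)]
```

Each returned span would then be passed to whatever tool cuts the audio; that step is left out because the abstract does not say how Pyrlato performs it.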
• Beckman, M.E. (1997). A typology of spontaneous speech. In Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), Computing Prosody: Computational Models for Processing Spontaneous Speech (pp. 7–26). Springer. http://dx.doi.org/10.1007/978-1-4612-2258-3_2.
• Tucker, B.V., & Mukai, Y. (2023). Spontaneous speech. Cambridge University Press. http://doi.org/10.1017/9781108943024.
• Warner, N. (2012). Methods for studying spontaneous speech. In A. Cohn, C. Fougeron, & M. Huffman (Eds.), The Oxford Handbook of Laboratory Phonology (pp. 621–633). Oxford University Press.
Past events
Fri 03 Oct 2025, 1:30 pm, Blandijn Boekentoren -1.91 (floor minus 1)
Lorenzo Maselli (Ghent University) - "Documenting the implosives and labial-velars of the Ubangi River Basin"
This contribution reports on a recent field mission carried out in the Ubangi River Basin (Central African Republic). Work focused on 31 varieties belonging to the Bantu, Ubangi, and Central Sudanic subfamilies of Niger-Congo and Nilo-Saharan. The primary objective was to document implosive and labial-velar consonants. To this end, acoustic, electroglottographic, and aerodynamic (pneumotachographic) data were collected. In this presentation, I will give a detailed report on the activities carried out during the mission; illustrate the quality and typology of the data, exemplifying the usefulness of integrated phonetic research with instances from Central Sudanic Bagiro; and offer a few general remarks on the merit of instrumental data collection for phonetic typology and phonological theory. The hope is that this will serve as a handy reference for fellow researchers interested in instrumental work in the field.