Events

Upcoming events

Event Information:

  • Thu
    13
    Jun
    2024

    Giuseppe Magistro (UGent) - "Creating a corpus of web-data with Pyrlato. A demonstration"

    2:00 pmLokaal 3.30 - Camelot, Blandijn, Campus Boekentoren

    The use of corpora in acoustic analyses has become a standard practice in phonetic phonological research, offering high ecological validity (see e.g. Beckman, 1997; Warner, 2012; Tucker & Mukai, 2023 for a discussion on validity). However, compiling corpora and looking for specific phenomena can be time and resource-consuming. In response to this challenge, we developed a program named Pyrlato, which we aim to demonstrate. Pyrlato is a novel tool designed for creating corpora of real-world spoken data from the web. The tool extracts audio files from YouTube, cutting and extracting desired segments such as specific phonemes, syllables, or words found in YouTube videos. This enables the creation of corpora with tens of thousands of tokens within a few computational hours. Pyrlato works across Dutch, English, French, German, Indonesian, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Turkish, Ukrainian, and Vietnamese, i.e. those languages for which YouTube provides automatic subtitles. The software searches for the desired string in the subtitles and, upon finding the match, extracts the relevant audio extract containing the string in .mp3 format (other formats are also possible).

    The demonstration will showcase Pyrlato's online version and the application of some case studies.

    • Beckman, M.E. (1997).A typology of spontaneous speech. In Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), Computing Prosody: Computational Models for Processing Spontaneous Speech (pp. 7–26). Springer. http://dx.doi.org/10.1007/978-1-4612-2258-3_2.
    • Tucker, B.V., & Mukai, Y. (2023). Spontaneous speech. Cambridge University Press. http://doi.org/10.1017/9781108943024.
    • Warner, N. (2012). Methods for studying spontaneous speech. In A. Cohn, C. Fougeron, & M. Huffman (Eds.), The Oxford Handbook of Laboratory Phonology (pp. 621–633). Oxford University Press.

     

    Show content

 

Past events

Event Information:

  • Tue
    22
    Oct
    2019
    Fri
    25
    Oct
    2019

    Multilingualism and language varieties in Europe - Autumn School in Medieval Languages and Culture 2019

    8:30 amCamelot room (Blandijnberg 2, 3rd floor)

    The Henri Pirenne Institute for Medieval Studies (HPIMS) is organizing their 2019 Autumn school in Medieval Languages and Culture on the theme of Multilingualism and language varieties in Europe.

    In particular, on Tuesday 22 October, Roger Wright (University of Liverpool) will give a lecture (11.00-12.30) on "The emergence of the vernacular languages in the Middle Ages; Romance in the Iberian Peninsula in the Tenth Century." After lunch, Anna Adamska will give a lecture (15.30-17.00) with the title "A laboratory of multilingual communication? Speaking, writing and reading, in the towns of late medieval East Central Europe."

    For more information, please visit https://www.ugent.be/pirenne/en/news-events/events/sociolinguistics.

    Attendance for individual lectures, as well as the entire Autumn School, is free for UGent members, but registration is required: please inform the organisers of your attendance for specific lectures, or for the Autumn school as a whole, at Martine.DeReu@ugent.be.

    Show content