Events – ΔiaLing

Upcoming events

Event Information:

Thu 13 Jun 2024

Giuseppe Magistro (UGent) - "Creating a corpus of web-data with Pyrlato. A demonstration"
2:00 pm, Lokaal 3.30 - Camelot, Blandijn, Campus Boekentoren
The use of corpora in acoustic analyses has become standard practice in phonetic and phonological research, offering high ecological validity (see e.g. Beckman, 1997; Warner, 2012; Tucker & Mukai, 2023 for a discussion of validity). However, compiling corpora and searching them for specific phenomena can be time- and resource-consuming. In response to this challenge, we developed a program named Pyrlato, which we aim to demonstrate. Pyrlato is a novel tool designed for creating corpora of real-world spoken data from the web. The tool retrieves audio from YouTube videos and cuts out the desired segments, such as specific phonemes, syllables, or words. This enables the creation of corpora with tens of thousands of tokens within a few hours of computation. Pyrlato works across Dutch, English, French, German, Indonesian, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Turkish, Ukrainian, and Vietnamese, i.e. those languages for which YouTube provides automatic subtitles. The software searches the subtitles for the desired string and, upon finding a match, extracts the audio segment containing the string in .mp3 format (other formats are also possible).
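
The subtitle search step described above can be illustrated with a minimal sketch. This is not Pyrlato's actual code: the `Cue` class, the `find_matches` function, the `padding` parameter, and the sample subtitle lines are all hypothetical and chosen only for illustration. The sketch assumes the subtitles have already been parsed into timed cues, and it returns the time windows that a separate audio-cutting step would then extract.

```python
import re
from dataclasses import dataclass


@dataclass
class Cue:
    """One timed subtitle line (hypothetical structure, times in seconds)."""
    start: float
    end: float
    text: str


def find_matches(cues, target, padding=0.25):
    """Return (start, end) windows for every cue whose text contains `target`.

    Matching is case-insensitive and on whole words. Word boundaries are an
    assumption that suits space-delimited languages; languages written
    without spaces would need a different matching strategy.
    """
    pattern = re.compile(r"\b" + re.escape(target) + r"\b", re.IGNORECASE)
    windows = []
    for cue in cues:
        if pattern.search(cue.text):
            # Pad the window slightly, without going below time zero.
            windows.append((max(0.0, cue.start - padding), cue.end + padding))
    return windows


# Made-up subtitle cues, for illustration only.
cues = [
    Cue(0.0, 2.5, "Welcome back to the channel"),
    Cue(2.5, 5.0, "today we talk about corpus linguistics"),
    Cue(5.0, 7.5, "a corpus is a collection of texts"),
]

for start, end in find_matches(cues, "corpus"):
    print(f"cut audio from {start:.2f}s to {end:.2f}s")
```

Each returned window covers a whole subtitle cue, so isolating a single word, syllable, or phoneme inside it would require a further alignment step that this sketch does not attempt.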

The demonstration will showcase Pyrlato's online version and its application to some case studies.

• Beckman, M.E. (1997). A typology of spontaneous speech. In Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), Computing Prosody: Computational Models for Processing Spontaneous Speech (pp. 7–26). Springer. http://dx.doi.org/10.1007/978-1-4612-2258-3_2.
• Tucker, B.V., & Mukai, Y. (2023). Spontaneous speech. Cambridge University Press. http://doi.org/10.1017/9781108943024.
• Warner, N. (2012). Methods for studying spontaneous speech. In A. Cohn, C. Fougeron, & M. Huffman (Eds.), The Oxford Handbook of Laboratory Phonology (pp. 621–633). Oxford University Press.


Past events

Event Information:

Mon 05 Sep 2022 to Fri 09 Sep 2022

14th International Colloquium on Late and Vulgar Latin (Latin vulgaire – latin tardif XIV)
Gent

Please note: the conference has been postponed again due to the continuing uncertainties related to the COVID-19 pandemic and will take place from Monday, September 5th to Friday, September 9th, 2022.

The 14th International Colloquium on Late and Vulgar Latin (Latin vulgaire – latin tardif XIV) will be held at the Faculty of Arts and Philosophy of Ghent University (Belgium) from Monday, September 5th to Friday, September 9th, 2022. It will be organized by the Latin section and the research group DiaLing at the Department of Linguistics, under the auspices of the Comité international pour l'étude du latin vulgaire et tardif (www.unibg.it/lvlt).

The colloquium will be held in English, French, German, Spanish, Italian and Latin. As per tradition, it will be devoted to all linguistic aspects of late, informal, non-standard and colloquial Latin (including the transition from Latin to Romance).

For all further information, please visit the website of the colloquium at https://www.lvlt14.ugent.be. For any additional questions you may have, please contact the organisers at lvlt14@ugent.be.
