Events

Upcoming events

Event Information:

  • Thu
    13
    Jun
    2024

    Giuseppe Magistro (UGent) - "Creating a corpus of web-data with Pyrlato. A demonstration"

    2:00 pmLokaal 3.30 - Camelot, Blandijn, Campus Boekentoren

    The use of corpora in acoustic analyses has become a standard practice in phonetic phonological research, offering high ecological validity (see e.g. Beckman, 1997; Warner, 2012; Tucker & Mukai, 2023 for a discussion on validity). However, compiling corpora and looking for specific phenomena can be time and resource-consuming. In response to this challenge, we developed a program named Pyrlato, which we aim to demonstrate. Pyrlato is a novel tool designed for creating corpora of real-world spoken data from the web. The tool extracts audio files from YouTube, cutting and extracting desired segments such as specific phonemes, syllables, or words found in YouTube videos. This enables the creation of corpora with tens of thousands of tokens within a few computational hours. Pyrlato works across Dutch, English, French, German, Indonesian, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Turkish, Ukrainian, and Vietnamese, i.e. those languages for which YouTube provides automatic subtitles. The software searches for the desired string in the subtitles and, upon finding the match, extracts the relevant audio extract containing the string in .mp3 format (other formats are also possible).

    The demonstration will showcase Pyrlato's online version and the application of some case studies.

    • Beckman, M.E. (1997).A typology of spontaneous speech. In Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), Computing Prosody: Computational Models for Processing Spontaneous Speech (pp. 7–26). Springer. http://dx.doi.org/10.1007/978-1-4612-2258-3_2.
    • Tucker, B.V., & Mukai, Y. (2023). Spontaneous speech. Cambridge University Press. http://doi.org/10.1017/9781108943024.
    • Warner, N. (2012). Methods for studying spontaneous speech. In A. Cohn, C. Fougeron, & M. Huffman (Eds.), The Oxford Handbook of Laboratory Phonology (pp. 621–633). Oxford University Press.

     

    Show content

 

Past events

Event Information:

  • Thu
    15
    Jun
    2017

    Multiple subjects across categories: Evidence from Modern Standard Arabic

    1:00 pmGrote vergaderzaal, 3de verdieping Blandijn

    Dialing talk: Fayssal Tayalati en Lieven Danckaert (Université de Lille 3), "Multiple subjects across categories: Evidence from Modern Standard Arabic"

    In this talk we focus on a particular type of tough-construction in Modern Standard Arabic (MSA), illustrated in (1)-(2). In these examples we see a DP (bracketed), followed by an adjective (underscored) and a deverbal noun. Attached to this last element is a resumptive pronoun (glossed as ‘RP’) which corefers with the bracketed DP.
    (1) [hāḏā l-kitābu]i mumtiʿun qirāʾatu-hui
    this the-book.ᴍ.sɢ.ɴᴏᴍ pleasant.ᴍ.sɢ.ɴᴏᴍ read.f.sɢ.ɴᴏᴍ-ʀᴘ.ᴍ.sɢ
    ‘This book is pleasant to read.’ (= predicative reading, root clause)
    (2) [[al-ʿimāratu]i [ṣ-ṣaʿbu bayʿu-hāi]] bīʿat ʾaẖīran
    the-building.f.sɢ.ɴᴏᴍ the-difficult.ᴍ.sɢ.ɴᴏᴍ sell.ᴍ.sɢ.ɴᴏᴍ-ʀᴘ.f.sɢ was.sold finally
    ‘[The building which is difficult to sell] was finally sold.’ (= attributive reading, DP)
    A key property of these structures concerns the agreement morphology on the adjective. In MSA, attributive and predicative adjectives canonically agree with their head noun or subject. However, as can be deduced from (2), the adjective in tough-constructions does not agree in number, gender or case with the lefthand DP, but rather it appears with ‘default’ nominative masculine singular morphology.
    We will propose a unified analysis of (1)-(2), which crucially incorporates the idea that the relevant structures contain two distinct subject positions. First, to account for the observed lack of agreement between the DP and the adjective, we adopt Mohammad’s (1990, 2000) suggestion that the ‘default’ agreement which can be observed in a number of (impersonal) constructions in MSA is due to the presence of a (phonologically null) expletive subject. We take this expletive pronoun to be located in SpecTP, where it agrees with the adjectival predicate. Next, having discarded the hypothesis that the initial DP sits in an A-bar position, we propose that it occurs in Cardinaletti’s (2004) SubjP, i.e. a high subject position which is associated with an ‘aboutness’ reading. Interestingly, there is independent evidence that in MSA (as well as in many other languages) two clause-mate subject positions can be filled simultaneously by non-coreferring XPs: this is the case in the ‘broad subject’ pattern discussed in e.g. Alexopoulou et al. (2004).

    Show content