Events – ΔiaLing

Upcoming events

Event Information:

Thu
13
Jun
2024

Giuseppe Magistro (UGent) - "Creating a corpus of web-data with Pyrlato. A demonstration"
2:00 pmLokaal 3.30 - Camelot, Blandijn, Campus Boekentoren
The use of corpora in acoustic analyses has become a standard practice in phonetic phonological research, offering high ecological validity (see e.g. Beckman, 1997; Warner, 2012; Tucker & Mukai, 2023 for a discussion on validity). However, compiling corpora and looking for specific phenomena can be time and resource-consuming. In response to this challenge, we developed a program named Pyrlato, which we aim to demonstrate. Pyrlato is a novel tool designed for creating corpora of real-world spoken data from the web. The tool extracts audio files from YouTube, cutting and extracting desired segments such as specific phonemes, syllables, or words found in YouTube videos. This enables the creation of corpora with tens of thousands of tokens within a few computational hours. Pyrlato works across Dutch, English, French, German, Indonesian, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Turkish, Ukrainian, and Vietnamese, i.e. those languages for which YouTube provides automatic subtitles. The software searches for the desired string in the subtitles and, upon finding the match, extracts the relevant audio extract containing the string in .mp3 format (other formats are also possible).

The demonstration will showcase Pyrlato's online version and the application of some case studies.

• Beckman, M.E. (1997).A typology of spontaneous speech. In Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), Computing Prosody: Computational Models for Processing Spontaneous Speech (pp. 7–26). Springer. http://dx.doi.org/10.1007/978-1-4612-2258-3_2.
• Tucker, B.V., & Mukai, Y. (2023). Spontaneous speech. Cambridge University Press. http://doi.org/10.1017/9781108943024.
• Warner, N. (2012). Methods for studying spontaneous speech. In A. Cohn, C. Fougeron, & M. Huffman (Eds.), The Oxford Handbook of Laboratory Phonology (pp. 621–633). Oxford University Press.

Show content

Past events

Event Information:

Thu
15
Jun
2017

Multiple subjects across categories: Evidence from Modern Standard Arabic
1:00 pmGrote vergaderzaal, 3de verdieping Blandijn
Dialing talk: Fayssal Tayalati en Lieven Danckaert (Université de Lille 3), "Multiple subjects across categories: Evidence from Modern Standard Arabic"

In this talk we focus on a particular type of tough-construction in Modern Standard Arabic (MSA), illustrated in (1)-(2). In these examples we see a DP (bracketed), followed by an adjective (underscored) and a deverbal noun. Attached to this last element is a resumptive pronoun (glossed as ‘RP’) which corefers with the bracketed DP.
(1) [hāḏā l-kitābu]i mumtiʿun qirāʾatu-hui
this the-book.ᴍ.sɢ.ɴᴏᴍ pleasant.ᴍ.sɢ.ɴᴏᴍ read.f.sɢ.ɴᴏᴍ-ʀᴘ.ᴍ.sɢ
‘This book is pleasant to read.’ (= predicative reading, root clause)
(2) [[al-ʿimāratu]i [ṣ-ṣaʿbu bayʿu-hāi]] bīʿat ʾaẖīran
the-building.f.sɢ.ɴᴏᴍ the-difficult.ᴍ.sɢ.ɴᴏᴍ sell.ᴍ.sɢ.ɴᴏᴍ-ʀᴘ.f.sɢ was.sold finally
‘[The building which is difficult to sell] was finally sold.’ (= attributive reading, DP)
A key property of these structures concerns the agreement morphology on the adjective. In MSA, attributive and predicative adjectives canonically agree with their head noun or subject. However, as can be deduced from (2), the adjective in tough-constructions does not agree in number, gender or case with the lefthand DP, but rather it appears with ‘default’ nominative masculine singular morphology.
We will propose a unified analysis of (1)-(2), which crucially incorporates the idea that the relevant structures contain two distinct subject positions. First, to account for the observed lack of agreement between the DP and the adjective, we adopt Mohammad’s (1990, 2000) suggestion that the ‘default’ agreement which can be observed in a number of (impersonal) constructions in MSA is due to the presence of a (phonologically null) expletive subject. We take this expletive pronoun to be located in SpecTP, where it agrees with the adjectival predicate. Next, having discarded the hypothesis that the initial DP sits in an A-bar position, we propose that it occurs in Cardinaletti’s (2004) SubjP, i.e. a high subject position which is associated with an ‘aboutness’ reading. Interestingly, there is independent evidence that in MSA (as well as in many other languages) two clause-mate subject positions can be filled simultaneously by non-coreferring XPs: this is the case in the ‘broad subject’ pattern discussed in e.g. Alexopoulou et al. (2004).

Show content

Upcoming events

Event Information:

Giuseppe Magistro (UGent) - "Creating a corpus of web-data with Pyrlato. A demonstration"

Past events

Event Information:

Multiple subjects across categories: Evidence from Modern Standard Arabic