Wikidata:Property proposal/vazn
rhythmic weight
[edit]Originally proposed at Wikidata:Property proposal/Lexemes
Description | syllabification annotation for Urdu-based orthographies |
---|---|
Represents | vazn (Q115484594) |
Data type | String |
Domain | form |
Allowed values | (۰|۱|۲|\s)+ |
Example 1 | ਵੱਜਿਆ/وجّیا (L741125-F2) ISO 15919: (vaj.jiā); IPA: /ʋəd͡ʒˈjɑ/; Vazn: ۲۱ (21) |
Example 2 | ਆਇ/آۓ (L677817-F6) ISO 15919: (ā.i); IPA: /ɑ.i/; Vazn: ۰۰ (00) |
Example 3 | ISO 15919: (maṁg.vau.ṇa.gī.ā̃); IPA: /mɐŋɡˈʋɔ.ɳᵊˈɡɪ.ɑ̃/; Vazn: ۲۱۱۱۱ (21111) |
Example 4 | ݙاڈھے (L740480-F3) ISO 15919: (ḏ̣ā.ḍhe); IPA: /ᶑɑ.ɖʰɛ/; Vazn: ۱۱ (11) |
Source | an explanation may be observed by clicking on the “i” next to Vazn on Rekhta Dictionary entries |
Planned use | add to lexeme forms in Punjabi, Hindustani, and other languages used in Pakistan written with an Urdu-based orthography |
Motivation
[edit]We have the existing hyphenation property for indicating the syllabification of lexeme forms, but this is poorly suited to languages which are typically written with vowels omitted and/or a cursive script which is not legible when hyphenated. The term "vazn" is used in the English version of the Rekhta online Hindustani dictionary, hence the name used here, but this property could also be labeled as "rhythmic weight" in English if that is preferable. Either way, I do intend the scope for this property to be for languages used in Pakistan commonly written with an Urdu-based orthography - without context for the ways in which syllable timing occurs in other languages which use cursive scripts, or for the syllabification conventions used elsewhere, it is hard to say how applicable this might be. If anyone does think this could be applied more broadly however, I am interested to hear.
To explain how it works, the idea is that each syllable is given a number 1 (۱) if it contains one consonant or cluster, or 2 (۲) if it contains two. Typically, short vowels between consonants are not indicated, so we can use these numbers to infer where syllable boundaries occur. The rules have to be bent a little bit for Punjabi as vowel-only syllables are very common compared to Urdu - 0 (۰) will represent no consonant syllables, and stressed semi-vowels will be treated as consonants. A transliterated example for the benefit of those not able to read those given in the proposal, taking a Punjabi word and pronunciation rules. Singular oblique form لفظ /ləf.ɐz/ : 21. Plural oblique form لفظاں /ləf.zɑ̃/ : 22. For the purposes of this format, the letter ں nun gunna is treated as a consonant. Its actual sound is conditioned by context and can be a nasal consonant or nasalised vowel, the latter of which still makes sense to treat as consonant-like for syllabification. عُثمان (talk) 17:26, 27 November 2022 (UTC)
Discussion
[edit]Comment I don't know anything about this topic, but I think "rhythmic weight" or some other Enlish phrase would be a better label. "Vazn" can be used as an alias. — The Erinaceous One 🦔 07:41, 30 November 2022 (UTC)
- @The-erinaceous-one Fair enough; I have updated the title. I was thinking about it and there are ambiguities with وزن and other senses of the term; I will probably make this more specific in Punjabi as well. عُثمان (talk) 00:19, 17 December 2022 (UTC)
- Comment Withdrawn in part due to lack of interest, and because I think when I proposed this my understanding of syllabification in these languages was oversimplified. -عُثمان (talk) 17:18, 6 June 2023 (UTC)