Wiktionary:Todo/Language code errors


Introduction

Wiktionary has its own set of language codes based on those of the ISO, and its own set of language names that go with them. Problems arise when these codes don't match the ISO ones. This is mostly due to differences in scope: Wiktionary disagrees with the ISO as to which lects are independent languages and which are varieties of other lects. There are also differences as to which name or variation of a name should be used as the main name.

On top of that, some languages are associated with a specific country whose international country code differs from the language's own code. For example, the country code for Japan is jp, but the language code for Japanese is ja.

Add to that the large number of languages and language codes as well as the superficial similarity between different language names and between different language codes, and you have confused editors who make a mess of things.

Another common source of errors is editors who don't know what the codes are for, or who are careless or absent-minded about which code goes with which language. They copy templates from other entries and reuse them without updating the language codes.

Common errors

Code used | Correct header for code used | Header used | Correct code for header used | Notes
br | Breton | Belarusian | be | Typo
bs | none (module error) | Bosnian (Serbo-Croatian) | sh |
ch | Chamorro | Chinese | zh |
cnr | none (module error) | Montenegrin (Serbo-Croatian) | sh |
cu | Old Church Slavonic | Welsh | cy | Typo
hr | none (module error) | Croatian (Serbo-Croatian) | sh |
jp | none (module error) | Japanese | ja |
kr | Kanuri | Korean | ko |
li | Limburgish | Lithuanian | lt |
ne | Nepali | Dutch | nl |
nn | Norwegian Nynorsk | Old Norse | non | Typo
pa | Punjabi | Pashto | ps |
sc | Sardinian | Serbo-Croatian | sh |
sc | Sardinian | Sicilian | scn |
sr | none (module error) | Serbian (Serbo-Croatian) | sh |
sw | Swahili | Swedish | sv |
se | Northern Sami | Swedish | sv |
syc | Syriac | Assyrian Neo-Aramaic | aii |
as | Assamese | Bengali | bn |
bn | Bengali | Assamese | as |
id | Indonesian | Malay | ms |
ms | Malay | Indonesian | id |
el | Greek | Ancient Greek | grc |
ur | Urdu | Punjabi | pa |
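
The table above can be read as a lookup from a (code used, header used) pair to the code that should have been used. The following is only a minimal sketch of that idea: the names `KNOWN_CONFUSIONS` and `suggest_code` are hypothetical, and the dictionary holds just a few rows copied from the table rather than the full list.

```python
from typing import Optional

# Hypothetical subset of the table above:
# (code used, header used) -> correct code for that header.
KNOWN_CONFUSIONS = {
    ("br", "Belarusian"): "be",
    ("ch", "Chinese"): "zh",
    ("jp", "Japanese"): "ja",
    ("kr", "Korean"): "ko",
    ("sw", "Swedish"): "sv",
    ("se", "Swedish"): "sv",
    ("el", "Ancient Greek"): "grc",
}


def suggest_code(code_used: str, header_used: str) -> Optional[str]:
    """Return the correct code for a listed confusion, or None if the pair is not listed."""
    return KNOWN_CONFUSIONS.get((code_used, header_used))
```

For instance, `suggest_code("jp", "Japanese")` returns `"ja"`, while a pair that is not in the dictionary, such as `("sv", "Swedish")`, returns `None`.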

Groups

These are sets of languages where the same editors work on many of them and don't pay much attention to which codes they're using for which languages.

Philippines

(not all of them, by any means)

  1. bcl, Bikol Central
  2. ceb, Cebuano
  3. hil, Hiligaynon
  4. ilo, Ilocano
  5. kne, Kankanaey
  6. mdh, Maguindanao
  7. pag, Pangasinan
  8. pam, Kapampangan
  9. sgd, Surigaonan
  10. tl, Tagalog
  11. tsg, Tausug
  12. war, Waray-Waray

Malay-Indonesian

  1. id, Indonesian
  2. kaw, Old Javanese
  3. jv, Javanese
  4. ms, Malay

Indic

The hi (Hindi) or gu (Gujarati) codes tend to be used for a wide variety of these languages.

Bengali, etc.

  1. as, Assamese
  2. bn, Bengali
  3. syl, Sylheti

Dravidian

  1. kn, Kannada
  2. ml, Malayalam
  3. ta, Tamil
  4. te, Telugu

Eastern Romance

  1. ro, Romanian
  2. ruo, Istro-Romanian
  3. rup, Aromanian
  4. ruq, Megleno-Romanian

Goidelic

  1. ga, Irish
  2. gd, Scottish Gaelic
  3. gv, Manx
  4. sga, Old Irish

High German

  1. bar, Bavarian
  2. de, German
  3. gmw-cfr, Central Franconian
  4. gmw-ecg, East Central German
  5. hrx, Hunsrik
  6. lb, Luxembourgish
  7. wym, Vilamovian

Overlap with templates

Code Template Language
af {{affix}} [[Category:Afrikaans language]]
alt {{alter}} [[Category:Southern Altai language]]
ant {{antonyms}} [[Category:Antakarinya language]]
as {{antsense}} [[Category:Assamese language]]
bor {{borrowed}} [[Category:Borôro language]]
cal {{calque}} [[Category:Carolinian language]]
co {{collocation}} [[Category:Corsican language]]
cog {{cognate}} [[Category:Chong language]]
com {{compound}} [[Category:Comanche language]]
con {{confix}} [[Category:Cofán language]]
cot {{coordinate terms}} [[Category:Caquinte language]]
dbt {{doublet}} [[Category:Ben Tey language]]
der {{derived}} [[Category:Deori language]]
dim {{diminutive}} [[Category:Dime language]]
gl {{gloss}} [[Category:Galician language]]
hmp {{homophone}} [[Category:Northern Mashan Hmong language]]
inh {{inherited}} [[Category:Ingush language]]
lb {{label}} [[Category:Luxembourgish language]]
lbl {{label}} [[Category:Libon Bikol language]]
lg {{glossary}} [[Category:Luganda language]]
ngd {{non-gloss definition}} [[Category:Ngando (Central African Republic) language]]
nl {{nonlemma}} [[Category:Dutch language]]
nom {{nominalization}} [[Category:Nocamán language]]
or {{original research}} [[Category:Odia language]]
par {{parasynonyms}} [[Category:Panamint language]]
pl {{pseudo-loan}} [[Category:Polish language]]
pre {{prefix}} [[Category:Principense language]]
psm {{phono-semantic matching}} [[Category:Pauserna language]]
see {{see}} [[Category:Seneca language]]
sid {{senseid}} [[Category:Sidamo language]]
sl {{semantic loan}} [[Category:Slovene language]]
smc {{smallcaps}} [[Category:Som language]]
ss {{semantic shift}} [[Category:Swazi language]]
swp {{slim-wikipedia}} [[Category:Suau language]]
syn {{synonyms}} [[Category:Senaya language]]
tcl {{transclude}} [[Category:Taman (Myanmar) language]]
tl {{template}} [[Category:Tagalog language]]
tlb {{term label}} [[Category:Tobelo language]]
top {{topics}} [[Category:Papantla Totonac language]]
tr {{tea room}} [[Category:Turkish language]]
ubl {{unbulleted list}} [[Category:Buhi'non Bikol language]]
unk {{unknown}} [[Category:Enawené-Nawé language]]

Relevant maintenance categories

Mostly due to incorrect language codes in headword templates, but also misspelled, misformatted or missing language headers. There are a few cases where someone uses a language name not accepted by Wiktionary and puts the language code of a related language in the templates to avoid module errors.

Mostly due to incorrect language codes in templates, but also often due to using a transliteration of the type found in etymological references instead of the term in the correct script.

There are also problems with the modules' ability to deal with entries for diacritics that aren't on letters, and with certain letters in common scripts that are only used in a small number of the languages written in those scripts. These will need to be fixed by updating the modules.

The genuine cases of terms that are attested in scripts otherwise not used for their language should be ignored.

Relevant abuse filters

68: Missing headword templates

Other Todo pages

Wiktionary:Todo/Lists/Template language code does not match header (sorted by language)
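
As an illustration of the kind of check such a list implies, here is a rough sketch that scans an entry's wikitext and reports templates whose language code does not match the enclosing language header. It is not how the actual list is generated. `HEADER_TO_CODE`, `LANG_FIRST_TEMPLATES` and `find_mismatches` are hypothetical names, the header mapping is a tiny sample, and the sketch assumes that the listed templates take the language code as their first positional parameter.

```python
import re

# Hypothetical sample mapping; a real check would need the full language data.
HEADER_TO_CODE = {"Swedish": "sv", "Swahili": "sw", "Japanese": "ja"}

# Assumption: these templates take the language code as their first parameter.
LANG_FIRST_TEMPLATES = {"head", "lb", "label"}

# Matches level-2 headers such as "==Swedish==" but not "===Noun===".
L2_HEADER = re.compile(r"^==([^=]+)==\s*$")
# Captures the template name and its first positional parameter.
TEMPLATE = re.compile(r"\{\{([^|{}]+)\|([^|{}]+)")


def find_mismatches(wikitext):
    """Return (line number, header, template, code) for each mismatched code."""
    header = None
    expected = None
    mismatches = []
    for lineno, line in enumerate(wikitext.splitlines(), start=1):
        match = L2_HEADER.match(line)
        if match:
            header = match.group(1).strip()
            expected = HEADER_TO_CODE.get(header)
            continue
        if expected is None:
            # No known language section is open, so nothing to compare against.
            continue
        for name, code in TEMPLATE.findall(line):
            name, code = name.strip(), code.strip()
            if name in LANG_FIRST_TEMPLATES and code != expected:
                mismatches.append((lineno, header, name, code))
    return mismatches
```

Run on a Swedish section that contains `{{head|sw|noun}}`, the function reports that line, because sw is the code for Swahili and the header calls for sv.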