I18N::LangTags::List(3p) Perl Programmers Reference Guide

I18N::LangTags::List(3p) Perl Programmers Reference Guide #

I18N::LangTags::List(3p) Perl Programmers Reference Guide

NNAAMMEE #

 I18N::LangTags::List -- tags and names for human languages

SSYYNNOOPPSSIISS #

   use I18N::LangTags::List;
   print "Parlez-vous... ", join(', ',
       I18N::LangTags::List::name('elx') || 'unknown_language',
       I18N::LangTags::List::name('ar-Kw') || 'unknown_language',
       I18N::LangTags::List::name('en') || 'unknown_language',
       I18N::LangTags::List::name('en-CA') || 'unknown_language',
     ), "?\n";

 prints:

   Parlez-vous... Elamite, Kuwait Arabic, English, Canadian English?

DDEESSCCRRIIPPTTIIOONN #

 This module provides a function "I18N::LangTags::List::name( _l_a_n_g_t_a_g ) "
 that takes a language tag (see I18N::LangTags) and returns the best
 attempt at an English name for it, or undef if it can't make sense of the
 tag.

 The function I18N::LangTags::List::name(...) is not exported.

 This module also provides a function "I18N::LangTags::List::is_decent(
 _l_a_n_g_t_a_g )" that returns true iff the language tag is syntactically valid
 and is for general use (like "fr" or "fr-ca", below).  That is, it
 returns false for tags that are syntactically invalid and for tags, like
 "aus", that are listed in brackets below.  This function is not exported.

 The map of tags-to-names that it uses is accessible as
 %I18N::LangTags::List::Name, and it's the same as the list that follows
 in this documentation, which should be useful to you even if you don't
 use this module.

AABBOOUUTT LLAANNGGUUAAGGEE TTAAGGSS #

 Internet language tags, as defined in RFC 3066, are a formalism for
 denoting human languages.  The two-letter ISO 639-1 language codes are
 well known (as "en" for English), as are their forms when qualified by a
 country code ("en-US").  Less well-known are the arbitrary-length non-ISO
 codes (like "i-mingo"), and the recently (in 2001) introduced three-
 letter ISO-639-2 codes.

 Remember these important facts:

 •   Language tags are not locale IDs.  A locale ID is written with a "_"
     instead of a "-", (almost?) always matches "m/^\w\w_\w\w\b/", and
     _m_e_a_n_s something different than a language tag.  A language tag
     denotes a language.  A locale ID denotes a language _a_s _u_s_e_d _i_n a
     particular place, in combination with non-linguistic location-
     specific information such as what currency is used there.  Locales
     _a_l_s_o often denote character set information, as in "en_US.ISO8859-1".

 •   Language tags are not for computer languages.

 •   "Dialect" is not a useful term, since there is no objective criterion
     for establishing when two language-forms are dialects of eachother,
     or are separate languages.

 •   Language tags are not case-sensitive.  en-US, en-us, En-Us, etc., are
     all the same tag, and denote the same language.

 •   Not every language tag really refers to a single language.  Some
     language tags refer to conditions: i-default (system-message text in
     English plus maybe other languages), und (undetermined language).
     Others (notably lots of the three-letter codes) are bibliographic
     tags that classify whole groups of languages, as with cus "Cushitic
     (Other)" (i.e., a language that has been classed as Cushtic, but
     which has no more specific code) or the even less linguistically
     coherent sai for "South American Indian (Other)".  Though useful in
     bibliography, SSUUCCHH TTAAGGSS AARREE NNOOTT FFOORR GGEENNEERRAALL UUSSEE.  For further
     guidance, email me.

 •   Language tags are not country codes.  In fact, they are often
     distinct codes, as with language tag ja for Japanese, and ISO 3166
     country code ".jp" for Japan.

LLIISSTT OOFF LLAANNGGUUAAGGEESS #

 The first part of each item is the language tag, between {...}.  It is
 followed by an English name for the language or language-group.  Language
 tags that I judge to be not for general use, are bracketed.

 This list is in alphabetical order by English name of the language.

 {ab} : Abkhazian
     eq Abkhaz

 {ace} : Achinese
 {ach} : Acoli
 {ada} : Adangme
 {ady} : Adyghe
     eq Adygei

 {aa} : Afar
 {afh} : Afrihili
     (Artificial)

 {af} : Afrikaans
 [{afa} : Afro-Asiatic (Other)]
 {ak} : Akan
     (Formerly "aka".)

 {akk} : Akkadian
     (Historical)

 {sq} : Albanian
 {ale} : Aleut
 [{alg} : Algonquian languages]
     NOT Algonquin!

 [{tut} : Altaic (Other)]
 {am} : Amharic
     NOT Aramaic!

 {i-ami} : Ami
     eq Amis.  eq 'Amis.  eq Pangca.

 [{apa} : Apache languages]
 {ar} : Arabic
     Many forms are mutually un-intelligible in spoken media.  Notable
     forms: {ar-ae} UAE Arabic; {ar-bh} Bahrain Arabic; {ar-dz} Algerian
     Arabic; {ar-eg} Egyptian Arabic; {ar-iq} Iraqi Arabic; {ar-jo}
     Jordanian Arabic; {ar-kw} Kuwait Arabic; {ar-lb} Lebanese Arabic;
     {ar-ly} Libyan Arabic; {ar-ma} Moroccan Arabic; {ar-om} Omani Arabic;
     {ar-qa} Qatari Arabic; {ar-sa} Sauda Arabic; {ar-sy} Syrian Arabic;
     {ar-tn} Tunisian Arabic; {ar-ye} Yemen Arabic.

 {arc} : Aramaic
     NOT Amharic!  NOT Samaritan Aramaic!

 {arp} : Arapaho
 {arn} : Araucanian
 {arw} : Arawak
 {hy} : Armenian
 {an} : Aragonese
 [{art} : Artificial (Other)]
 {ast} : Asturian
     eq Bable.

 {as} : Assamese
 [{ath} : Athapascan languages]
     eq Athabaskan.  eq Athapaskan.  eq Athabascan.

 [{aus} : Australian languages]
 [{map} : Austronesian (Other)]
 {av} : Avaric
     (Formerly "ava".)

 {ae} : Avestan
     eq Zend

 {awa} : Awadhi
 {ay} : Aymara
 {az} : Azerbaijani
     eq Azeri

     Notable forms: {az-Arab} Azerbaijani in Arabic script; {az-Cyrl}
     Azerbaijani in Cyrillic script; {az-Latn} Azerbaijani in Latin
     script.

 {ban} : Balinese
 [{bat} : Baltic (Other)]
 {bal} : Baluchi
 {bm} : Bambara
     (Formerly "bam".)

 [{bai} : Bamileke languages]
 {bad} : Banda
 [{bnt} : Bantu (Other)]
 {bas} : Basa
 {ba} : Bashkir
 {eu} : Basque
 {btk} : Batak (Indonesia)
 {bej} : Beja
 {be} : Belarusian
     eq Belarussian.  eq Byelarussian.  eq Belorussian.  eq Byelorussian.
     eq White Russian.  eq White Ruthenian.  NOT Ruthenian!

 {bem} : Bemba
 {bn} : Bengali
     eq Bangla.

 [{ber} : Berber (Other)]
 {bho} : Bhojpuri
 {bh} : Bihari
 {bik} : Bikol
 {bin} : Bini
 {bi} : Bislama
     eq Bichelamar.

 {bs} : Bosnian
 {bra} : Braj
 {br} : Breton
 {bug} : Buginese
 {bg} : Bulgarian
 {i-bnn} : Bunun
 {bua} : Buriat
 {my} : Burmese
 {cad} : Caddo
 {car} : Carib
 {ca} : Catalan
     eq Catalán.  eq Catalonian.

 [{cau} : Caucasian (Other)]
 {ceb} : Cebuano
 [{cel} : Celtic (Other)]
     Notable forms: {cel-gaulish} Gaulish (Historical)

 [{cai} : Central American Indian (Other)]
 {chg} : Chagatai
     (Historical?)

 [{cmc} : Chamic languages]
 {ch} : Chamorro
 {ce} : Chechen
 {chr} : Cherokee
     eq Tsalagi

 {chy} : Cheyenne
 {chb} : Chibcha
     (Historical)  NOT Chibchan (which is a language family).

 {ny} : Chichewa
     eq Nyanja.  eq Chinyanja.

 {zh} : Chinese
     Many forms are mutually un-intelligible in spoken media.  Notable
     forms: {zh-Hans} Chinese, in simplified script; {zh-Hant} Chinese, in
     traditional script; {zh-tw} Taiwan Chinese; {zh-cn} PRC Chinese;
     {zh-sg} Singapore Chinese; {zh-mo} Macau Chinese; {zh-hk} Hong Kong
     Chinese; {zh-guoyu} Mandarin [Putonghua/Guoyu]; {zh-hakka} Hakka
     [formerly "i-hakka"]; {zh-min} Hokkien; {zh-min-nan} Southern
     Hokkien; {zh-wuu} Shanghaiese; {zh-xiang} Hunanese; {zh-gan} Gan;
     {zh-yue} Cantonese.

 {chn} : Chinook Jargon
     eq Chinook Wawa.

 {chp} : Chipewyan
 {cho} : Choctaw
 {cu} : Church Slavic
     eq Old Church Slavonic.

 {chk} : Chuukese
     eq Trukese.  eq Chuuk.  eq Truk.  eq Ruk.

 {cv} : Chuvash
 {cop} : Coptic
 {kw} : Cornish
 {co} : Corsican
     eq Corse.

 {cr} : Cree
     NOT Creek!  (Formerly "cre".)

 {mus} : Creek
     NOT Cree!

 [{cpe} : English-based Creoles and pidgins (Other)]
 [{cpf} : French-based Creoles and pidgins (Other)]
 [{cpp} : Portuguese-based Creoles and pidgins (Other)]
 [{crp} : Creoles and pidgins (Other)]
 {hr} : Croatian
     eq Croat.

 [{cus} : Cushitic (Other)]
 {cs} : Czech
 {dak} : Dakota
     eq Nakota.  eq Latoka.

 {da} : Danish
 {dar} : Dargwa
 {day} : Dayak
 {i-default} : Default (Fallthru) Language
     Defined in RFC 2277, this is for tagging text (which must include
     English text, and might/should include text in other appropriate
     languages) that is emitted in a context where language-negotiation
     wasn't possible -- in SMTP mail failure messages, for example.

 {del} : Delaware
 {din} : Dinka
 {dv} : Divehi
     eq Maldivian.  (Formerly "div".)

 {doi} : Dogri
     NOT Dogrib!

 {dgr} : Dogrib
     NOT Dogri!

 [{dra} : Dravidian (Other)]
 {dua} : Duala
 {nl} : Dutch
     eq Netherlander.  Notable forms: {nl-nl} Netherlands Dutch; {nl-be}
     Belgian Dutch.

 {dum} : Middle Dutch (ca.1050-1350)
     (Historical)

 {dyu} : Dyula
 {dz} : Dzongkha
 {efi} : Efik
 {egy} : Ancient Egyptian
     (Historical)

 {eka} : Ekajuk
 {elx} : Elamite
     (Historical)

 {en} : English
     Notable forms: {en-au} Australian English; {en-bz} Belize English;
     {en-ca} Canadian English; {en-gb} UK English; {en-ie} Irish English;
     {en-jm} Jamaican English; {en-nz} New Zealand English; {en-ph}
     Philippine English; {en-tt} Trinidad English; {en-us} US English;
     {en-za} South African English; {en-zw} Zimbabwe English.

 {enm} : Old English (1100-1500)
     (Historical)

 {ang} : Old English (ca.450-1100)
     eq Anglo-Saxon.  (Historical)

 {i-enochian} : Enochian (Artificial)
 {myv} : Erzya
 {eo} : Esperanto
     (Artificial)

 {et} : Estonian
 {ee} : Ewe
     (Formerly "ewe".)

 {ewo} : Ewondo
 {fan} : Fang
 {fat} : Fanti
 {fo} : Faroese
 {fj} : Fijian
 {fi} : Finnish
 [{fiu} : Finno-Ugrian (Other)]
     eq Finno-Ugric.  NOT Ugaritic!

 {fon} : Fon
 {fr} : French
     Notable forms: {fr-fr} France French; {fr-be} Belgian French; {fr-ca}
     Canadian French; {fr-ch} Swiss French; {fr-lu} Luxembourg French;
     {fr-mc} Monaco French.

 {frm} : Middle French (ca.1400-1600)
     (Historical)

 {fro} : Old French (842-ca.1400)
     (Historical)

 {fy} : Frisian
 {fur} : Friulian
 {ff} : Fulah
     (Formerly "ful".)

 {gaa} : Ga
 {gd} : Scots Gaelic
     NOT Scots!

 {gl} : Gallegan
     eq Galician

 {lg} : Ganda
     (Formerly "lug".)

 {gay} : Gayo
 {gba} : Gbaya
 {gez} : Geez
     eq Ge'ez

 {ka} : Georgian
 {de} : German
     Notable forms: {de-at} Austrian German; {de-be} Belgian German;
     {de-ch} Swiss German; {de-de} Germany German; {de-li} Liechtenstein
     German; {de-lu} Luxembourg German.

 {gmh} : Middle High German (ca.1050-1500)
     (Historical)

 {goh} : Old High German (ca.750-1050)
     (Historical)

 [{gem} : Germanic (Other)]
 {gil} : Gilbertese
 {gon} : Gondi
 {gor} : Gorontalo
 {got} : Gothic
     (Historical)

 {grb} : Grebo
 {grc} : Ancient Greek
     (Historical)  (Until 15th century or so.)

 {el} : Modern Greek
     (Since 15th century or so.)

 {gn} : Guarani
     Guaraní

 {gu} : Gujarati
 {gwi} : Gwich'in
     eq Gwichin

 {hai} : Haida
 {ht} : Haitian
     eq Haitian Creole

 {ha} : Hausa
 {haw} : Hawaiian
     Hawai'ian

 {he} : Hebrew
     (Formerly "iw".)

 {hz} : Herero
 {hil} : Hiligaynon
 {him} : Himachali
 {hi} : Hindi
 {ho} : Hiri Motu
 {hit} : Hittite
     (Historical)

 {hmn} : Hmong
 {hu} : Hungarian
 {hup} : Hupa
 {iba} : Iban
 {is} : Icelandic
 {io} : Ido
     (Artificial)

 {ig} : Igbo
     (Formerly "ibo".)

 {ijo} : Ijo
 {ilo} : Iloko
 [{inc} : Indic (Other)]
 [{ine} : Indo-European (Other)]
 {id} : Indonesian
     (Formerly "in".)

 {inh} : Ingush
 {ia} : Interlingua (International Auxiliary Language Association)
     (Artificial)  NOT Interlingue!

 {ie} : Interlingue
     (Artificial)  NOT Interlingua!

 {iu} : Inuktitut
     A subform of "Eskimo".

 {ik} : Inupiaq
     A subform of "Eskimo".

 [{ira} : Iranian (Other)]
 {ga} : Irish
 {mga} : Middle Irish (900-1200)
     (Historical)

 {sga} : Old Irish (to 900)
     (Historical)

 [{iro} : Iroquoian languages]
 {it} : Italian
     Notable forms: {it-it} Italy Italian; {it-ch} Swiss Italian.

 {ja} : Japanese
     (NOT "jp"!)

 {jv} : Javanese
     (Formerly "jw" because of a typo.)

 {jrb} : Judeo-Arabic
 {jpr} : Judeo-Persian
 {kbd} : Kabardian
 {kab} : Kabyle
 {kac} : Kachin
 {kl} : Kalaallisut
     eq Greenlandic "Eskimo"

 {xal} : Kalmyk
 {kam} : Kamba
 {kn} : Kannada
     eq Kanarese.  NOT Canadian!

 {kr} : Kanuri
     (Formerly "kau".)

 {krc} : Karachay-Balkar
 {kaa} : Kara-Kalpak
 {kar} : Karen
 {ks} : Kashmiri
 {csb} : Kashubian
     eq Kashub

 {kaw} : Kawi
 {kk} : Kazakh
 {kha} : Khasi
 {km} : Khmer
     eq Cambodian.  eq Kampuchean.

 [{khi} : Khoisan (Other)]
 {kho} : Khotanese
 {ki} : Kikuyu
     eq Gikuyu.

 {kmb} : Kimbundu
 {rw} : Kinyarwanda
 {ky} : Kirghiz
 {i-klingon} : Klingon
 {kv} : Komi
 {kg} : Kongo
     (Formerly "kon".)

 {kok} : Konkani
 {ko} : Korean
 {kos} : Kosraean
 {kpe} : Kpelle
 {kro} : Kru
 {kj} : Kuanyama
 {kum} : Kumyk
 {ku} : Kurdish
 {kru} : Kurukh
 {kut} : Kutenai
 {lad} : Ladino
     eq Judeo-Spanish.  NOT Ladin (a minority language in Italy).

 {lah} : Lahnda
     NOT Lamba!

 {lam} : Lamba
     NOT Lahnda!

 {lo} : Lao
     eq Laotian.

 {la} : Latin
     (Historical)  NOT Ladin!  NOT Ladino!

 {lv} : Latvian
     eq Lettish.

 {lb} : Letzeburgesch
     eq Luxemburgian, eq Luxemburger.  (Formerly "i-lux".)

 {lez} : Lezghian
 {li} : Limburgish
     eq Limburger, eq Limburgan.  NOT Letzeburgesch!

 {ln} : Lingala
 {lt} : Lithuanian
 {nds} : Low German
     eq Low Saxon.  eq Low German.  eq Low Saxon.

 {art-lojban} : Lojban (Artificial)
 {loz} : Lozi
 {lu} : Luba-Katanga
     (Formerly "lub".)

 {lua} : Luba-Lulua
 {lui} : Luiseno
     eq Luiseño.

 {lun} : Lunda
 {luo} : Luo (Kenya and Tanzania)
 {lus} : Lushai
 {mk} : Macedonian
     eq the modern Slavic language spoken in what was Yugoslavia.  NOT the
     form of Greek spoken in Greek Macedonia!

 {mad} : Madurese
 {mag} : Magahi
 {mai} : Maithili
 {mak} : Makasar
 {mg} : Malagasy
 {ms} : Malay
     NOT Malayalam!

 {ml} : Malayalam
     NOT Malay!

 {mt} : Maltese
 {mnc} : Manchu
 {mdr} : Mandar
     NOT Mandarin!

 {man} : Mandingo
 {mni} : Manipuri
     eq Meithei.

 [{mno} : Manobo languages]
 {gv} : Manx
 {mi} : Maori
     NOT Mari!

 {mr} : Marathi
 {chm} : Mari
     NOT Maori!

 {mh} : Marshall
     eq Marshallese.

 {mwr} : Marwari
 {mas} : Masai
 [{myn} : Mayan languages]
 {men} : Mende
 {mic} : Micmac
 {min} : Minangkabau
 {i-mingo} : Mingo
     eq the Irquoian language West Virginia Seneca.  NOT New York Seneca!

 [{mis} : Miscellaneous languages]
     Don't use this.

 {moh} : Mohawk
 {mdf} : Moksha
 {mo} : Moldavian
     eq Moldovan.

 [{mkh} : Mon-Khmer (Other)]
 {lol} : Mongo
 {mn} : Mongolian
     eq Mongol.

 {mos} : Mossi
 [{mul} : Multiple languages]
     Not for normal use.

 [{mun} : Munda languages]
 {nah} : Nahuatl
 {nap} : Neapolitan
 {na} : Nauru
 {nv} : Navajo
     eq Navaho.  (Formerly "i-navajo".)

 {nd} : North Ndebele
 {nr} : South Ndebele
 {ng} : Ndonga
 {ne} : Nepali
     eq Nepalese.  Notable forms: {ne-np} Nepal Nepali; {ne-in} India
     Nepali.

 {new} : Newari
 {nia} : Nias
 [{nic} : Niger-Kordofanian (Other)]
 [{ssa} : Nilo-Saharan (Other)]
 {niu} : Niuean
 {nog} : Nogai
 {non} : Old Norse
     (Historical)

 [{nai} : North American Indian]
     Do not use this.

 {no} : Norwegian
     Note the two following forms:

 {nb} : Norwegian Bokmal
     eq Bokmål, (A form of Norwegian.)  (Formerly "no-bok".)

 {nn} : Norwegian Nynorsk
     (A form of Norwegian.)  (Formerly "no-nyn".)

 [{nub} : Nubian languages]
 {nym} : Nyamwezi
 {nyn} : Nyankole
 {nyo} : Nyoro
 {nzi} : Nzima
 {oc} : Occitan (post 1500)
     eq Provençal, eq Provencal

 {oj} : Ojibwa
     eq Ojibwe.  (Formerly "oji".)

 {or} : Oriya
 {om} : Oromo
 {osa} : Osage
 {os} : Ossetian; Ossetic
 [{oto} : Otomian languages]
     Group of languages collectively called "Otomí".

 {pal} : Pahlavi
     eq Pahlevi

 {i-pwn} : Paiwan
     eq Pariwan

 {pau} : Palauan
 {pi} : Pali
     (Historical?)

 {pam} : Pampanga
 {pag} : Pangasinan
 {pa} : Panjabi
     eq Punjabi

 {pap} : Papiamento
     eq Papiamentu.

 [{paa} : Papuan (Other)]
 {fa} : Persian
     eq Farsi.  eq Iranian.

 {peo} : Old Persian (ca.600-400 B.C.)
 [{phi} : Philippine (Other)]
 {phn} : Phoenician
     (Historical)

 {pon} : Pohnpeian
     NOT Pompeiian!

 {pl} : Polish
 {pt} : Portuguese
     eq Portugese.  Notable forms: {pt-pt} Portugal Portuguese; {pt-br}
     Brazilian Portuguese.

 [{pra} : Prakrit languages]
 {pro} : Old Provencal (to 1500)
     eq Old Provençal.  (Historical.)

 {ps} : Pushto
     eq Pashto.  eq Pushtu.

 {qu} : Quechua
     eq Quecha.

 {rm} : Raeto-Romance
     eq Romansh.

 {raj} : Rajasthani
 {rap} : Rapanui
 {rar} : Rarotongan
 [{qaa - qtz} : Reserved for local use.]
 [{roa} : Romance (Other)]
     NOT Romanian!  NOT Romany!  NOT Romansh!

 {ro} : Romanian
     eq Rumanian.  NOT Romany!

 {rom} : Romany
     eq Rom.  NOT Romanian!

 {rn} : Rundi
 {ru} : Russian
     NOT White Russian!  NOT Rusyn!

 [{sal} : Salishan languages]
     Large language group.

 {sam} : Samaritan Aramaic
     NOT Aramaic!

 {se} : Northern Sami
     eq Lappish.  eq Lapp.  eq (Northern) Saami.

 {sma} : Southern Sami
 {smn} : Inari Sami
 {smj} : Lule Sami
 {sms} : Skolt Sami
 [{smi} : Sami languages (Other)]
 {sm} : Samoan
 {sad} : Sandawe
 {sg} : Sango
 {sa} : Sanskrit
     (Historical)

 {sat} : Santali
 {sc} : Sardinian
     eq Sard.

 {sas} : Sasak
 {sco} : Scots
     NOT Scots Gaelic!

 {sel} : Selkup
 [{sem} : Semitic (Other)]
 {sr} : Serbian
     eq Serb.  NOT Sorbian.

     Notable forms: {sr-Cyrl} : Serbian in Cyrillic script; {sr-Latn} :
     Serbian in Latin script.

 {srr} : Serer
 {shn} : Shan
 {sn} : Shona
 {sid} : Sidamo
 {sgn-...} : Sign Languages
     Always use with a subtag.  Notable forms: {sgn-gb} British Sign
     Language (BSL); {sgn-ie} Irish Sign Language (ESL); {sgn-ni}
     Nicaraguan Sign Language (ISN); {sgn-us} American Sign Language

(ASL). #

     (And so on with other country codes as the subtag.)

 {bla} : Siksika
     eq Blackfoot.  eq Pikanii.

 {sd} : Sindhi
 {si} : Sinhalese
     eq Sinhala.

 [{sit} : Sino-Tibetan (Other)]
 [{sio} : Siouan languages]
 {den} : Slave (Athapascan)
     ("Slavey" is a subform.)

 [{sla} : Slavic (Other)]
 {sk} : Slovak
     eq Slovakian.

 {sl} : Slovenian
     eq Slovene.

 {sog} : Sogdian
 {so} : Somali
 {son} : Songhai
 {snk} : Soninke
 {wen} : Sorbian languages
     eq Wendish.  eq Sorb.  eq Lusatian.  eq Wend.  NOT Venda!  NOT
     Serbian!

 {nso} : Northern Sotho
 {st} : Southern Sotho
     eq Sutu.  eq Sesotho.

 [{sai} : South American Indian (Other)]
 {es} : Spanish
     Notable forms: {es-ar} Argentine Spanish; {es-bo} Bolivian Spanish;
     {es-cl} Chilean Spanish; {es-co} Colombian Spanish; {es-do} Dominican
     Spanish; {es-ec} Ecuadorian Spanish; {es-es} Spain Spanish; {es-gt}
     Guatemalan Spanish; {es-hn} Honduran Spanish; {es-mx} Mexican
     Spanish; {es-pa} Panamanian Spanish; {es-pe} Peruvian Spanish;
     {es-pr} Puerto Rican Spanish; {es-py} Paraguay Spanish; {es-sv}
     Salvadoran Spanish; {es-us} US Spanish; {es-uy} Uruguayan Spanish;
     {es-ve} Venezuelan Spanish.

 {suk} : Sukuma
 {sux} : Sumerian
     (Historical)

 {su} : Sundanese
 {sus} : Susu
 {sw} : Swahili
     eq Kiswahili

 {ss} : Swati
 {sv} : Swedish
     Notable forms: {sv-se} Sweden Swedish; {sv-fi} Finland Swedish.

 {syr} : Syriac
 {tl} : Tagalog
 {ty} : Tahitian
 [{tai} : Tai (Other)]
     NOT Thai!

 {tg} : Tajik
 {tmh} : Tamashek
 {ta} : Tamil
 {i-tao} : Tao
     eq Yami.

 {tt} : Tatar
 {i-tay} : Tayal
     eq Atayal.  eq Atayan.

 {te} : Telugu
 {ter} : Tereno
 {tet} : Tetum
 {th} : Thai
     NOT Tai!

 {bo} : Tibetan
 {tig} : Tigre
 {ti} : Tigrinya
 {tem} : Timne
     eq Themne.  eq Timene.

 {tiv} : Tiv
 {tli} : Tlingit
 {tpi} : Tok Pisin
 {tkl} : Tokelau
 {tog} : Tonga (Nyasa)
     NOT Tsonga!

 {to} : Tonga (Tonga Islands)
     (Pronounced "Tong-a", not "Tong-ga")

     NOT Tsonga!

 {tsi} : Tsimshian
     eq Sm'algyax

 {ts} : Tsonga
     NOT Tonga!

 {i-tsu} : Tsou
 {tn} : Tswana
     Same as Setswana.

 {tum} : Tumbuka
 [{tup} : Tupi languages]
 {tr} : Turkish
     (Typically in Roman script)

 {ota} : Ottoman Turkish (1500-1928)
     (Typically in Arabic script)  (Historical)

 {crh} : Crimean Turkish
     eq Crimean Tatar

 {tk} : Turkmen
     eq Turkmeni.

 {tvl} : Tuvalu
 {tyv} : Tuvinian
     eq Tuvan.  eq Tuvin.

 {tw} : Twi
 {udm} : Udmurt
 {uga} : Ugaritic
     NOT Ugric!

 {ug} : Uighur
 {uk} : Ukrainian
 {umb} : Umbundu
 {und} : Undetermined
     Not a tag for normal use.

 {ur} : Urdu
 {uz} : Uzbek
     eq Özbek

     Notable forms: {uz-Cyrl} Uzbek in Cyrillic script; {uz-Latn} Uzbek in
     Latin script.

 {vai} : Vai
 {ve} : Venda
     NOT Wendish!  NOT Wend!  NOT Avestan!  (Formerly "ven".)

 {vi} : Vietnamese
     eq Viet.

 {vo} : Volapuk
     eq Volapük.  (Artificial)

 {vot} : Votic
     eq Votian.  eq Vod.

 [{wak} : Wakashan languages]
 {wa} : Walloon
 {wal} : Walamo
     eq Wolaytta.

 {war} : Waray
     Presumably the Philippine language Waray-Waray (Samareño), not the
     smaller Philippine language Waray Sorsogon, nor the extinct
     Australian language Waray.

 {was} : Washo
     eq Washoe

 {cy} : Welsh
 {wo} : Wolof
 {x-...} : Unregistered (Semi-Private Use)
     "x-" is a prefix for language tags that are not registered with ISO
     or IANA.  Example, x-double-dutch

 {xh} : Xhosa
 {sah} : Yakut
 {yao} : Yao
     (The Yao in Malawi?)

 {yap} : Yapese
     eq Yap

 {ii} : Sichuan Yi
 {yi} : Yiddish
     Formerly "ji".  Usually in Hebrew script.

     Notable forms: {yi-latn} Yiddish in Latin script

 {yo} : Yoruba
 [{ypk} : Yupik languages]
     Several "Eskimo" languages.

 {znd} : Zande
 [{zap} : Zapotec]
     (A group of languages.)

 {zen} : Zenaga
     NOT Zend.

 {za} : Zhuang
 {zu} : Zulu
 {zun} : Zuni
     eq Zuñi

SSEEEE AALLSSOO #

 I18N::LangTags and its "See Also" section.

CCOOPPYYRRIIGGHHTT AANNDD DDIISSCCLLAAIIMMEERR #

 Copyright (c) 2001+ Sean M. Burke. All rights reserved.

 You can redistribute and/or modify this document under the same terms as
 Perl itself.

 This document is provided in the hope that it will be useful, but without
 any warranty; without even the implied warranty of accuracy,
 authoritativeness, completeness, merchantability, or fitness for a
 particular purpose.

 Email any corrections or questions to me.

AAUUTTHHOORR #

 Sean M. Burke, sburke@cpan.org

perl v5.36.3 2019-02-13 I18N::LangTags::List(3p)