Difference between revisions of "Features/Language-List"
< Features
Jump to navigation
Jump to search
Line 13: | Line 13: | ||
*'''Ethnologue''': language name in Ethnologue. Is this column useful to fill up the new_lib2 field? | *'''Ethnologue''': language name in Ethnologue. Is this column useful to fill up the new_lib2 field? | ||
*'''isLang''': confirmed to be a language or a major dialect (not mutually intelligible with others), whether or not it will be adopted. 1 if 'yes', 0 if 'no'. | *'''isLang''': confirmed to be a language or a major dialect (not mutually intelligible with others), whether or not it will be adopted. 1 if 'yes', 0 if 'no'. | ||
*'''update''': 'break', 'merge' | *'''update''': 'break' or '' | ||
*'''update2''': 'rename', 'merge', 'delete' | |||
===NEW COLUMNS=== | ===NEW COLUMNS=== |
Revision as of 14:58, 21 August 2017
The aim of this page is to build a comprehensive language list to replace the old list on the polyglotclub site.
Columns description
OLD COLUMNS
- old: OLD LIST. language name in English. DO NOT CHANGE
- old_lib: OLD LIST. autonym. DO NOT CHANGE
HELP COLUMNS
- community1 and community2: come from 2 well known language communities so those lists "should" be of good quality.
- Wikipedia: source. language name in English.
- Wikipedia2, Wikipedia2_lib: source. autonym.
- Wikipedia3: source & source & source. Languages not included in column Wikipedia.
- Ethnologue: language name in Ethnologue. Is this column useful to fill up the new_lib2 field?
- isLang: confirmed to be a language or a major dialect (not mutually intelligible with others), whether or not it will be adopted. 1 if 'yes', 0 if 'no'.
- update: 'break' or
- update2: 'rename', 'merge', 'delete'
NEW COLUMNS
- ISO_639-3: Why several codes per language? there can be only one code per language.
- new: NEW LIST. MAIN language name in English. Length of this field should be short. WARNING: NOT EDITABLE in the future
- new_lib: NEW LIST. Autonyms. if several names, separated by a comma. Example: name1, name2, name3. It can be useful because if users type any name in the drop-down list, they will find the language. EDITABLE in the future
- new_lib2: NEW LIST. Alternative English names separated by a comma. Example: name2, name3. It can be useful because if users type any name in the drop-down list, they will find the language. EDITABLE in the future.
To do
Users will search for: - the most important languages by typing the English name ('new' column), the autonym ('new_lib') and alternative English names ('new_lib2') - any other languages (less important ones) only by typing their English name ('new' column).
Get an "improved" ISO639-3 file adding some missing fields like new_lib, new_lib2 for the most important languages (like 500 languages out of the 7879). We will be able to fill up those new columns, and to find out which languages are the most important thanks to the help of other lists like : community1, community2, wikipedia, wikipedia2, wikipedia3... (see column description).
This file will also be necessary for the database update (delete, merge, break).
Attach new columns to the old list:
- Attach code ISO639-3 on our list : only one code per language. This code will be use to link our file with the big file
- Add the following comlumns in our table about updates (update, update_lib, update_id)
- Finish completing fields new_lib and new_lib2 (as much as we can)
List
old | old_lib | Wikipedia | Wikipedia2 | Wikipedia2_lib | Wikipedia3 | ISO_639-3 | new | new_lib | new_lib2 | update | |||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Abkhazian | Abkhaz | Aҧсуа бызшәа, Aҧсшәа | abk | Abkhazian | |||||||||
Afar | Afar | Qafár af | Afar | aar | Afar | ||||||||
Afrikaans | Afrikaans | Afrikaans | Afrikaans | afr | Afrikaans | Afrikaans | |||||||
Albanian | sqi | Albanian | break | ||||||||||
Albanian | aae | Arbëreshë Albanian | |||||||||||
Albanian | aat | Arvanitika Albanian | |||||||||||
Albanian | aln | Gheg Albanian | |||||||||||
Albanian | Albanian | Albanian | Shqip, Shqipja | als | Tosk Albanian | Standard Albanian | |||||||
Amharic | Amharic | Amharic | ኣማርኛ | amh | Amharic | ኣማርኛ | |||||||
Arabic | ara | Arabic | |||||||||||
Arabic | Arabic | Arabic | اللغة العربية | arb | Standard Arabic | break | |||||||
Armenian | Armenian | Armenian | Հայերեն | hye | Armenian | հայերեն, հայերէն | |||||||
Assamese | Assamese | Assamese | অসমীয়া | asm | Assamese | ||||||||
Assyrian | Assyrian | Assyrian Neo-Aramaic | ܐܬܘܪܝܐ ܣܘܪܝܝܐ Ātūrāyā, ܣܘܪܝܬ ܣܘܪܝܝܐ Sūrët, Āshuri, Suryāyā, Sureth | aii | Assyrian Neo-Aramaic | ||||||||
Aymara | Aymara | Aymara | Aymar Aru | ayr | Central Aymara | break | |||||||
Aymara | ayc | Southern Aymara | |||||||||||
Aymara | aym | Aymara | |||||||||||
Azerbaijani | Azerbaijani | Azerbaijani | Azərbaycanca, آذربایجان دیلی | azj | North Azerbaijani | break | |||||||
Azerbaijani | azb | South Azerbaijani | |||||||||||
Azerbaijani | aze | Azerbaijani | |||||||||||
Avar | Avar | MагIарул MацI | Avar | ava | Avaric | ||||||||
Bashkir | Bashkir | Bashkir | башҡорт Теле, Başqort Tele | bak | Bashkir | ||||||||
Basque | Basque | Euskara | Basque | eus | Basque | Euskara | |||||||
Berber | Berber | * | break | ||||||||||
Bhutani | Dzongkha | Dzongkha | རྫོང་ཁ་ | dzo | Dzongkha | རྫོང་ཁ་ | |||||||
Bihari | * | break | |||||||||||
Bislama | Bislama | bis | Bislama | ||||||||||
Bosnian | Bosnian | Bosnian | Босански, Bosanski | bos | Bosnian | Босански, Bosanski | |||||||
Breton | Breton | Brezhoneg | bre | Breton | Brezhoneg | ||||||||
Bulgarian | Bulgarian | Bulgarian | български език | bul | Bulgarian | български език | |||||||
Burmese | Burmese | Burmese | မြန်မာစာ | mya | Burmese | မြန်မာစာ | |||||||
Byelorussian | Belarusian | Belarusian | Беларуская | bel | Belarusian | Беларуская | |||||||
Bengali | Bengali | Bengali | বাংলা | ben | Bengali | বাংলা | |||||||
Cambodian | Khmer | Khmer | ភាសាខ្មែរ | khm | Central Khmer | ភាសាខ្មែរ | |||||||
Catalan | Catalan | Catalan | Català | cat | Catalan | Català | |||||||
Chinese, Mandarin | zho | Chinese | 汉语, 漢語, 中文 | ||||||||||
Chinese, Cantonese | Cantonese | yue | Yue Chinese | 廣東話, 广东话, 粵語, 粤语 | |||||||||
Chinese, Mandarin | Chinese | Mandarin | 國語 | cmn | Mandarin Chinese | 普通话, 國語, 华语, 官话, 官話 | break | ||||||
Min (Taiwanese) | nan | Min Nan Chinese | 台語, 臺語, 閩南語, 闽南语, 福建话 | ||||||||||
Creole | * | delete | |||||||||||
Corsican | Corsican | Corsu | cos | Corsican | Corsu | ||||||||
Croatian | Croatian | Croatian | Hrvatski | hrv | Croatian | Hrvatski | |||||||
Czech | Czech | Czech | Český Jazyk, Čeština | ces | Czech | ||||||||
Danish | Danish | Danish | Dansk | dan | Danish | Dansk | |||||||
Dutch | Dutch | Dutch | Nederlands | nld | Dutch | Nederlands | |||||||
Egyptian | egy | Egyptian Arabic | |||||||||||
English | English | English | English | eng | English | ||||||||
Esperanto | Esperanto | Esperanto | epo | Esperanto | Esperanto | ||||||||
Estonian | Estonian | Estonian | Eesti | est | Estonian | Eesti | |||||||
Estonian | ekk | Standard Estonian | |||||||||||
Fiji | Fijian | Fijian | Na vosa vaka-Viti, Vakaviti | fij | Fijian | ||||||||
Faeroese | Faroese | Faroese | Føroyskt | fao | Faroese | Føroyskt | |||||||
Tagalog | Filipino | Filipino | Wikang Filipino | fil | Filipino | ||||||||
Tagalog | Filipino | Tagalog | ᜊᜊᜌᜒ, ᜊᜌ᜔ᜊᜌᜒᜈ᜔, Wikang Tagalog | tgl | Tagalog | ||||||||
Finnish | Finnish | Finnish | Suomi | fin | Finnish | Suomi | |||||||
French | French | French | Français | fra | French | Français | |||||||
Frisian | West Frisian | Frisian (West) | Frysk | fry | Western Frisian | ||||||||
Ga | Ga | Ga | Gã | gaa | Ga | ||||||||
Gothic | got | Gothic | |||||||||||
Irish | Irish | Irish | Gaeilge | gle | Irish | ||||||||
Gaelic | Scottish Gaelic | Gàidhlig | Gàidhlig | gla | Scottish Gaelic | ||||||||
Galician | Galician | Galician | Galego | glg | Galician | Galego | |||||||
Georgian | Georgian | Georgian | ქართული | kat | Georgian | ქართული | |||||||
German | German | German | Deutsch | deu | German | Deutsch | |||||||
Greek | Greek | Greek | Ελληνικά | ell | Modern Greek (1453-) | Ελληνικά | |||||||
Greek (Classical) | grc | Ancient Greek (to 1453) | |||||||||||
Greenlandic | Kalaallisut | Greenlandic | Kalaallisut | kal | Kalaallisut | ||||||||
Guarani | Guaraní | Guaraní | Avañe'ẽ or Javy ju | gug | Paraguayan Guaraní | ||||||||
Gujarati | Gujarati | Gujarati | ગુજરાતી | guj | Gujarati | ગુજરાતી | |||||||
Hausa | Hausa | Hausa | حَوْسَ | hau | Hausa | ||||||||
Hebrew | Hebrew | Hebrew | עברית | heb | Hebrew | עברית | |||||||
Ido-Reformed Esperanto | Ido | Ido | ido | Ido | Ido | ||||||||
Hindi | Hindi | Hindi | हिन्दी | hin | Hindi | हिन्दी | |||||||
Hungarian | Hungarian | Hungarian | Magyar | hun | Hungarian | Magyar | |||||||
Icelandic | Icelandic | Icelandic | Íslenska | isl | Icelandic | Íslenska | |||||||
Imho | delete | ||||||||||||
Indonesian | Indonesian | Indonesian | Bahasa Indonesia | ind | Indonesian | Bahasa Indonesia | |||||||
Iranian | pes | Iranian Persian | merge | ||||||||||
Inupiak | Inupiaq | Inupiat | Iñupiatun | ipk | Inupiaq | ||||||||
Interlingua | Interlingua | Interlingua | ina | Interlingua (International Auxiliary Language Association) | |||||||||
Italian | Italian | Italian | Italiano | ita | Italian | Italiano | |||||||
Japanese | Japanese | Japanese | 日本語 | jpn | Japanese | 日本語 | |||||||
Javanese | Javanese | Javanese | ꦧꦱꦗꦮ | jav | Javanese | ||||||||
Kannada | Kannada | Kannada | ಕನ್ನಡ | kan | Kannada | ಕನ್ನಡ | |||||||
Kashmiri | Kashmiri | Kashmiri | كٲشُر, कॉशुर, Kạ̄šur, Koshur | ||||||||||
Kazakh | Kazakh | Kazakh | Қазақ Tілі | ||||||||||
Kinyarwanda | Kinyarwanda | Kinyarwanda | Ikinyarwanda or Runyarwanda | ||||||||||
Kirghiz | Kyrgyz | ||||||||||||
Kirundi | Kirundi | ||||||||||||
Korean | Korean | Korean | 한국어, 조선말 | kor | |||||||||
Kurdish | Kurdish | Kurdish | Kurdí, کوردی, or K’öрди | ||||||||||
Latin | Latin | ||||||||||||
Laothian | Lao | Lao | ພາສາລາວ | ພາສາລາວ | |||||||||
Latvian | Latvian | Latvian | Latviešu | Latviešu | |||||||||
Lingala | Lingala | Lingala | Lingála | ||||||||||
Lithuanian | Lithuanian | Lithuanian | Lietuvių | Lietuvių | |||||||||
Macedonian | Macedonian | Macedonian | Mакедонски | Mакедонски | |||||||||
Malagasy | Malagasy | ||||||||||||
Malay | Malay | Malay | بهاس ملايو or Bahasa Melayu | ||||||||||
Malayalam | Malayalam | Malayalam | മലയാളം | മലയാളം | |||||||||
Maltese | Maltese | Maltese | Malti | Malti | |||||||||
Maori | Māori | Māori | te Reo Māori | ||||||||||
Marathi | Marathi | Marathi | मराठी | मराठी | |||||||||
Middle Eastern | |||||||||||||
Moldavian | Moldovan | ||||||||||||
Mongolian | Mongolian | Mongolian | Монгол Хэл | Монгол Хэл | |||||||||
Moroccan | |||||||||||||
Nauru | Nauruan | ||||||||||||
Nepali | Nepali | Nepali | नेपाली | नेपाली | |||||||||
Norwegian | Norwegian | Norwegian | Norsk | Norsk | |||||||||
Occitan | Occitan | Occitan | Occitan | Occitan | |||||||||
Oriya | Odia | Odia | ଓଡ଼ିଆ | ||||||||||
Oromo (Afan) | Oromo | ||||||||||||
Pashto (Pushto) | Pashto | ||||||||||||
Persian | Persian | Persian | فارسی | ||||||||||
Polish | Polish | Polish | Język polski, polski, or polszczyzna | Polski | |||||||||
Portuguese | Portuguese | Portuguese | Português | por | Português | ||||||||
Provencal | |||||||||||||
Punjabi | Punjabi | Punjabi language | पंजाबी, ਪੰਜਾਬੀ, or پنجابی | ||||||||||
Quenya | |||||||||||||
Quechua | Quechua | ||||||||||||
Rhaeto-Romance | |||||||||||||
Romanian | Romanian | Romanian | Română | Română | |||||||||
Russian | Russian | Russian | Русский | rus | Русский | ||||||||
Samoan | Samoan | Samoan | Gagana Sāmoa | ||||||||||
Sanskrit | Sanskrit | Sanskrit | संस्कृतम्, संस्कृता वाक् | संस्कृतम्, संस्कृता वाक् | |||||||||
Sardinian | Sardinian | Sardu | |||||||||||
Sangro | |||||||||||||
Serbian | Serbian | Serbian | Српски, Srpski | srp | Српски, Srpski | ||||||||
Imho | |||||||||||||
Sesotho | Sotho | Sotho | Sesotho | ||||||||||
Setswana | Tswana | Tswana | Setswana | ||||||||||
Sindhi | Sindhi | Sindhi | سنڌي | Sindhi | |||||||||
Shona | Shona | Shona | Shona | ||||||||||
Siswati | |||||||||||||
Sign Language | Sign language | ||||||||||||
Sinhalese | Sinhala | Sinhala | සිංහල | ||||||||||
Singhalese | |||||||||||||
Slovak | Slovak | Slovak | Slovenčina | ||||||||||
Slovenian | Slovene | Slovene | Slovenščina | ||||||||||
Somali | Somali | Somali | اللغة الصومالية, Af-Soomaali | ||||||||||
Spanish | Spanish | Spanish | Español | spa | Español | ||||||||
Sudanese | Sundanese | Sundanese | ᮘᮞ ᮞᮥᮔ᮪ᮓ | ||||||||||
Swahili | Swahili | Swahili | Kiswahili | ||||||||||
Swedish | Swedish | Swedish | Svenska | swe | Svenska | ||||||||
Tajik | Tajik | ||||||||||||
Tamil | Tamil | Tamil | தமிழ் | தமிழ் | |||||||||
Tatar | Tatar | Tatar | تاتارچا, Tatarça, Tатарча | ||||||||||
Telugu | Telugu | Telugu | తెలుగు | తెలుగు | |||||||||
Thai | Thai | Thai | ภาษาไทย | ภาษาไทย | |||||||||
Tibetan | Tibetan | དབུས་སྐད་ | Tibetan | ||||||||||
Tigrinya | Tigrinya | ትግርኛ | |||||||||||
Tonga | Tongan | Tongan | Lea faka-Tonga | ||||||||||
Tsonga | Tsonga | ||||||||||||
Turkish | Turkish | Turkish | Türkçe | tur | Türkçe | ||||||||
Turkmen | Turkmen | ||||||||||||
Twi | Twi | ||||||||||||
Ukrainian | Ukrainian | Ukrainian | Українська | Українська | |||||||||
Urdu | Urdu | Urdu | اُردُو | ||||||||||
Uzbek | Uzbek | Uzbek | اوزبیک, Ўзбек, Oʻzbek | ||||||||||
Vietnamese | Vietnamese | Vietnamese | Tiếng Việt Nam | Tiếng Việt | |||||||||
Volapuk | Volapük | Volapük | |||||||||||
Welsh | Welsh | Welsh | Cymraeg | Cymraeg | |||||||||
Xhosa | Xhosa | Xhosa | Xhosa | Xhosa | |||||||||
Wolof | Wolof | Wolof | Wolof | ||||||||||
Yiddish | Yiddish | ייִדיש | Yiddish | ייִדיש | |||||||||
Yoruba | Yoruba | Yoruba | Èdè Yorùbá | ||||||||||
Zulu | Zulu | Zulu | Zulu |