Difference between revisions of "Language/Multiple-languages/Culture/Internet-Vocabularies"

From Polyglot Club WIKI
Jump to navigation Jump to search
m (Quick edit)
 
(One intermediate revision by one other user not shown)
Line 10: Line 10:


In progress.
In progress.
Visit https://codeberg.org/GrimPixel/standard-character-lists to download standard lists in TSV format.


== Common word/character list ==
== Common word/character list ==
Line 80: Line 82:
Kazakh https://github.com/taem/hunspell-kk
Kazakh https://github.com/taem/hunspell-kk


==Related Lessons==
==Other Lessons==
* [[Language/Multiple-languages/Culture/Different-ways-to-greet-in-the-world|Different ways to greet in the world]]
* [[Language/Multiple-languages/Culture/Different-ways-to-greet-in-the-world|Different ways to greet in the world]]
* [[Language/Multiple-languages/Culture/Texts-and-Audios-under-a-Public-License|Texts and Audios under a Public License]]
* [[Language/Multiple-languages/Culture/Texts-and-Audios-under-a-Public-License|Texts and Audios under a Public License]]
Line 91: Line 93:
* [[Language/Multiple-languages/Culture/The-Polyglot-Club-Team|The Polyglot Club Team]]
* [[Language/Multiple-languages/Culture/The-Polyglot-Club-Team|The Polyglot Club Team]]
* [[Language/Multiple-languages/Culture/Important-Technologies|Important Technologies]]
* [[Language/Multiple-languages/Culture/Important-Technologies|Important Technologies]]
<span links></span>

Latest revision as of 17:24, 22 May 2023

Multiple-languages-flag-polyglotclub.jpg

On this page you will find vocabularies to memorise. This page is not to be confused with Language/Multiple-languages/Culture/Internet-Dictionaries. Here are word lists that possess one of the following features:

  • contain frequency or grading information
  • no translations, definitions or pronunciations

Computer programs are included, too.

They can be made use of by merging with dictionary meaning data. This may require web scraping.

In progress.

Visit https://codeberg.org/GrimPixel/standard-character-lists to download standard lists in TSV format.

Common word/character list[edit | edit source]

Multiple languages https://en.wiktionary.org/wiki/Appendix:Swadesh_lists

Chinese

English https://github.com/HK-SHAO/English-Dictionary/blob/master/word/words.txt

Japanese

Thai https://github.com/nv23/thai-wordlist

Vietnamese https://www.chunom.org/

Frequency list[edit | edit source]

Multiple languages https://en.wiktionary.org/wiki/Wiktionary:Frequency_lists

Chinese

Kannada https://github.com/kakashi/kannada_IN_dictionary

Korean 자주 쓰이는 한국어 낱말 https://ko.wiktionary.org/wiki/%EB%B6%80%EB%A1%9D:%EC%9E%90%EC%A3%BC_%EC%93%B0%EC%9D%B4%EB%8A%94_%ED%95%9C%EA%B5%AD%EC%96%B4_%EB%82%B1%EB%A7%90_5800

Graded list[edit | edit source]

Chinese

Japanese

Korean

Russian https://en.openrussian.org/vocab/A1

Spell checker[edit | edit source]

The word lists are in the spell checkers' source code: CWL files in GNU Aspell, can be opened with TeXstudio; DIC files in Hunspell, can be opened with a text editor.

Multiple languages

Croatian https://github.com/spideyfusion/elasticsearch-croatian

Indonesian https://github.com/shuLhan/hunspell-id

Kazakh https://github.com/taem/hunspell-kk

Other Lessons[edit | edit source]