Language/Multiple-languages/Culture/Internet-Vocabularies

From Polyglot Club WIKI
< Language‎ | Multiple-languages‎ | Culture
Revision as of 13:38, 25 April 2021 by Darik77777 (talk | contribs) (Fixed typo)
Jump to navigation Jump to search
Rate this lesson:
5.00
(one vote)


On this page you will find vocabularies to memorise. This page is not to be confused with Language/Multiple-languages/Culture/Internet-Dictionaries. Here are word lists that possess one of the following features:

  • contain frequency or grading information
  • no translations, definitions or pronunciations

Computer programms are included, too.

They can be made use of by merging with dictionary meaning data. This may require web scraping.

In progress.

Common word/character list

Multiple languages https://en.wiktionary.org/wiki/Appendix:Swadesh_lists

Chinese

English https://github.com/HK-SHAO/English-Dictionary/blob/master/word/words.txt

Japanese


Korean https://www.topik.go.kr/usr/cmm/subLocation.do?menuSeq=2110503&boardSeq=64217

Thai https://github.com/nv23/thai-wordlist

Vietnamese https://www.chunom.org/

Frequency list

Multiple languages https://en.wiktionary.org/wiki/Wiktionary:Frequency_lists

Chinese

Kannada https://github.com/kakashi/kannada_IN_dictionary

Korean https://ko.wiktionary.org/wiki/%EB%B6%80%EB%A1%9D:%EC%9E%90%EC%A3%BC_%EC%93%B0%EC%9D%B4%EB%8A%94_%ED%95%9C%EA%B5%AD%EC%96%B4_%EB%82%B1%EB%A7%90_5800

Graded list

Chinese

Japanese

Korean

Russian https://en.openrussian.org/vocab/A1

Spell checker

The word lists are in the spell checkers' source code: CWL files in GNU Aspell, can be opened with TeXstudio; DIC files in Hunspell, can be opened with a text editor.

Multiple languages

Croatian https://github.com/spideyfusion/elasticsearch-croatian

Indonesian https://github.com/shuLhan/hunspell-id

Kazakh https://github.com/taem/hunspell-kk

Contributors

GrimPixel and Maintenance script


Create a new Lesson