Difference between revisions of "Language/Multiple-languages/Culture/Internet-Vocabularies"
Revision as of 11:43, 17 April 2021
This page collects vocabularies to memorise. It is not to be confused with Language/Multiple-languages/Culture/Internet-Dictionaries: the word lists here come without translations, definitions or pronunciations. The page also lists programs that apply such word lists.
To make use of these lists, merge them with dictionary data. This may require web scraping.
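As a rough illustration of merging a word list with dictionary data, here is a minimal Python sketch. The function name `merge_word_list` and the sample data are hypothetical, and the dictionary is assumed to be already available as a plain mapping from word to entry; obtaining that mapping (for example by web scraping) is not shown.

```python
def merge_word_list(words, dictionary):
    """Pair each word with its dictionary entry, keeping word-list order.

    Returns (merged, missing): merged is a list of (word, entry) tuples,
    missing is a list of words that have no entry in the dictionary.
    """
    merged = []
    missing = []
    for word in words:
        key = word.strip()
        if not key:
            continue  # skip blank lines from the word list
        entry = dictionary.get(key)
        if entry is None:
            missing.append(key)
        else:
            merged.append((key, entry))
    return merged, missing


# Hypothetical sample data, for illustration only.
word_list = ["water", "fire", "", "stone"]
dictionary_data = {
    "water": "a clear liquid",
    "fire": "burning that gives heat and light",
}

merged, missing = merge_word_list(word_list, dictionary_data)
```

Keeping the unmatched words in a separate list shows which entries still need to be looked up elsewhere.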
In progress.
Common word/character list
Multiple languages https://en.wiktionary.org/wiki/Appendix:Swadesh_lists
English https://github.com/HK-SHAO/English-Dictionary/blob/master/word/words.txt
Mandarin Chinese https://zh.wikisource.org/wiki/%E9%80%9A%E7%94%A8%E8%A7%84%E8%8C%83%E6%B1%89%E5%AD%97%E8%A1%A8
Mandarin Chinese https://zh.wikisource.org/wiki/%E5%B8%B8%E7%94%A8%E5%9C%8B%E5%AD%97%E6%A8%99%E6%BA%96%E5%AD%97%E9%AB%94%E8%A1%A8
Thai https://github.com/nv23/thai-wordlist
Vietnamese https://www.chunom.org/
Frequency list
Multiple languages https://en.wiktionary.org/wiki/Wiktionary:Frequency_lists
Chinese https://humanum.arts.cuhk.edu.hk/Lexis/chifreq/
Kannada https://github.com/kakashi/kannada_IN_dictionary
Mandarin Chinese https://lingua.mtsu.edu/chinese-computing/statistics/index.html
Mandarin Chinese http://technology.chtsai.org/charfreq/
Yue Chinese https://humanum.arts.cuhk.edu.hk/Lexis/lexi-can/faq.php
Graded word/character list
Japanese https://ja.wikipedia.org/wiki/%E5%AD%A6%E5%B9%B4%E5%88%A5%E6%BC%A2%E5%AD%97%E9%85%8D%E5%BD%93%E8%A1%A8
Korean https://www.topik.go.kr/usr/cmm/subLocation.do?menuSeq=2110503&boardSeq=64217
Mandarin Chinese http://www.chinesetest.cn/godownload.do#list_1
Mandarin Chinese http://www.tw.org/tocfl/
Spell checker
Some require knowledge of GNU Aspell and Hunspell. You can find the word list in the spell checker's source code.
Multiple languages https://addons.mozilla.org/en-US/firefox/language-tools/
Multiple languages https://ftp.gnu.org/gnu/aspell/dict/0index.html
Multiple languages https://wiki.documentfoundation.org/Language_support_of_LibreOffice
Croatian https://github.com/spideyfusion/elasticsearch-croatian
Indonesian https://github.com/shuLhan/hunspell-id
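For the Hunspell-based dictionaries above, the word list usually lives in a `.dic` file: the first line holds an approximate entry count, and each following line holds a word, optionally followed by `/` and affix flags. The Python sketch below extracts the bare words from such text. It is a simplification under stated assumptions: the function name `words_from_hunspell_dic` and the sample content are hypothetical, entries are assumed to contain no spaces or escaped slashes, and affix expansion (deriving inflected forms from the `.aff` file) is not attempted. Aspell dictionaries use a different, compressed format and are not covered here.

```python
def words_from_hunspell_dic(text):
    """Return the base words from the text of a Hunspell .dic file.

    Assumptions (simplified): one entry per line, no spaces or escaped
    slashes inside a word, affix flags follow the first '/'.
    """
    lines = text.splitlines()
    # The first line normally holds an approximate entry count; drop it.
    if lines and lines[0].strip().isdigit():
        lines = lines[1:]

    words = []
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        # Keep only the part before the flag separator.
        word = fields[0].partition("/")[0]
        if word:
            words.append(word)
    return words


# Hypothetical .dic content, for illustration only.
sample_dic = "3\ncat/S\ndog/S\nrun/DGS\tpo:verb\n"
base_words = words_from_hunspell_dic(sample_dic)
```

The result contains base forms only, so it will be shorter than the full set of words the spell checker accepts.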