Editing Language/Multiple-languages/Culture/Licensed-Free-Databases

Latest revision Your text
Line 1: Line 1:
<div class="pg_page_title">Licensed-Free Databases Around Languages</div>
[[Category:Free-Resources]]
[[File:best-licensed-free-databases-polyglotclub.jpg|thumb]]
The items listed are databases, not applications. If you don't know programming, they may not be of much help to you.
Hi polyglots! 😀
 
➡ On this page we have listed free databases related to languages.
 
* Those mentioned on [[Language/Multiple-languages/Culture/Internet-Dictionaries|Internet Dictionaries]] will not be mentioned again here.
 
* The listed items are data, so if you don't know programming, this page might not be of much help to you.


== Main ==
== Main ==


=== Multiple languages ===
=== Multiple languages ===
==== https://www.ethnologue.com/codes/download-code-tables ====
LanguageCodes.tab lists the 7,400+ distinct language identifiers used in the current Ethnologue database.


==== https://dumps.wikimedia.org/ ====
==== https://dumps.wikimedia.org/ ====
Line 21: Line 12:


Wikimedia.
Wikimedia.
==== https://iate.europa.eu/download-iate/ ====
License: https://iate.europa.eu/download-iate/
Some of its users: https://iate.europa.eu/download-iate/
Terminology dictionary of the EU.


==== https://tatoeba.org/eng/downloads/ ====
==== https://tatoeba.org/eng/downloads/ ====
Line 49: Line 47:


Audio books.
Audio books.
==== https://freedict.org/downloads/ ====
License: https://freedict.org/about/
Some of its users: http://aarddict.org/
Dictionaries.


==== http://www.omegawiki.org/Help:Downloading_the_data ====
==== http://www.omegawiki.org/Help:Downloading_the_data ====
Line 56: Line 61:


Dictionaries.
Dictionaries.
==== http://www.xobdo.org/downloads/ ====
License: http://www.xobdo.org/downloads/
Some of its users: http://www.xobdo.org/
Dictionaries for South Asian languages and English.


==== https://ltrc.iiit.ac.in/onlineServices/Dictionaries/Dict_Frame.html ====
==== https://ltrc.iiit.ac.in/onlineServices/Dictionaries/Dict_Frame.html ====
Line 121: Line 133:
Archived Internet content.
Archived Internet content.


==== https://www.fandom.com/ ====
=== American Sign Language ===
License: https://www.fandom.com/licensing
 
==== http://www.asl-lex.org/ ====
License: http://www.asl-lex.org/
 
Lexicon.
 
=== Burmese ===
 
==== https://github.com/saturngod/ornagai-V2 ====
License: https://github.com/saturngod/ornagai-V2/blob/master/License
 
Some of its users: https://www.ornagai.com/#/
 
Dictionary.
 
=== Catalan ===
 
==== http://www.catalandictionary.org/en/search/ ====
License: http://www.catalandictionary.org/en/search/


Fan-made wiki.
Dictionary. The license text is shown in a very small font.


=== Chinese ===
=== Chinese ===
==== https://resources.publicense.moe.edu.tw/index.html ====
License: https://resources.publicense.moe.edu.tw/index.html
Some of its users: https://resources.publicense.moe.edu.tw/index.html, https://www.moedict.tw/
Dictionaries of ROC Mandarin Chinese written in ROC Mandarin Chinese.
==== https://cc-cedict.org/editor/editor.php ====
License: https://cc-cedict.org/wiki/
Some of its users: https://www.mdbg.net/chinese/dictionary, https://www.pleco.com/
Mandarin-English dictionary.
==== https://chine.in/mandarin/dictionnaire/CFDICT/ ====
License: https://chine.in/mandarin/dictionnaire/CFDICT/
Some of its users: https://chine.in/, https://www.pleco.com/
Mandarin-French dictionary.
==== https://handedict.zydeo.net/de/download ====
License: https://handedict.zydeo.net/de/download
Some of its users: https://www.pleco.com/
Mandarin-German dictionary.
==== https://chdict.zydeo.net/en/download/ ====
License: https://chdict.zydeo.net/en/download/
Some of its users: https://chdict.zydeo.net/hu/
Mandarin-Hungarian dictionary.
==== http://cantonese.org/download.html ====
License: http://cantonese.org/download.html
Some of its users: http://cantonese.org/, https://www.pleco.com/
Cantonese-English dictionary.
==== https://twblg.dict.edu.tw/holodict_new/compile1_6_1.jsp ====
License: https://twblg.dict.edu.tw/holodict_new/compile1_6_1.jsp
Some of its users: https://twblg.dict.edu.tw/holodict_new/default.jsp, https://www.moedict.tw/
Taiwanese-English dictionary. It can be requested by email.
==== http://www.taiwanesedictionary.org/ ====
License: http://www.taiwanesedictionary.org/
Taiwanese-English dictionary.


==== http://lingua.mtsu.edu/chinese-computing/ ====
==== http://lingua.mtsu.edu/chinese-computing/ ====
License: http://lingua.mtsu.edu/chinese-computing/copyright.html
License: http://lingua.mtsu.edu/chinese-computing/copyright.html


Character frequency lists.
Frequency lists.
 
==== https://github.com/gwinterstein/Cifu ====
License: https://github.com/gwinterstein/Cifu/blob/master/LICENSE
 
Word frequency list for Yue Chinese.


==== https://www.tanos.co.uk/hsk/ ====
==== https://www.tanos.co.uk/hsk/ ====
Line 162: Line 241:


Frequent characters.
Frequent characters.
=== Esperanto ===
==== http://reta-vortaro.de/tgz/index.html ====
License: http://reta-vortaro.de/tgz/index.html
Some of its users: http://reta-vortaro.de/, http://www.busydoingnothing.co.uk/prevo/
Dictionary in several languages.
==== http://www.denisowski.org/Esperanto/ESPDIC/espdic_readme.html ====
License: http://www.denisowski.org/Esperanto/ESPDIC/espdic_readme.html
Some of its users: http://www.denisowski.org/Esperanto/ESPDIC/espdic_readme.html
Dictionary.
==== https://komputeko.net/elsxutejo-en.php ====
License: https://komputeko.net/index_en.php
Some of its users: https://komputeko.net/index_en.php
Computer terminology dictionary.
=== German Sign Language ===
==== https://signdict.org/ ====
License: https://signdict.org/about
Some of its users: https://signdict.org/
Dictionary.


=== English ===
=== English ===
Line 179: Line 290:
Dictionary about computing.
Dictionary about computing.


==== https://github.com/tony-mak/Eng-Chi-Dictionary/tree/master/app/src/main/assets/databases ====
==== https://github.com/skywind3000/ECDICT ====
License: https://github.com/tony-mak/Eng-Chi-Dictionary/blob/master/LICENSE
License: https://github.com/skywind3000/ECDICT/blob/master/LICENSE
 
Some of its users: https://github.com/program-in-chinese/webextension_english_chinese_dictionary


Dictionary.
Dictionary.


==== https://github.com/linuxkathirvel/eng2tamildictionary/blob/master/dictionary.json ====
=== Estonian ===
License: https://github.com/linuxkathirvel/eng2tamildictionary/blob/master/License.txt
 
==== https://www.eki.ee/litsents/ ====
License: https://www.eki.ee/litsents/
 
Some of its users: http://portaal.eki.ee/sonaraamatud.html


Dictionary.
Dictionaries. Only two are actually available.


==== https://github.com/derekchuank/high-frequency-vocabulary ====
=== German ===
License: https://github.com/derekchuank/high-frequency-vocabulary/blob/master/LICENSE


Dictionary.
==== https://www.openthesaurus.de/about/download/ ====
License: https://www.openthesaurus.de/about/download/


==== https://github.com/kujirahand/EJDict/tree/master/src ====
Some of its users: https://www.openthesaurus.de/about/download/
License: https://github.com/kujirahand/EJDict/blob/master/LICENSE


Dictionary.
Thesaurus.


=== Hindi ===
=== Hindi ===
Line 212: Line 328:
==== https://www.ling.upenn.edu/~kurisuto/germanic/oi_cleasbyvigfusson_about.html ====
==== https://www.ling.upenn.edu/~kurisuto/germanic/oi_cleasbyvigfusson_about.html ====
License: http://lexicon.ff.cuni.cz/txt/oi_cleasbyvigfusson.txt
License: http://lexicon.ff.cuni.cz/txt/oi_cleasbyvigfusson.txt
Dictionary.
=== Interlingua ===
==== http://www.denisowski.org/Interlingua/IEDICT/iedict_readme.html ====
License: http://www.denisowski.org/Interlingua/IEDICT/iedict_readme.html
Some of its users: http://www.denisowski.org/Interlingua/IEDICT/iedict_readme.html


Dictionary.
Dictionary.
Line 221: Line 346:


Spell checker with dictionary.
Spell checker with dictionary.
=== Iranian Persian ===
==== https://github.com/amirshnll/English-Persian-Word-Database ====
License: https://github.com/amirshnll/English-Persian-Word-Database/blob/master/LICENSE
Dictionary.


=== Japanese ===
=== Japanese ===
==== http://www.edrdg.org/wiki/index.php/Main_Page ====
License: https://www.edrdg.org/edrdg/licence.html
Some of its users: https://jisho.org/, https://www.tagaini.net/
Japanese dictionaries.


==== https://github.com/KanjiVG/kanjivg/releases/ ====
==== https://github.com/KanjiVG/kanjivg/releases/ ====
Line 230: Line 369:


Kanji strokes.
Kanji strokes.
==== http://dico.fj.free.fr/dico.php ====
License: http://dico.fj.free.fr/copyright.php
Some of its users: http://dico.fj.free.fr/traduction/index.php
Japanese-French dictionary.


==== https://github.com/mifunetoshiro/kanjium ====
==== https://github.com/mifunetoshiro/kanjium ====
Line 255: Line 401:


Frequent characters according to school grades.
Frequent characters according to school grades.
=== Jeju ===
==== https://jeju.go.kr/culture/dialect/dictionary.htm ====
License: https://jeju.go.kr/help/policy/copyright.htm
Some of its users: https://jeju.go.kr/culture/dialect/dictionary.htm
Dictionary.
=== Klingon ===
==== http://klingonska.org/dict/dict.zdb ====
License: http://klingonska.org/dict/
Some of its users: http://klingonska.org/dict/
Dictionary.


=== Korean ===
=== Korean ===
==== https://krdict.korean.go.kr/mainAction ====
License: https://krdict.korean.go.kr/kboardPolicy/copyRightTermsInfo
Some of its users: https://krdict.korean.go.kr/mainAction
Dictionary. Download link is unknown.
==== https://opendict.korean.go.kr/main ====
License: https://opendict.korean.go.kr/service/copyrightPolicy
Some of its users: https://opendict.korean.go.kr/main
Dictionary. Download link is unknown.
==== https://stdict.korean.go.kr/main/main.do ====
License: https://stdict.korean.go.kr/join/copyrightPolicy.do
Some of its users: https://stdict.korean.go.kr/main/main.do
Dictionary. Download link is unknown.
==== https://github.com/garfieldnate/kengdic ====
License: https://github.com/garfieldnate/kengdic
Some of its users: http://www.toktogi.com/
Dictionary.


==== https://github.com/libhangul/libhangul/blob/master/data/hanja/hanja.txt ====
==== https://github.com/libhangul/libhangul/blob/master/data/hanja/hanja.txt ====
Line 282: Line 474:
Spell checker with dictionary.
Spell checker with dictionary.


=== Sanskrit ===
=== Russian ===
 
==== https://en.openrussian.org/dictionary ====
License: https://en.openrussian.org/dictionary


==== https://github.com/hemanth/sanskrit-dict/blob/master/dict.js ====
Some of its users: https://en.openrussian.org/
License: https://github.com/hemanth/sanskrit-dict/blob/master/license


Dictionary.
Dictionary.
Line 297: Line 491:


=== Vietnamese ===
=== Vietnamese ===
==== http://www.informatik.uni-leipzig.de/~duc/Dict/install.html ====
License: http://www.informatik.uni-leipzig.de/~duc/Dict/install.html
Some of its users: https://www.informatik.uni-leipzig.de/~duc/Dict/
Dictionaries in several languages.
An introduction page is available: https://vi.wiktionary.org/wiki/Wiktionary:Ngu%E1%BB%93n_g%E1%BB%91c/FVDP
==== http://www.denisowski.org/Vietnamese/vnedict_readme.htm ====
License: http://www.denisowski.org/Vietnamese/vnedict_readme.htm
Some of its users: http://www.denisowski.org/Vietnamese/vnedict_readme.htm
Dictionary.


==== https://github.com/duyetdev/vietnamese-wordlist ====
==== https://github.com/duyetdev/vietnamese-wordlist ====
Line 361: Line 571:
!field separator
!field separator
!
!
!field 1
!1<sup>st</sup> field
!field 2
!2<sup>nd</sup> field
!field 3
!3<sup>rd</sup> field
!field 4
!4<sup>th</sup> field
!field 5
!5<sup>th</sup> field
!field 6
!6<sup>th</sup> field
!field 7
!7<sup>th</sup> field
!field 8
!8<sup>th</sup> field
!field 9
!9<sup>th</sup> field
!field 10
!10<sup>th</sup> field
!field 11
!11<sup>th</sup> field
!field 12
!12<sup>th</sup> field
!field 13
|-
! colspan="15" |dictionary
!
!
|-
|[https://github.com/tomcumming/tocfl-word-list An ordered and extended TOCFL word-list]
|tocfl.tsv
|<tab>
|
|Word
|Pinyin
|OtherPinyin
|Level
|First Translation
|Other Translation
|
|
|
|
|
|
|
|-
|-
|[https://cantonese.org/download.html CC-Canto]
|[https://cantonese.org/download.html CC-Canto]
Line 406: Line 593:
|{jyut6 ping3}
|{jyut6 ping3}
|/English equivalent 1/equivalent 2/
|/English equivalent 1/equivalent 2/
|
|
|
|
|
Line 423: Line 609:
|[pin1 yin1]
|[pin1 yin1]
|/English equivalent 1/equivalent 2/
|/English equivalent 1/equivalent 2/
|
|
|
|
|
Line 441: Line 626:
|[pin1 yin1]
|[pin1 yin1]
|/traduction 1/traduction2/
|/traduction 1/traduction2/
|
|
|
|
|
Line 467: Line 651:
|
|
|
|
|
|-
|[https://github.com/skywind3000/ECDICT ECDICT]
|ecdict.csv
|,
|
|word
|phonetic
|definition
|translation
|pos
|collins
|oxford
|tag
|bnc
|frq
|exchange
|detail
|audio
|-
|-
|[https://github.com/amirshnll/English-Persian-Word-Database English Persian Word Database]
|[https://github.com/amirshnll/English-Persian-Word-Database English Persian Word Database]
Line 493: Line 658:
|EnglishWord
|EnglishWord
|PersianWord
|PersianWord
|
|
|
|
|
Line 511: Line 675:
|Esperanto
|Esperanto
|English
|English
|
|
|
|
|
Line 531: Line 694:
|[pin1 yin1]
|[pin1 yin1]
|/deutsche Entsprechung 1 /Entsprechung 2/
|/deutsche Entsprechung 1 /Entsprechung 2/
|
|
|
|
|
Line 548: Line 710:
|Hanja
|Hanja
|note
|note
|
|
|
|
|
Line 565: Line 726:
|Interlingua
|Interlingua
|English
|English
|
|
|
|
|
Line 583: Line 743:
|eeste
|eeste
|inglise
|inglise
|
|
|
|
|
Line 609: Line 768:
|Hiragana
|Hiragana
|English
|English
|
|
|
|
|
Line 636: Line 794:
|?
|?
|?
|?
|
|-
|-
|[http://www.taiwanesedictionary.org/ The Maryknoll Taiwanese-English Dictionary & English-Taiwanese Dictionary 2013 edition]
|[http://www.taiwanesedictionary.org/ The Maryknoll Taiwanese-English Dictionary & English-Taiwanese Dictionary 2013 edition]
Line 646: Line 803:
|Chinese
|Chinese
|English
|English
|
|
|
|
|
Line 670: Line 826:
|
|
|
|
|
|
|
|-
! colspan="15" |word list
|
|
|
|
Line 686: Line 837:
|길잡이말
|길잡이말
|품사
|품사
|
|
|
|
|
|
|
|
|
|-
|[https://lingua.mtsu.edu/chinese-computing/statistics/index.html 古汉语单字字频: Character frequency list of Classical Chinese]
|CharFreq-Classical.xls
|
|
|Serial number; 序号
|Character; 汉字
|
|
|
|
|
|
|
Line 716: Line 848:
|[https://lingua.mtsu.edu/chinese-computing/statistics/index.html 现代汉语单字字频: Character frequency list of Modern Chinese]  
|[https://lingua.mtsu.edu/chinese-computing/statistics/index.html 现代汉语单字字频: Character frequency list of Modern Chinese]  
|CharFreq.txt
|CharFreq.txt
|<tab>
|
|
|
|Serial number; 序号
|Serial number; 序号
Line 724: Line 856:
|Pinyin; 拼音
|Pinyin; 拼音
|English translation; 英文翻译
|English translation; 英文翻译
|
|
|
|
|
Line 738: Line 869:
|编号
|编号
|字形
|字形
|
|
|
|
|
Line 758: Line 888:
|Unicode
|Unicode
|常用字
|常用字
|
|
|
|
|
Line 773: Line 902:
|
|
|单词(等级)
|单词(等级)
|
|
|
|
|
Line 811: Line 939:
# Replace '/\n' with '\n';
# Replace '/\n' with '\n';
# Add 'Traditional\tSimplified\tpin1 yin1\tEnglish equivalent 1/equivalent 2\n' at the beginning;
# Add 'Traditional\tSimplified\tpin1 yin1\tEnglish equivalent 1/equivalent 2\n' at the beginning;
|-
|CharFreq.txt
|
# Delete lines starting with '/';
# [https://stackoverflow.com/questions/15361632/delete-a-column-with-awk-or-sed Delete fields 3, 4];
# Add '序列号\t汉字\t拼音\t英文翻译' at the beginning;
|-
|CharFreq-Classical.xls
|
# Delete the first row;
# Delete fields 3, 4;
# Save as TSV file or save as CSV file and select '<tab>' as field separator;
|-
|-
|CHDICT.u8
|CHDICT.u8
Line 830: Line 946:
# Replace the first ' ' in each line with '\t';
# Replace the first ' ' in each line with '\t';
# Replace the first ' [' in each line with '\t';
# Replace the first ' [' in each line with '\t';
# Replace '] /' with '\t';
# Replace '] /' with '\t';
# Replace '/\n' with '\n';
# Replace '/\n' with '\n';
# Add 'Tradicionális\tEgyszerűsített\tpin1 yin1\tmagyar egyenérték 1/ egyenérték 2\n' at the beginning;
# Add 'Tradicionális\tEgyszerűsített\tpin1 yin1\tmagyar egyenérték 1/ egyenérték 2\n' at the beginning;
|-
|ecdict.csv
|
# Open with a spreadsheet program;
# Save as TSV file or save as CSV file and select '<tab>' as field separator;
|-
|-
|eestiinglise.txt
|eestiinglise.txt
Line 862: Line 973:
# Replace '] /' with '\t';
# Replace '] /' with '\t';
# Replace '/\n' with '\n';
# Replace '/\n' with '\n';
# Add 'Traditionell\tVereinfacht\tpin1 yin1\tdeutsche Entsprechung 1/Entsprechung 2\n' at the beginning;
# Add 'Traditionell\tVereinfacht\tpin1 yin1\tdeutsche Entsprechung 1 /Entsprechung 2 \n' at the beginning;
|-
|-
|hanja.txt
|hanja.txt
|
|
# Delete lines starting with ' #';
# Delete lines starting with ' #';
# Replace '<nowiki>:</nowiki>' with '\t';
# Replace '<nowiki>:</nowiki>' with '\t';
# Add 'Hangul\tHanja\tnote' at the beginning;
# Add 'Hangul\tHanja\tnote' at the beginning;
|-
|-
|HSK-2012.xls
|HSK-2012.xls
Line 887: Line 998:
|kengdic_2011.tsv
|kengdic_2011.tsv
|
|
# Delete fields 1, 3, 5, 6, 7, 8, 9, 11, 12;
# [https://stackoverflow.com/questions/15361632/delete-a-column-with-awk-or-sed Delete fields 1, 3, 5, 6, 7, 8, 9, 11, 12];
# Add 'word\tdef\thanja\n' at the beginning;
|-
|-
|Mkdictionary.xls
|Mkdictionary.xls
Line 894: Line 1,004:
# Open with a spreadsheet program;
# Open with a spreadsheet program;
# Save as TSV file or save as CSV file and select '<tab>' as field separator;
# Save as TSV file or save as CSV file and select '<tab>' as field separator;
|-
|tocfl.tsv
|
# Replace '"\t"' with '\t';
# Replace '"\n"' with '\n';
# Replace the first '"' with <nowiki>''</nowiki>;
# Replace the last '"' with <nowiki>''</nowiki>;
|-
|-
|vnedict.txt
|vnedict.txt
Line 931: Line 1,034:
|}
|}
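
The replacement steps listed in the table for the CEDICT-style files (for example CHDICT.u8) can be scripted instead of done by hand in a text editor. The following is only a minimal sketch in Python. The function names are made up for illustration, and it assumes that each entry line has the shape "Traditional Simplified [pin1 yin1] /equivalent 1/equivalent 2/" and that comment lines start with '#'.

```python
# Sketch of the CEDICT-style conversion steps. All names are illustrative.
HEADER = "Traditional\tSimplified\tpin1 yin1\tEnglish equivalent 1/equivalent 2"


def cedict_line_to_tsv(line: str) -> str:
    """Turn one entry line into four tab-separated fields."""
    line = line.rstrip("\r\n")
    line = line.replace(" ", "\t", 1)    # first ' '  -> tab
    line = line.replace(" [", "\t", 1)   # first ' [' -> tab
    # Only the first '] /' is replaced here, so that the definitions
    # field stays in one column (a choice made for this sketch).
    line = line.replace("] /", "\t", 1)
    if line.endswith("/"):               # same effect as '/\n' -> '\n'
        line = line[:-1]
    return line


def cedict_to_tsv(lines) -> str:
    """Convert an iterable of lines and put the header row first."""
    out = [HEADER]
    for line in lines:
        # Assumption: blank lines and '#' comment lines carry no entries.
        if not line.strip() or line.startswith("#"):
            continue
        out.append(cedict_line_to_tsv(line))
    return "\n".join(out) + "\n"
```

To use it, open the source file with encoding="utf-8", pass the file object to cedict_to_tsv, and write the returned string to a new .tsv file. For the other dictionaries, change HEADER to the header row given in the table.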


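The simpler conversions in the table, such as deleting numbered fields (kengdic_2011.tsv) or replacing ':' with a tab (hanja.txt), could be scripted in a similar way. This is a hedged sketch, not a tested recipe for the real files: the function names are invented here, field numbers are 1-based as in the table, and the layout of the input files is assumed from the steps above.

```python
# Sketch of the field-deleting and colon-replacing steps. Names are illustrative.

def drop_fields(row, drop):
    """Return the row without the 1-based field positions listed in drop."""
    return [value for i, value in enumerate(row, start=1) if i not in drop]


def tsv_drop_fields(text: str, drop, header: str) -> str:
    """Delete the given fields from every line and put header first."""
    drop = set(drop)
    out = [header]
    for line in text.splitlines():
        if not line:
            continue
        out.append("\t".join(drop_fields(line.split("\t"), drop)))
    return "\n".join(out) + "\n"


def hanja_to_tsv(text: str) -> str:
    """Skip '#' comment lines and turn ':' separators into tabs."""
    out = ["Hangul\tHanja\tnote"]
    for line in text.splitlines():
        if not line.strip() or line.lstrip().startswith("#"):
            continue
        # Only the first two ':' are replaced, so a colon inside the
        # note does not create a fourth column (a choice of this sketch).
        out.append(line.replace(":", "\t", 2))
    return "\n".join(out) + "\n"
```

For kengdic_2011.tsv the call would be tsv_drop_fields(text, [1, 3, 5, 6, 7, 8, 9, 11, 12], "word\tdef\thanja"), which keeps fields 2, 4 and 10.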
[[Category:Free-Resources]]
Open-source programs for viewing databases: https://sqlitebrowser.org/, https://sqlitestudio.pl/
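
To look at a converted TSV file in one of these viewers, it first has to be loaded into an SQLite database. A minimal sketch using Python's standard csv and sqlite3 modules might look like this. The function name and table name are hypothetical, and it assumes that the first row of the TSV holds the column names.

```python
import csv
import io
import sqlite3


def load_tsv(conn: sqlite3.Connection, table: str, tsv_text: str) -> int:
    """Create a table from TSV text and return the number of rows inserted."""
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t",
                        quoting=csv.QUOTE_NONE)
    header = next(reader)

    def quote(name: str) -> str:
        # Quote identifiers so names with spaces or slashes are accepted.
        return '"' + name.replace('"', '""') + '"'

    columns = ", ".join(quote(name) for name in header)
    marks = ", ".join("?" for _ in header)
    conn.execute(f"CREATE TABLE {quote(table)} ({columns})")
    # Rows with a different number of fields are skipped in this sketch.
    rows = [row for row in reader if len(row) == len(header)]
    conn.executemany(f"INSERT INTO {quote(table)} VALUES ({marks})", rows)
    conn.commit()
    return len(rows)
```

With conn = sqlite3.connect("dictionary.db") (a file name chosen only for this example), the resulting file can then be opened in either viewer.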
[[Category:Computer-Knowledge]]
 
==Other Lessons==
* [[Language/Multiple-languages/Culture/Wiki-Notice-Board|Wiki Notice Board]]
* [[Language/Multiple-languages/Culture/Cultural-differences-by-country|Cultural differences by country]]
* [[Language/Multiple-languages/Culture/Most-Famous-Non–Contemporary-Artists|Most Famous Non–Contemporary Artists]]
* [[Language/Multiple-languages/Culture/IRFP-in-brief|IRFP in brief]]
* [[Language/Multiple-languages/Culture/Introduction-to-Sci–Tech-Index|Introduction to Sci–Tech Index]]
* [[Language/Multiple-languages/Culture/Online-Specialized-Dictionaries|Online Specialized Dictionaries]]
* [[Language/Multiple-languages/Culture/How-to-contribute-to-wiki-lessons-(FAQ)|How to contribute to wiki lessons (FAQ)]]
* [[Language/Multiple-languages/Culture/Cities-with-the-best-quality-of-life|Cities with the best quality of life]]
* [[Language/Multiple-languages/Culture/Techniques-for-learning-languages|Techniques for learning languages]]
* [[Language/Multiple-languages/Culture/Countries-and-Flag-Emoji-by-Languages|Countries and Flag Emoji by Languages]]
<span links></span>
