Language/Multiple-languages/Culture/Producing-dictionaries-with-web-scraping


How slow is slow enough? One request every 10 seconds should not cause problems. Limit yourself to perhaps 150 requests per day, unless you can find out more about what the site tolerates.
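The pacing above can be sketched as a small helper. This is a minimal illustration, not a definitive implementation: the class name <code>PoliteFetcher</code> and its parameters are hypothetical, the 10-second delay and the cap of 150 are the rough figures from the text, and the counter covers a single run of the script (it assumes the script is started at most once a day).

```python
import time
import urllib.request


class PoliteFetcher:
    """Fetch pages with a fixed delay between requests and a cap per run.

    Hypothetical helper: the defaults mirror the rough figures in the text
    (one request every 10 seconds, about 150 requests per day).
    """

    def __init__(self, delay_seconds=10.0, daily_limit=150,
                 clock=time.monotonic, sleep=time.sleep):
        self.delay_seconds = delay_seconds
        self.daily_limit = daily_limit
        self._clock = clock          # injectable so the pacing can be tested
        self._sleep = sleep
        self._last_request = None    # time of the previous request, if any
        self._count = 0              # requests made during this run

    def wait_for_turn(self):
        """Block until the next request is allowed; raise once the cap is hit."""
        if self._count >= self.daily_limit:
            raise RuntimeError("daily request limit reached")
        if self._last_request is not None:
            remaining = self.delay_seconds - (self._clock() - self._last_request)
            if remaining > 0:
                self._sleep(remaining)
        self._last_request = self._clock()
        self._count += 1

    def fetch(self, url):
        """Wait for the next allowed slot, then download the page as text."""
        self.wait_for_turn()
        with urllib.request.urlopen(url, timeout=30) as response:
            return response.read().decode("utf-8", errors="replace")
```

Calling <code>fetch(url)</code> in a loop over a word list then spaces the requests automatically and stops with an error when the cap is reached.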


The principle of web scraping is to read a webpage's source code, find where the desired content is located by its tags, then look up a list of entries on different pages at the same locations and write them down.
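The idea can be sketched with Python's standard <code>html.parser</code> module. Everything specific here is an assumption for illustration: the two sample pages, the <code>span</code> tag, and the <code>definition</code> class are hypothetical, and a real site will need its own tag and class found by inspecting its page source.

```python
from html.parser import HTMLParser


class TagTextExtractor(HTMLParser):
    """Collect the text inside every <tag class="css_class"> element."""

    def __init__(self, tag, css_class):
        super().__init__()
        self.tag = tag
        self.css_class = css_class
        self.results = []
        self._depth = 0      # greater than 0 while inside a matching element
        self._buffer = []    # text pieces of the element being read

    def handle_starttag(self, tag, attrs):
        if self._depth:
            # Track nested elements of the same tag so the right end tag closes us.
            if tag == self.tag:
                self._depth += 1
        elif tag == self.tag:
            classes = (dict(attrs).get("class") or "").split()
            if self.css_class in classes:
                self._depth = 1
                self._buffer = []

    def handle_endtag(self, tag):
        if self._depth and tag == self.tag:
            self._depth -= 1
            if self._depth == 0:
                self.results.append("".join(self._buffer).strip())

    def handle_data(self, data):
        if self._depth:
            self._buffer.append(data)


def extract(html, tag, css_class):
    """Return the text of every matching element in one page."""
    parser = TagTextExtractor(tag, css_class)
    parser.feed(html)
    parser.close()
    return parser.results


# Hypothetical pages that share one layout, keyed by headword.
pages = {
    "cat": '<html><body><h1>cat</h1>'
           '<span class="definition">a small domesticated feline</span></body></html>',
    "dog": '<html><body><h1>dog</h1>'
           '<span class="definition">a domesticated canine</span></body></html>',
}

# Same location on every page, so one rule covers the whole word list.
dictionary = {word: extract(html, "span", "definition")
              for word, html in pages.items()}
```

Because every page uses the same layout, the one tag and class pair is enough to fill the whole <code>dictionary</code>; in practice the page text would come from downloaded pages rather than from inline strings.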


If you want to try, there is a [https://polyglotclub.com/wiki/Language/Multiple-languages/Culture/Internet-Dictionaries dictionary list]. Pick a website that states its content is released under a public license but provides no download link. Some websites provide an API, which is even better.
