Our Resources Wiki - Overviews of useful programs and courses for learning languages as well as a large section for specific languages, including links to subreddits. Resources How to start learning a languageįAQ - If you have questions, and/or are new to language learning, please first check here. There are a few more rules, which you can find here, but that's the golden one. Welcome all and please enjoy your stay.Ĭome join us on Discord! Read our FAQ before asking for help Read our Moderation Policy before submitting Be mature and respectful to others Whether you are just starting, a polyglot or a language nerd, this is the place for you! Content related to specific languages, general language learning and linguistics are all allowed. PDF, data./r/Languagelearning is a community for anybody interested in learning other languages. "Worldlex: Twitter and blog word frequencies for 66 languages." Behavior research methods, 2015, pp. You should check on an individual basis.) (Some resources are available under a permissive or copyleft license, however others may be covered by copyright. Wordlists in the CLARIN infrastructure - just over half are monolingual lists in 10 languages (Dutch, Estonian, Finnish, German, Greek, Maltese, Ngbugu, Slovenian, Spanish, Swedish), while the other two dozen are in bilingual and multilingual combinations.(Some resources are available under a generic CC license, however others are covered by copyright. Resources are available for Balinese, Burmese, Indonesian, Javanese, Karen, Khmer, Lao, Malay, Maguindanao, Maranao, Mon, Shan, Thai, Vietnamese, among others. The SEAlang Library aims to collect lexical resources for the languages of South-East Asia.Swedish: (CC-BY-SA 3.0, LGPL 3.0) (all others: CC BY-ND-NC-SA 2.0, meaning they are incompatible with wiktionary) Frequency lists for learners of Arabic, Chinese, English, Greek, Italian, Norwegian, Polish, Russian and Swedish, available as part of the Kelly project.(Data under various licence conditions, some of which may be incompatible with Wiktionary.) The wordfreq Python library contains large frequency lists for 40+ languages.Frequency lists for English, Russian, Arabic, Chinese, French, German, Greek, Italian, Japanese, Portuguese and Spanish derived from corpora assembled by Leeds University's Centre for Translation Studies (CC BY-2.5).50K and larger word lists based on for 60+ Languages (CC BY-SA-4.0).Word frequency lists from 10K up to 1M+ for 270+ languages, available for download as part of the Leipzig Corpora Collection (CC BY-4.0).Please be mindful too that not all of the resources listed here are suitable for use directly in Wiktionary, mainly due to problems with licensing compatibilities.Įxternal links Resources covering many languages As an aid to navigating this list, consider enabling the OrangeLinks.js gadget to reveal headword pages which exist (and so will still show a blue link) but which do not yet contain an entry for the relevant language. or do not fulfil the Wiktionary Criteria for Inclusion.Ĭollocations may or may not warrant their own individual entries, and not necessarily in the exact form they appear here.are not commonly accepted words of the language for which they appear,.Please be mindful that there will be many words which Thus a word's presence in any of these lists is merely an invitation for further investigation as to whether an entry is warranted. However, this system is far from perfect due to the variable quality of the source data and the automated nature of processing. While creating entries for words, please leave valid bluelinks in place as these pages may be copied for use with other language projects in the future. If you see a word in this list that is clearly out of place (wrong language, punctuation, superfluous capitalisation), you are welcome to remove it. If you are involved in another non-English language edition of Wiktionary, you might also consider implementing or expanding on this idea, if there is not already something similar in place. These forms reflect words as they are likely to be encountered and thus as they may be used in lookup.įeel free to add definitions for words on these lists if you know the languages involved! Even better if you can include usage citations and references. Since English Wiktionary aims not just to be a mere database of lemmas, but a multi-directional, multi-lingual dictionary aimed at English speaking users, there are certain advantages to lists which include inflected forms as well. One use for such lists in the context of the Wiktionary project is as an aid in identifying missing terms of high-frequency and thus, it is assumed, of high priority. Suggestions for how to use these lists įrequency lists have many applications in the realm of second language acquisition and beyond.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |