Language Support#

Label Sleuth supports text data in more than 150 languages. To start the system with your chosen language, use the following command:

python -m label_sleuth.start_label_sleuth --language <YOUR_LANGUAGE>

where <YOUR_LANGUAGE> is the name of the language from the list of supported languages below. Note that if the language name consists of multiple words, it should be enclosed in double quotes.

Note

Not every machine learning model is compatible with every language. For model-language compatibility, see here.

Supported languages#

Label Sleuth supports the following languages:

Language
Afrikaans
Albanian
Alemannic
Amharic
Arabic
Aragonese
Armenian
Assamese
Asturian
Azerbaijani
Bashkir
Basque
Bavarian
Belarusian
Bengali
Bihari
Bishnupriya Manipuri
Bosnian
Breton
Bulgarian
Burmese
Catalan
Cebuano
Central Bicolano
Chechen
Chinese
Chuvash
Corsican
Croatian
Czech
Danish
Divehi
Dutch
Eastern Punjabi
Egyptian Arabic
Emilian-Romagnol
English default
Erzya
Esperanto
Estonian
Fiji Hindi
Finnish
French
Galician
Georgian
German
Goan Konkani
Greek
Gujarati
Haitian
Hebrew
Hill Mari
Hindi
Hungarian
Icelandic
Ido
Ilokano
Indonesian
Interlingua
Irish
Italian
Japanese
Javanese
Kannada
Kapampangan
Kazakh
Khmer
Kirghiz
Korean
Kurdish (Kurmanji)
Kurdish (Sorani)
Latin
Latvian
Limburgish
Lithuanian
Lombard
Low Saxon
Luxembourgish
Macedonian
Maithili
Malagasy
Malay
Malayalam
Maltese
Manx
Marathi
Mazandarani
Meadow Mari
Minangkabau
Mingrelian
Mirandese
Mongolian
Nahuatl
Neapolitan
Nepali
Newar
North Frisian
Northern Sotho
Norwegian (Bokmål)
Norwegian (Nynorsk)
Occitan
Oriya
Ossetian
Palatinate German
Pashto
Persian
Piedmontese
Polish
Portuguese
Quechua
Romanian
Romansh
Russian
Sakha
Sanskrit
Sardinian
Scots
Scottish Gaelic
Serbian
Serbo-Croatian
Sicilian
Sindhi
Sinhalese
Slovak
Slovenian
Somali
Southern Azerbaijani
Spanish
Sundanese
Swahili
Swedish
Tagalog
Tajik
Tamil
Tatar
Telugu
Thai
Tibetan
Turkish
Turkmen
Ukrainian
Upper Sorbian
Urdu
Uyghur
Uzbek
Venetian
Vietnamese
Volapük
Walloon
Waray
Welsh
West Flemish
West Frisian
Western Punjabi
Yiddish
Yoruba
Zazaki
Zeelandic