Language Support#

Label Sleuth supports text data in more than 150 languages. To start the system with your chosen language, use the following command:

python -m label_sleuth.start_label_sleuth --language <YOUR_LANGUAGE>

where <YOUR_LANGUAGE> is the name of the language from the list of supported languages below. Note that if the language name consists of multiple words, it should be enclosed in double quotes.

Note

Not every machine learning model is compatible with every language. For model-language compatibility, see here.

Supported languages#

Label Sleuth supports the following languages:

Language

Afrikaans

Albanian

Alemannic

Amharic

Arabic

Aragonese

Armenian

Assamese

Asturian

Azerbaijani

Bashkir

Basque

Bavarian

Belarusian

Bengali

Bihari

Bishnupriya Manipuri

Bosnian

Breton

Bulgarian

Burmese

Catalan

Cebuano

Central Bicolano

Chechen

Chinese

Chuvash

Corsican

Croatian

Czech

Danish

Divehi

Dutch

Eastern Punjabi

Egyptian Arabic

Emilian-Romagnol

English default

Erzya

Esperanto

Estonian

Fiji Hindi

Finnish

French

Galician

Georgian

German

Goan Konkani

Greek

Gujarati

Haitian

Hebrew

Hill Mari

Hindi

Hungarian

Icelandic

Ido

Ilokano

Indonesian

Interlingua

Irish

Italian

Japanese

Javanese

Kannada

Kapampangan

Kazakh

Khmer

Kirghiz

Korean

Kurdish (Kurmanji)

Kurdish (Sorani)

Latin

Latvian

Limburgish

Lithuanian

Lombard

Low Saxon

Luxembourgish

Macedonian

Maithili

Malagasy

Malay

Malayalam

Maltese

Manx

Marathi

Mazandarani

Meadow Mari

Minangkabau

Mingrelian

Mirandese

Mongolian

Nahuatl

Neapolitan

Nepali

Newar

North Frisian

Northern Sotho

Norwegian (Bokmål)

Norwegian (Nynorsk)

Occitan

Oriya

Ossetian

Palatinate German

Pashto

Persian

Piedmontese

Polish

Portuguese

Quechua

Romanian

Romansh

Russian

Sakha

Sanskrit

Sardinian

Scots

Scottish Gaelic

Serbian

Serbo-Croatian

Sicilian

Sindhi

Sinhalese

Slovak

Slovenian

Somali

Southern Azerbaijani

Spanish

Sundanese

Swahili

Swedish

Tagalog

Tajik

Tamil

Tatar

Telugu

Thai

Tibetan

Turkish

Turkmen

Ukrainian

Upper Sorbian

Urdu

Uyghur

Uzbek

Venetian

Vietnamese

Volapük

Walloon

Waray

Welsh

West Flemish

West Frisian

Western Punjabi

Yiddish

Yoruba

Zazaki

Zeelandic