Locales
Locales and Locale Strings
Overview
This page will discuss the standard used by General Translation to represent locales & languages and the list of currently supported locales.
Locale strings
General Translation uses a variant of the BCP 47 Language Tag standard to represent locales and languages. BCP 47 Language Tags are the Internet Best Current Practices (BCP) standard for identifying languages in both spoken and written forms. These tags provide a uniform way to specify languages, allowing applications to adapt content, format, and behavior based on the user's locale.
Language tags are composed of one or more subtags separated by the "-" character. The subtags include the following components:
- Language Subtag: Represents the primary language, e.g., enfor English,esfor Spanish.
- Region Subtag: Specifies a country or region, e.g., USfor the United States,FRfor France.
- Script Subtag (optional): Indicates the writing script, e.g., Latnfor Latin script.
- Variant Subtag (optional): Identifies a specific variation of a language, e.g., arevelafor Eastern Armenian.
When you combine these tags together, we refer to this as a Locale or Locale tag.
Commonly Used Tags
In practice, most language tags consist of two subtags: a language and a region. Here are some common examples:
| Language Tag | Description | 
|---|---|
| en-US | English as used in the US | 
| es-ES | Spanish as used in Spain | 
| fr-CA | French as used in Canada | 
| zh-CN | Simplified Chinese (China) | 
| de-DE | German as used in Germany | 
Extended Tags
Language tags can include additional subtags for more specificity:
- Example: hy-Latn-IT-arevela- hy: Armenian (language)
- Latn: Latin (script)
- IT: Italy (region)
- arevela: Eastern Armenian (variant)
 
This tag represents Eastern Armenian written in Latin script, as used in Italy.
Exceptions to BCP 47 in GT
đ§ This section is currently under construction. đ§
Supported locales
This section lists all locales that are currently supported by General Translation.
A note on low-resource languages
Our system leverages some of the most advanced LLM models on the market to provide accurate translations; however, these models are not without their limitations. Certain resource languages may not be supported by the model provider you have selected or any of the available providers. These languages are known as "low-resource languages".
Low-resource languages can vary between models, so if you specify a preferred model provider in your configuration, you may want to check the list of supported languages for that provider.
Official list
- afđżđŚAfrikaans
- amđŞđšAmharic
- arđŞđŹArabic
- ar-AEđŚđŞArabic (United Arab Emirates)
- ar-EGđŞđŹArabic (Egypt)
- ar-LBđąđ§Arabic (Lebanon)
- ar-MAđ˛đŚArabic (Morocco)
- ar-SAđ¸đŚArabic (Saudi Arabia)
- bgđ§đŹBulgarian
- bnđ§đŠBangla
- bsđ§đŚBosnian
- cađCatalan
- csđ¨đżCzech
- cyđ´ó §ó ˘ó ˇó Źó łó żWelsh
- dađŠđ°Danish
- deđŠđŞGerman
- de-ATđŚđšAustrian German
- de-CHđ¨đSwiss High German
- de-DEđŠđŞGerman (Germany)
- elđŹđˇGreek
- el-CYđ¨đžGreek (Cyprus)
- el-ELđGreek (EL)
- enđşđ¸English
- en-AUđŚđşAustralian English
- en-CAđ¨đŚCanadian English
- en-GBđŹđ§British English
- en-NZđłđżEnglish (New Zealand)
- en-USđşđ¸American English
- esđŞđ¸Spanish
- es-419đLatin American Spanish
- es-ARđŚđˇSpanish (Argentina)
- es-CLđ¨đąSpanish (Chile)
- es-COđ¨đ´Spanish (Colombia)
- es-ESđŞđ¸European Spanish
- es-MXđ˛đ˝Mexican Spanish
- es-PEđľđŞSpanish (Peru)
- es-USđşđ¸Spanish (United States)
- es-VEđťđŞSpanish (Venezuela)
- etđŞđŞEstonian
- fađŽđˇPersian
- fiđŤđŽFinnish
- filđľđFilipino
- frđŤđˇFrench
- fr-BEđ§đŞFrench (Belgium)
- fr-CAđ¨đŚCanadian French
- fr-CHđ¨đSwiss French
- fr-CMđ¨đ˛French (Cameroon)
- fr-FRđŤđˇFrench (France)
- fr-SNđ¸đłFrench (Senegal)
- guđŽđłGujarati
- heđŽđąHebrew
- hiđŽđłHindi
- hrđđˇCroatian
- huđđşHungarian
- hyđŚđ˛Armenian
- idđŽđŠIndonesian
- isđŽđ¸Icelandic
- itđŽđšItalian
- it-CHđ¨đItalian (Switzerland)
- it-ITđŽđšItalian (Italy)
- jađŻđľJapanese
- kađŹđŞGeorgian
- kkđ°đżKazakh
- knđŽđłKannada
- kođ°đˇKorean
- lađťđŚLatin
- ltđąđšLithuanian
- lvđąđťLatvian
- mkđ˛đ°Macedonian
- mlđŽđłMalayalam
- mnđ˛đłMongolian
- mrđŽđłMarathi
- msđ˛đžMalay
- myđ˛đ˛Burmese
- nlđłđąDutch
- nl-BEđ§đŞFlemish
- nl-NLđłđąDutch (Netherlands)
- nođłđ´Norwegian
- pađŽđłPunjabi
- plđľđąPolish
- ptđ§đˇPortuguese
- pt-BRđ§đˇBrazilian Portuguese
- pt-PTđľđšEuropean Portuguese
- qbrđqbr
- rođˇđ´Romanian
- ruđˇđşRussian
- skđ¸đ°Slovak
- slđ¸đŽSlovenian
- sođ¸đ´Somali
- sqđŚđąAlbanian
- srđˇđ¸Serbian
- svđ¸đŞSwedish
- swđšđżSwahili
- sw-KEđ°đŞSwahili (Kenya)
- sw-TZđšđżSwahili (Tanzania)
- tađŽđłTamil
- teđŽđłTelugu
- thđšđThai
- tlđľđFilipino
- trđšđˇTurkish
- ukđşđŚUkrainian
- urđľđ°Urdu
- viđťđłVietnamese
- zhđ¨đłChinese
- zh-CNđ¨đłChinese (China)
- zh-HKđđ°Chinese (Hong Kong SAR China)
- zh-Hansđ¨đłSimplified Chinese
- zh-HantđšđźTraditional Chinese
- zh-SGđ¸đŹChinese (Singapore)
- zh-TWđšđźChinese (Taiwan)
Notes
- General Translations uses Locale Tags (Locales) to identify languages and regions internally.
Next Steps
- See our List of Supported Locales to find the language tags available in General Translation.
- Refer to the official IETF Language Tag Registry and the BCP 47 Language Tag standard for more information.
How is this guide?

