Language and voice support for the Speech service

The following tables summarize language support for speech to text, text to speech, pronunciation assessment, speech translation, and other service features.

You can also get a list of the locales and voices supported for each specific region or endpoint through the Speech SDK or the Speech service REST APIs.
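
As one illustration of querying a region directly, the following sketch builds a request against the regional voices list endpoint of the text to speech REST API and filters the result by locale. The helper names (`voices_list_url`, `fetch_voices`, `voices_for_locale`) are hypothetical, the region and key are placeholders you supply, and the `ShortName` and `Locale` fields are assumed from the shape of the voices list response. Treat it as a minimal sketch rather than a reference client.

```python
import json
import urllib.request


def voices_list_url(region: str) -> str:
    """Build the regional voices list URL for text to speech."""
    return f"https://{region}.tts.speech.microsoft.com/cognitiveservices/voices/list"


def fetch_voices(region: str, key: str) -> list:
    """Download the voice list for a region (performs a network request)."""
    request = urllib.request.Request(
        voices_list_url(region),
        headers={"Ocp-Apim-Subscription-Key": key},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def voices_for_locale(voices: list, locale: str) -> list:
    """Return the short names of voices whose Locale matches, ignoring case."""
    wanted = locale.lower()
    return [v["ShortName"] for v in voices if v.get("Locale", "").lower() == wanted]
```

With a valid Speech resource key, `voices_for_locale(fetch_voices("westus", key), "en-US")` would return the short names of the en-US voices available in that region. The filtering step is kept separate from the download so that it can be exercised against saved responses.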

Supported languages

Language support varies by Speech service functionality.

The table in this section summarizes the locales supported for real-time transcription, fast transcription, and batch transcription.

More remarks for speech to text locales are included in the custom speech section of this article.

Tip

Try out the Azure AI Speech Toolkit to easily build and run samples on Visual Studio Code.

| Locale (BCP-47) | Language | Fast transcription support | Custom speech support |
| --- | --- | --- | --- |
| af-ZA | Afrikaans (South Africa) | Yes | Plain text |
| am-ET | Amharic (Ethiopia) | Yes | Plain text |
| ar-AE | Arabic (United Arab Emirates) | Yes | Audio + human-labeled transcript, Plain text |
| ar-BH | Arabic (Bahrain) | Yes | Audio + human-labeled transcript, Plain text |
| ar-DZ | Arabic (Algeria) | No | Audio + human-labeled transcript, Plain text |
| ar-EG | Arabic (Egypt) | Yes | Audio + human-labeled transcript, Plain text, Structured text |
| ar-IL | Arabic (Israel) | Yes | Audio + human-labeled transcript, Plain text |
| ar-IQ | Arabic (Iraq) | Yes | Audio + human-labeled transcript, Plain text |
| ar-JO | Arabic (Jordan) | Yes | Audio + human-labeled transcript, Plain text |
| ar-KW | Arabic (Kuwait) | Yes | Audio + human-labeled transcript, Plain text |
| ar-LB | Arabic (Lebanon) | Yes | Audio + human-labeled transcript, Plain text |
| ar-LY | Arabic (Libya) | Yes | Audio + human-labeled transcript, Plain text |
| ar-MA | Arabic (Morocco) | No | Audio + human-labeled transcript, Plain text |
| ar-OM | Arabic (Oman) | Yes | Audio + human-labeled transcript, Plain text |
| ar-PS | Arabic (Palestinian Authority) | Yes | Audio + human-labeled transcript, Plain text |
| ar-QA | Arabic (Qatar) | Yes | Audio + human-labeled transcript, Plain text |
| ar-SA | Arabic (Saudi Arabia) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Phrase list |
| ar-SY | Arabic (Syria) | Yes | Audio + human-labeled transcript, Plain text |
| ar-TN | Arabic (Tunisia) | No | Audio + human-labeled transcript, Plain text |
| ar-YE | Arabic (Yemen) | No | Audio + human-labeled transcript, Plain text |
| as-IN | Assamese (India) | No | Audio + human-labeled transcript |
| az-AZ | Azerbaijani (Latin, Azerbaijan) | Yes | Plain text |
| bg-BG | Bulgarian (Bulgaria) | No | Plain text |
| bn-IN | Bengali (India) | Yes | Plain text |
| bs-BA | Bosnian (Bosnia and Herzegovina) | No | Plain text |
| ca-ES | Catalan | No | Plain text, Pronunciation |
| cs-CZ | Czech (Czechia) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| cy-GB | Welsh (United Kingdom) | No | Plain text |
| da-DK | Danish (Denmark) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation |
| de-AT | German (Austria) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| de-CH | German (Switzerland) | Yes | Audio + human-labeled transcript, Plain text, Pronunciation, Phrase list |
| de-DE | German (Germany) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| el-GR | Greek (Greece) | No | Audio + human-labeled transcript, Plain text, Structured text |
| en-AU | English (Australia) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| en-CA | English (Canada) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| en-GB | English (United Kingdom) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| en-GH | English (Ghana) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation |
| en-HK | English (Hong Kong SAR, China) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation |
| en-IE | English (Ireland) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| en-IN | English (India) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| en-KE | English (Kenya) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation |
| en-NG | English (Nigeria) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation |
| en-NZ | English (New Zealand) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation |
| en-PH | English (Philippines) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation |
| en-SG | English (Singapore) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation |
| en-TZ | English (Tanzania) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation |
| en-US | English (United States) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| en-ZA | English (South Africa) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation, Phrase list |
| es-AR | Spanish (Argentina) | Yes | Plain text, Structured text, Pronunciation |
| es-BO | Spanish (Bolivia) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-CL | Spanish (Chile) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-CO | Spanish (Colombia) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-CR | Spanish (Costa Rica) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-CU | Spanish (Cuba) | Yes | Plain text, Structured text, Pronunciation |
| es-DO | Spanish (Dominican Republic) | Yes | Plain text, Structured text, Pronunciation |
| es-EC | Spanish (Ecuador) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-ES | Spanish (Spain) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| es-GQ | Spanish (Equatorial Guinea) | Yes | Audio + human-labeled transcript, Plain text, Structured text |
| es-GT | Spanish (Guatemala) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-HN | Spanish (Honduras) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-MX | Spanish (Mexico) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| es-NI | Spanish (Nicaragua) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-PA | Spanish (Panama) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-PE | Spanish (Peru) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-PR | Spanish (Puerto Rico) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-PY | Spanish (Paraguay) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-SV | Spanish (El Salvador) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-US | Spanish (United States)¹ | Yes | Plain text, Structured text, Pronunciation, Phrase list |
| es-UY | Spanish (Uruguay) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| es-VE | Spanish (Venezuela) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| et-EE | Estonian (Estonia) | Yes | Plain text, Pronunciation |
| eu-ES | Basque | Yes | Plain text |
| fa-IR | Persian (Iran) | No | Plain text |
| fi-FI | Finnish (Finland) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation |
| fil-PH | Filipino (Philippines) | Yes | Plain text, Pronunciation |
| fr-BE | French (Belgium) | No | Plain text |
| fr-CA | French (Canada)¹ | No | Plain text, Structured text, Output format, Pronunciation, Phrase list |
| fr-CH | French (Switzerland) | No | Plain text, Pronunciation |
| fr-FR | French (France) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| ga-IE | Irish (Ireland) | Yes | Plain text, Pronunciation |
| gl-ES | Galician | Yes | Plain text |
| gu-IN | Gujarati (India) | No | Plain text |
| he-IL | Hebrew (Israel) | Yes | Audio + human-labeled transcript, Plain text |
| hi-IN | Hindi (India) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
| hr-HR | Croatian (Croatia) | No | Plain text, Pronunciation |
| hu-HU | Hungarian (Hungary) | No | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
| hy-AM | Armenian (Armenia) | Yes | Plain text |
| id-ID | Indonesian (Indonesia) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation, Phrase list |
| is-IS | Icelandic (Iceland) | Yes | Plain text |
| it-CH | Italian (Switzerland) | No | Plain text |
| it-IT | Italian (Italy) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| ja-JP | Japanese (Japan) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
| jv-ID | Javanese (Latin, Indonesia) | Yes | Plain text |
| ka-GE | Georgian (Georgia) | Yes | Plain text |
| kk-KZ | Kazakh (Kazakhstan) | Yes | Plain text |
| km-KH | Khmer (Cambodia) | No | Plain text |
| kn-IN | Kannada (India) | No | Plain text |
| ko-KR | Korean (Korea) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
| lo-LA | Lao (Laos) | Yes | Plain text |
| lt-LT | Lithuanian (Lithuania) | Yes | Plain text, Pronunciation |
| lv-LV | Latvian (Latvia) | No | Plain text, Pronunciation |
| mk-MK | Macedonian (North Macedonia) | Yes | Plain text |
| ml-IN | Malayalam (India) | Yes | Plain text |
| mn-MN | Mongolian (Mongolia) | Yes | Plain text |
| mr-IN | Marathi (India) | No | Audio + human-labeled transcript, Plain text |
| ms-MY | Malay (Malaysia) | No | Plain text |
| mt-MT | Maltese (Malta) | Yes | Plain text |
| my-MM | Burmese (Myanmar) | Yes | Plain text |
| nb-NO | Norwegian Bokmål (Norway) | No | Plain text, Output format |
| ne-NP | Nepali (Nepal) | No | Plain text |
| nl-BE | Dutch (Belgium) | No | Plain text |
| nl-NL | Dutch (Netherlands) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| or-IN | Odia (India) | No | Audio + human-labeled transcript |
| pa-IN | Punjabi (India) | No | Audio + human-labeled transcript |
| pl-PL | Polish (Poland) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| ps-AF | Pashto (Afghanistan) | Yes | Plain text |
| pt-BR | Portuguese (Brazil) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| pt-PT | Portuguese (Portugal) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| ro-RO | Romanian (Romania) | No | Plain text, Pronunciation |
| ru-RU | Russian (Russia) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Phrase list |
| si-LK | Sinhala (Sri Lanka) | No | Plain text |
| sk-SK | Slovak (Slovakia) | No | Plain text, Pronunciation |
| sl-SI | Slovenian (Slovenia) | No | Plain text, Pronunciation |
| so-SO | Somali (Somalia) | Yes | Plain text |
| sq-AL | Albanian (Albania) | Yes | Plain text |
| sr-RS | Serbian (Cyrillic, Serbia) | Yes | Plain text |
| sv-SE | Swedish (Sweden) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
| sw-KE | Kiswahili (Kenya) | Yes | Plain text |
| sw-TZ | Kiswahili (Tanzania) | No | Plain text |
| ta-IN | Tamil (India) | No | Audio + human-labeled transcript, Plain text, Structured text |
| te-IN | Telugu (India) | No | Audio + human-labeled transcript, Plain text, Structured text |
| th-TH | Thai (Thailand) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
| tr-TR | Turkish (Türkiye) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format |
| uk-UA | Ukrainian (Ukraine) | No | Audio + human-labeled transcript, Plain text |
| ur-IN | Urdu (India) | Yes | Audio + human-labeled transcript |
| uz-UZ | Uzbek (Latin, Uzbekistan) | Yes | Plain text |
| vi-VN | Vietnamese (Vietnam) | No | Plain text, Phrase list |
| wuu-CN | Chinese (Wu, Simplified) | No | Plain text |
| yue-CN | Chinese (Cantonese, Simplified) | No | Plain text |
| zh-CN | Chinese (Mandarin, Simplified) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
| zh-CN-shandong | Chinese (Jilu Mandarin, Simplified) | No | Plain text |
| zh-CN-sichuan | Chinese (Southwestern Mandarin, Simplified) | No | Plain text |
| zh-HK | Chinese (Cantonese, Traditional) | No | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
| zh-TW | Chinese (Taiwanese Mandarin, Traditional) | No | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
| zu-ZA | isiZulu (South Africa) | Yes | Plain text |

¹ The model is bilingual and also supports English.
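
If you need to consult this table from code, one option is to capture the rows as data and query them. The following sketch copies four rows from the table above; the `LocaleSupport` type and both helper functions are hypothetical names introduced for illustration and aren't part of the Speech SDK.

```python
from typing import NamedTuple


class LocaleSupport(NamedTuple):
    language: str
    fast_transcription: bool
    custom_speech: frozenset


# Four rows copied from the table above; a real lookup would load every row.
SUPPORT = {
    "af-ZA": LocaleSupport("Afrikaans (South Africa)", True, frozenset({"Plain text"})),
    "ar-DZ": LocaleSupport(
        "Arabic (Algeria)",
        False,
        frozenset({"Audio + human-labeled transcript", "Plain text"}),
    ),
    "de-DE": LocaleSupport(
        "German (Germany)",
        True,
        frozenset({
            "Audio + human-labeled transcript", "Plain text", "Structured text",
            "Output format", "Pronunciation", "Phrase list",
        }),
    ),
    "fr-CA": LocaleSupport(
        "French (Canada)",
        False,
        frozenset({
            "Plain text", "Structured text", "Output format",
            "Pronunciation", "Phrase list",
        }),
    ),
}


def supports_fast_transcription(locale: str) -> bool:
    """True if the locale is in the excerpt and marked Yes for fast transcription."""
    row = SUPPORT.get(locale)
    return row is not None and row.fast_transcription


def supports_customization(locale: str, data_type: str) -> bool:
    """True if the locale is in the excerpt and lists the given custom speech data type."""
    row = SUPPORT.get(locale)
    return row is not None and data_type in row.custom_speech
```

For example, `supports_fast_transcription("fr-CA")` is `False` while `supports_customization("fr-CA", "Phrase list")` is `True`, which matches the fr-CA row. Locales missing from the excerpt return `False` rather than raising an error, so an incomplete table fails closed.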

Custom speech

To improve speech to text recognition accuracy, customization is available for some languages and base models. Depending on the locale, you can upload audio + human-labeled transcripts, plain text, and pronunciation data. By default, plain text customization is supported for all available base models. To learn more about customization, see custom speech.

These locales support the display text format feature: da-DK, de-DE, en-AU, en-CA, en-GB, en-HK, en-IE, en-IN, en-NG, en-NZ, en-PH, en-SG, en-US, es-ES, es-MX, fi-FI, fr-CA, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, nb-NO, nl-NL, pl-PL, pt-BR, pt-PT, sv-SE, tr-TR, zh-CN, zh-HK.
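
Locale tags can reach your code in mixed casing (for example `zh-cn` or `EN-us`), so it can help to normalize a tag before comparing it with the list above. The following sketch applies the conventional BCP-47 casing (lowercase language, title-case four-letter script, uppercase two-letter region) and then checks membership. The function names are hypothetical, and the casing rule is a simplification that doesn't validate tags.

```python
def normalize_locale(tag: str) -> str:
    """Apply conventional BCP-47 casing to a locale tag without validating it."""
    parts = tag.replace("_", "-").split("-")
    normalized = [parts[0].lower()]
    for part in parts[1:]:
        if len(part) == 2 and part.isalpha():
            normalized.append(part.upper())   # region, such as US or CN
        elif len(part) == 4 and part.isalpha():
            normalized.append(part.title())   # script, such as Latn
        else:
            normalized.append(part.lower())   # anything else, such as shandong
    return "-".join(normalized)


# Locales listed above as supporting the display text format feature.
DISPLAY_FORMAT_LOCALES = frozenset(
    "da-DK de-DE en-AU en-CA en-GB en-HK en-IE en-IN en-NG en-NZ en-PH en-SG "
    "en-US es-ES es-MX fi-FI fr-CA fr-FR hi-IN it-IT ja-JP ko-KR nb-NO nl-NL "
    "pl-PL pt-BR pt-PT sv-SE tr-TR zh-CN zh-HK".split()
)


def supports_display_format(tag: str) -> bool:
    """True if the normalized tag appears in the display text format list."""
    return normalize_locale(tag) in DISPLAY_FORMAT_LOCALES
```

Normalizing first means that `supports_display_format("zh-cn")` and `supports_display_format("zh-CN")` give the same answer.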

Next steps