The Speech Transcription software development kit (SDK) exposes the LLM Speech and Fast Transcription capabilities of the Speech service, making it easier to develop high-quality transcription applications. The SDK is available for Java and Python across multiple platforms. It's suited to near-real-time and non-real-time scenarios that use audio captured on a local device, audio files, or Azure Blob Storage data.
In some cases, you can't or shouldn't use the Speech Transcription SDK. In those cases, you can access the Speech service through real-time streaming via WebSockets or through REST APIs. For example, use the Speech SDK for real-time streaming, or use the Speech to text REST API for high-volume batch transcription and custom speech model management.
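The guidance above can be summarized as a small lookup. The following Python sketch is illustrative only: the `Scenario` enum, the `RECOMMENDED_TOOL` table, and the `recommend_tool` function are hypothetical names invented for this example and aren't part of any Speech SDK or REST API. The returned strings are product names taken from this article, not code identifiers.

```python
from enum import Enum


class Scenario(Enum):
    """Hypothetical scenario labels, named only for this sketch."""
    NEAR_REAL_TIME = "near-real-time transcription"
    NON_REAL_TIME = "non-real-time transcription"
    REAL_TIME_STREAMING = "real-time streaming"
    HIGH_VOLUME_BATCH = "high-volume batch transcription"
    CUSTOM_MODEL_MANAGEMENT = "custom speech model management"


# Mapping derived from the guidance in this article.
# The values are product names, not importable packages or API calls.
RECOMMENDED_TOOL = {
    Scenario.NEAR_REAL_TIME: "Speech Transcription SDK",
    Scenario.NON_REAL_TIME: "Speech Transcription SDK",
    Scenario.REAL_TIME_STREAMING: "Speech SDK",
    Scenario.HIGH_VOLUME_BATCH: "Speech to text REST API",
    Scenario.CUSTOM_MODEL_MANAGEMENT: "Speech to text REST API",
}


def recommend_tool(scenario: Scenario) -> str:
    """Return the tool this article suggests for the given scenario."""
    return RECOMMENDED_TOOL[scenario]


if __name__ == "__main__":
    # Print the suggested tool for every scenario in the table.
    for scenario in Scenario:
        print(f"{scenario.value}: {recommend_tool(scenario)}")
```

Treat the table as a starting point rather than a rule: a real application might weigh other factors, such as latency targets or audio volume, that this sketch doesn't model.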
Supported languages
The Speech Transcription SDK supports the following languages and platforms:
| Programming language | Reference | Platform support |
|---|---|---|
| Java | Java | Android, Windows, Linux, macOS |
| Python | Python | Windows, Linux, macOS |
Code samples
Speech Transcription SDK code samples are available in the documentation and on GitHub.
Docs samples
| Programming language | Quickstart doc |
|---|---|
| Java | Fast transcription |
| Python | Fast transcription |
If a sample isn't available in your preferred programming language, you can select another programming language to get started and learn about the concepts, or see the reference and samples linked from the beginning of the article.
GitHub samples
You can find samples for each programming language in the respective GitHub repositories.
| Programming language | Samples Repository |
|---|---|
| Java | Java Samples |
| Python | Python Samples |
Help options
The developer community can use the Stack Overflow forums to ask and answer questions about Azure Cognitive Speech and other services. Microsoft monitors the forums and replies to questions that the community hasn't yet answered. To make sure that Microsoft sees your question, tag it with 'azure-speech'.
You can suggest an idea or report a bug by creating an issue on GitHub.
For more ways to get support, stay up to date, give feedback, and report bugs for Foundry Tools, see Foundry Tools support and help options.
Next steps
- Learn about the Speech SDK for broader speech capabilities