azure-ai-transcription-py
✓Verified·Scanned 2/18/2026
This skill provides an Azure AI Transcription Python client for real-time and batch speech-to-text transcription. It reads TRANSCRIPTION_ENDPOINT and TRANSCRIPTION_KEY from the environment and uses https://<resource>.cognitiveservices.azure.com and https://<storage>/audio.wav for network calls.
from clawhub.ai·v24a44d8·1.7 KB·0 installs
Scanned from 0.1.0 at 24a44d8 · Transparency log ↗
$ vett add clawhub.ai/thegovind/azure-ai-transcription-py
Azure AI Transcription SDK for Python
Client library for Azure AI Transcription (speech-to-text) with real-time and batch transcription.
Installation
pip install azure-ai-transcription
Environment Variables
TRANSCRIPTION_ENDPOINT=https://<resource>.cognitiveservices.azure.com
TRANSCRIPTION_KEY=<your-key>
Authentication
Use subscription key authentication (DefaultAzureCredential is not supported for this client):
import os
from azure.ai.transcription import TranscriptionClient
client = TranscriptionClient(
endpoint=os.environ["TRANSCRIPTION_ENDPOINT"],
credential=os.environ["TRANSCRIPTION_KEY"]
)
Transcription (Batch)
job = client.begin_transcription(
name="meeting-transcription",
locale="en-US",
content_urls=["https://<storage>/audio.wav"],
diarization_enabled=True
)
result = job.result()
print(result.status)
Transcription (Real-time)
stream = client.begin_stream_transcription(locale="en-US")
stream.send_audio_file("audio.wav")
for event in stream:
print(event.text)
Best Practices
- Enable diarization when multiple speakers are present
- Use batch transcription for long files stored in blob storage
- Capture timestamps for subtitle generation
- Specify language to improve recognition accuracy
- Handle streaming backpressure for real-time transcription
- Close transcription sessions when complete