azure-ai-transcription-py

✓Verified·Scanned 2/18/2026

This skill provides an Azure AI Transcription Python client for real-time and batch speech-to-text transcription. It reads TRANSCRIPTION_ENDPOINT and TRANSCRIPTION_KEY from the environment and uses https://<resource>.cognitiveservices.azure.com and https://<storage>/audio.wav for network calls.

from clawhub.ai·v24a44d8·1.7 KB·0 installs

Scanned from 0.1.0 at 24a44d8 · Transparency log ↗

$ vett add clawhub.ai/thegovind/azure-ai-transcription-py

Azure AI Transcription SDK for Python

Client library for Azure AI Transcription (speech-to-text) with real-time and batch transcription.

Installation

pip install azure-ai-transcription

Environment Variables

TRANSCRIPTION_ENDPOINT=https://<resource>.cognitiveservices.azure.com
TRANSCRIPTION_KEY=<your-key>

Authentication

Use subscription key authentication (DefaultAzureCredential is not supported for this client):

import os
from azure.ai.transcription import TranscriptionClient

client = TranscriptionClient(
    endpoint=os.environ["TRANSCRIPTION_ENDPOINT"],
    credential=os.environ["TRANSCRIPTION_KEY"]
)

Transcription (Batch)

job = client.begin_transcription(
    name="meeting-transcription",
    locale="en-US",
    content_urls=["https://<storage>/audio.wav"],
    diarization_enabled=True
)
result = job.result()
print(result.status)

Transcription (Real-time)

stream = client.begin_stream_transcription(locale="en-US")
stream.send_audio_file("audio.wav")
for event in stream:
    print(event.text)

Best Practices

Enable diarization when multiple speakers are present
Use batch transcription for long files stored in blob storage
Capture timestamps for subtitle generation
Specify language to improve recognition accuracy
Handle streaming backpressure for real-time transcription
Close transcription sessions when complete