How to Transcribe Audio Files from Azure Blob Storage Using Python?

I'm working on a project where I need to transcribe audio files stored in Azure Blob Storage. I'm using OpenAI's Whisper for transcription, which requires the audio file to be locally accessible or in a supported format like WAV or MP3.

My current idea is to download the file from Blob Storage using the azure-storage-blob Python SDK and then feed it into Whisper's transcribe() method. However, I'm not sure if this is the most efficient way, especially for streaming use cases or handling large files.

Here are my questions:

  1. Is there a way to stream the audio file directly from Blob Storage into Whisper without downloading it fully to local storage?

  2. Has anyone tried integrating Azure Blob Storage and Whisper? If so, how did you handle the audio file formats and ensure smooth processing?

  3. Are there better alternatives to pydub or other libraries for converting streamed audio into Whisper-compatible formats?

Any advice, best practices, or example code would be much appreciated!