Represents an audio input object.
Base64 encoded audio data.
The format of the audio data (e.g., "wav", "mp3").