Transcribe uploaded, recorded, or online audio to text
Generate analysis and response based on policy and prompt