Meta Llama3 8b with Llava Multimodal capabilities
Generate speech in a cloned voice from reference audio
Generate images from text prompts