Fine-tune language models locally and chat with them
Encode text to audio and decode audio back to text