Abstract
Breeze-7B, an open-source language model derived from Mistral-7B, achieves top performance in language comprehension and chatbot-oriented tasks through additional pretraining and finetuning.
Breeze-7B is an open-source language model based on Mistral-7B, designed to address the need for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese. This technical report provides an overview of the additional pretraining, finetuning, and evaluation stages for the Breeze-7B model. The Breeze-7B family of base and chat models performs well on language comprehension and chatbot-oriented tasks, ranking at the top of several benchmarks among models of comparable complexity.