Papers
arxiv:2403.02712

Breeze-7B Technical Report

Published on Mar 5, 2024
Authors:
,
,

Abstract

Breeze-7B, an open-source language model derived from Mistral-7B, achieves top performance in language comprehension and chatbot-oriented tasks through additional pretraining and fintuning.

AI-generated summary

Breeze-7B is an open-source language model based on Mistral-7B, designed to address the need for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese. This technical report provides an overview of the additional pretraining, finetuning, and evaluation stages for the Breeze-7B model. The Breeze-7B family of base and chat models exhibits good performance on language comprehension and chatbot-oriented tasks, reaching the top in several benchmarks among models comparable in its complexity class.

Community

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2403.02712
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 20

Browse 20 models citing this paper

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2403.02712 in a dataset README.md to link it from this page.

Spaces citing this paper 11

Collections including this paper 1