Instructions to use tencent/Hy-MT2-7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use tencent/Hy-MT2-7B with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "translation" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("translation", model="tencent/Hy-MT2-7B")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("tencent/Hy-MT2-7B") model = AutoModelForCausalLM.from_pretrained("tencent/Hy-MT2-7B") - Notebooks
- Google Colab
- Kaggle
upload Hy-MT2 files
Browse files- .gitattributes +1 -0
- HY_MT2_0_Report.pdf +3 -0
- LICENSE.txt +1 -1
- README.md +16 -20
- README_CN.md +5 -11
- imgs/main_result.png +2 -2
.gitattributes
CHANGED
|
@@ -36,3 +36,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 36 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 37 |
HY_MT2_0_Technical_Report.pdf filter=lfs diff=lfs merge=lfs -text
|
| 38 |
imgs/main_result.png filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 36 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 37 |
HY_MT2_0_Technical_Report.pdf filter=lfs diff=lfs merge=lfs -text
|
| 38 |
imgs/main_result.png filter=lfs diff=lfs merge=lfs -text
|
| 39 |
+
HY_MT2_0_Report.pdf filter=lfs diff=lfs merge=lfs -text
|
HY_MT2_0_Report.pdf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:47d84a5a737994ffe0bd3a74d255698273c292a3acc9dda3902a38ea2440afaf
|
| 3 |
+
size 2326558
|
LICENSE.txt
CHANGED
|
@@ -14,7 +14,7 @@ f. “Materials” shall mean, collectively, Tencent’s proprietary Tencent HY
|
|
| 14 |
g. “Model Derivatives” shall mean all: (i) modifications to Tencent HY or any Model Derivative of Tencent HY; (ii) works based on Tencent HY or any Model Derivative of Tencent HY; or (iii) any other machine learning model which is created by transfer of patterns of the weights, parameters, operations, or Output of Tencent HY or any Model Derivative of Tencent HY, to that model in order to cause that model to perform similarly to Tencent HY or a Model Derivative of Tencent HY, including distillation methods, methods that use intermediate data representations, or methods based on the generation of synthetic data Outputs by Tencent HY or a Model Derivative of Tencent HY for training that model. For clarity, Outputs by themselves are not deemed Model Derivatives.
|
| 15 |
h. “Output” shall mean the information and/or content output of Tencent HY or a Model Derivative that results from operating or otherwise using Tencent HY or a Model Derivative, including via a Hosted Service.
|
| 16 |
i. “Tencent,” “We” or “Us” shall mean the applicable entity or entities in the Tencent corporate family that own(s) intellectual property or other rights embodied in or utilized by the Materials.
|
| 17 |
-
j. “Tencent HY” shall mean the large language models, text/image/video/audio/3D generation models, and multimodal large language models and their software and algorithms, including trained model weights, parameters (including optimizer states), machine-learning model code, inference-enabling code, training-enabling code, fine-tuning enabling code and other elements of the foregoing made publicly available by Us, including, without limitation to, Tencent Hy-MT2-1.8B released at https://huggingface.co/tencent/Hy-MT2-1.8B, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-1.8B; Tencent Hy-MT2-7B released at https://huggingface.co/tencent/Hy-MT2-7B, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-7B; Tencent Hy-MT2-30B-A3B released at https://huggingface.co/tencent/Hy-MT2-30B-A3B, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-30B-A3B; Tencent Hy-MT2-1.8B-FP8 released at https://huggingface.co/tencent/Hy-MT2-1.8B-FP8, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-1.8B-FP8; Tencent Hy-MT2-7B-FP8 released at https://huggingface.co/tencent/Hy-MT2-7B-FP8, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-7B-FP8; Tencent Hy-MT2-30B-A3B-FP8 released at https://huggingface.co/tencent/Hy-MT2-30B-A3B-FP8, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-30B-A3B-FP8; Hy-MT2-1.8B-GGUF released at https://huggingface.co/tencent/Hy-MT2-1.8B-GGUF, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-1.8B-GGUF; Hy-MT2-7B-GGUF released at https://huggingface.co/tencent/Hy-MT2-7B-GGUF, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-7B-GGUF.
|
| 18 |
k. “Tencent HY Works” shall mean: (i) the Materials; (ii) Model Derivatives; and (iii) all derivative works thereof.
|
| 19 |
l. “Territory” shall mean the worldwide territory, excluding the territory of the European Union.
|
| 20 |
m. “Third Party” or “Third Parties” shall mean individuals or legal entities that are not under common control with Us or You.
|
|
|
|
| 14 |
g. “Model Derivatives” shall mean all: (i) modifications to Tencent HY or any Model Derivative of Tencent HY; (ii) works based on Tencent HY or any Model Derivative of Tencent HY; or (iii) any other machine learning model which is created by transfer of patterns of the weights, parameters, operations, or Output of Tencent HY or any Model Derivative of Tencent HY, to that model in order to cause that model to perform similarly to Tencent HY or a Model Derivative of Tencent HY, including distillation methods, methods that use intermediate data representations, or methods based on the generation of synthetic data Outputs by Tencent HY or a Model Derivative of Tencent HY for training that model. For clarity, Outputs by themselves are not deemed Model Derivatives.
|
| 15 |
h. “Output” shall mean the information and/or content output of Tencent HY or a Model Derivative that results from operating or otherwise using Tencent HY or a Model Derivative, including via a Hosted Service.
|
| 16 |
i. “Tencent,” “We” or “Us” shall mean the applicable entity or entities in the Tencent corporate family that own(s) intellectual property or other rights embodied in or utilized by the Materials.
|
| 17 |
+
j. “Tencent HY” shall mean the large language models, text/image/video/audio/3D generation models, and multimodal large language models and their software and algorithms, including trained model weights, parameters (including optimizer states), machine-learning model code, inference-enabling code, training-enabling code, fine-tuning enabling code and other elements of the foregoing made publicly available by Us, including, without limitation to, Tencent Hy-MT2-1.8B released at https://huggingface.co/tencent/Hy-MT2-1.8B, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-1.8B; Tencent Hy-MT2-7B released at https://huggingface.co/tencent/Hy-MT2-7B, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-7B; Tencent Hy-MT2-30B-A3B released at https://huggingface.co/tencent/Hy-MT2-30B-A3B, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-30B-A3B; Tencent Hy-MT2-1.8B-FP8 released at https://huggingface.co/tencent/Hy-MT2-1.8B-FP8, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-1.8B-FP8; Tencent Hy-MT2-7B-FP8 released at https://huggingface.co/tencent/Hy-MT2-7B-FP8, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-7B-FP8; Tencent Hy-MT2-30B-A3B-FP8 released at https://huggingface.co/tencent/Hy-MT2-30B-A3B-FP8, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-30B-A3B-FP8; Hy-MT2-1.8B-GGUF released at https://huggingface.co/tencent/Hy-MT2-1.8B-GGUF, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-1.8B-GGUF; Hy-MT2-7B-GGUF released at https://huggingface.co/tencent/Hy-MT2-7B-GGUF, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-7B-GGUF; Hy-MT2-1.8B-2bit-GGUF released at https://huggingface.co/tencent/Hy-MT2-1.8B-2bit-GGUF, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-1.8B-2bit-GGUF; Hy-MT2-1.8B-2bit-GGUF released at https://huggingface.co/tencent/Hy-MT2-1.8B-1.25bit-GGUF, https://modelscope.cn/models/Tencent-Hunyuan/Hy-MT2-1.8B-1.25bit-GGUF.
|
| 18 |
k. “Tencent HY Works” shall mean: (i) the Materials; (ii) Model Derivatives; and (iii) all derivative works thereof.
|
| 19 |
l. “Territory” shall mean the worldwide territory, excluding the territory of the European Union.
|
| 20 |
m. “Third Party” or “Third Parties” shall mean individuals or legal entities that are not under common control with Us or You.
|
README.md
CHANGED
|
@@ -17,22 +17,16 @@
|
|
| 17 |
</div>
|
| 18 |
|
| 19 |
<p align="center">
|
| 20 |
-
🖥️ <a href="https://aistudio.tencent.com/"><b>Official Website</b></a> |
|
| 21 |
💬 <a href="https://github.com/Tencent-Hunyuan/Hy-MT2"><b>GitHub</b></a> |
|
| 22 |
🪡 <a href="https://github.com/Tencent/AngelSlim/tree/main"><b>AngelSlim</b></a></p>
|
| 23 |
|
| 24 |
## Model Introduction
|
| 25 |
|
| 26 |
|
| 27 |
-
|
| 28 |
-
|
| 29 |
-
|
| 30 |
-
|
| 31 |
-
* **General Translation (FLORES-200)**: The average performance of the three models reaches 89.9%, 97.9%, and 98.6% of **Gemini 3.1 Pro (Think)**, respectively. Among them, the 7B and A3B models outperform **DeepSeek-V4-Pro**, while the 1.8B model achieves better overall performance than commercial APIs such as Microsoft Translator.
|
| 32 |
-
* **Real-World Scenarios and Professional Domains (WildMTBench/DomainMTBench)**: The GEMBA scores of the three models reach more than 96%–99% of Gemini 3.1 Pro (Think), and all of them outperform larger open-source models.
|
| 33 |
-
* **Translation Instruction Following (IFMTBench)**: The models significantly outperform open-source models of the same scale, while the A3B model approaches the performance of Gemini 3.1 Pro (Think).
|
| 34 |
-
|
| 35 |
-
In summary, Hy-MT2 is an efficient and powerful translation model series designed for complex real-world scenarios.
|
| 36 |
|
| 37 |
In this release, we also open-source [IFMTBench](./IFMTBench/README.md), a benchmark for evaluating translation instruction-following capabilities.
|
| 38 |
|
|
@@ -40,7 +34,7 @@ We also welcome everyone to use our released Hy-MT2-Translator Skill, which make
|
|
| 40 |
|
| 41 |
## News
|
| 42 |
|
| 43 |
-
* 2026.5.21 We open-sourced **Hy-MT2-1.8B**/**Hy-MT2-7B**/**Hy-MT2-30B-A3B** on HuggingFace and ModelScope.
|
| 44 |
* 2025.12.30 We open-sourced **HY-MT1.5-1.8B** and **HY-MT1.5-7B** on HuggingFace and ModelScope.
|
| 45 |
* 2025.9.1 We open-sourced **Hunyuan-MT-7B** and **Hunyuan-MT-Chimera-7B** on HuggingFace and ModelScope.
|
| 46 |
|
|
@@ -50,21 +44,23 @@ We also welcome everyone to use our released Hy-MT2-Translator Skill, which make
|
|
| 50 |
<img src="imgs/main_result.png" width = "100%" />
|
| 51 |
</div>
|
| 52 |
|
| 53 |
-
For more experimental results and analysis, please refer to our [
|
| 54 |
|
| 55 |
|
| 56 |
|
| 57 |
## Model Links
|
| 58 |
| Model Name | Description | Download Link |
|
| 59 |
| ----------- | ----------- |-----------
|
| 60 |
-
| Hy-MT2-1.8B |
|
| 61 |
-
| Hy-MT2-1.8B-FP8 |
|
| 62 |
-
| Hy-MT2-1.8B-GGUF |
|
| 63 |
-
| Hy-MT2-
|
| 64 |
-
| Hy-MT2-
|
| 65 |
-
| Hy-MT2-7B
|
| 66 |
-
| Hy-MT2-
|
| 67 |
-
| Hy-MT2-
|
|
|
|
|
|
|
| 68 |
|
| 69 |
|
| 70 |
|
|
|
|
| 17 |
</div>
|
| 18 |
|
| 19 |
<p align="center">
|
| 20 |
+
🖥️ <a href="https://aistudio.tencent.com/llm/en?tabIndex=0"><b>Official Website</b></a> |
|
| 21 |
💬 <a href="https://github.com/Tencent-Hunyuan/Hy-MT2"><b>GitHub</b></a> |
|
| 22 |
🪡 <a href="https://github.com/Tencent/AngelSlim/tree/main"><b>AngelSlim</b></a></p>
|
| 23 |
|
| 24 |
## Model Introduction
|
| 25 |
|
| 26 |
|
| 27 |
+
Hy-MT2 is a family of “fast-thinking” multilingual translation models designed for complex real-world scenarios. It includes three model sizes: 1.8B, 7B, and 30B-A3B (MoE), all of which support translation among 33 languages and effectively follow translation instructions in multiple languages.
|
| 28 |
+
For on-device deployment, AngelSlim 1.25-bit extreme quantization reduces the storage requirement of the 1.8B model to only 440 MB and improves inference speed by 1.5x.
|
| 29 |
+
Multi-dimensional evaluations show that Hy-MT2 delivers outstanding performance across general, real-world business, domain-specific, and instruction-following translation tasks. The 7B and 30B-A3B models outperform open-source models such as DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode, while the lightweight 1.8B model also surpasses mainstream commercial APIs from providers such as Microsoft and Doubao overall.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 30 |
|
| 31 |
In this release, we also open-source [IFMTBench](./IFMTBench/README.md), a benchmark for evaluating translation instruction-following capabilities.
|
| 32 |
|
|
|
|
| 34 |
|
| 35 |
## News
|
| 36 |
|
| 37 |
+
* 2026.5.21 We open-sourced **Hy-MT2-1.8B**/**Hy-MT2-7B**/**Hy-MT2-30B-A3B**/**IFMTBench** on HuggingFace and ModelScope.
|
| 38 |
* 2025.12.30 We open-sourced **HY-MT1.5-1.8B** and **HY-MT1.5-7B** on HuggingFace and ModelScope.
|
| 39 |
* 2025.9.1 We open-sourced **Hunyuan-MT-7B** and **Hunyuan-MT-Chimera-7B** on HuggingFace and ModelScope.
|
| 40 |
|
|
|
|
| 44 |
<img src="imgs/main_result.png" width = "100%" />
|
| 45 |
</div>
|
| 46 |
|
| 47 |
+
For more experimental results and analysis, please refer to our [report](./HY_MT2_0_Report.pdf).
|
| 48 |
|
| 49 |
|
| 50 |
|
| 51 |
## Model Links
|
| 52 |
| Model Name | Description | Download Link |
|
| 53 |
| ----------- | ----------- |-----------
|
| 54 |
+
| Hy-MT2-1.8B | Hy 1.8B translation model |🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B)|
|
| 55 |
+
| Hy-MT2-1.8B-FP8 | Hy 1.8B translation model, FP8 quantization | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-FP8)|
|
| 56 |
+
| Hy-MT2-1.8B-GGUF | Hy 1.8B translation model, llama.cpp | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-GGUF)|
|
| 57 |
+
| Hy-MT2-1.8B-2bit-GGUF | Hy 1.8B translation model, llama.cpp, 2bit | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-2bit-GGUF)|
|
| 58 |
+
| Hy-MT2-1.8B-1.25bit-GGUF | Hy 1.8B translation model, llama.cpp, 1.25bit | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-1.25bit-GGUF)|
|
| 59 |
+
| Hy-MT2-7B | Hy 7B translation model | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B)|
|
| 60 |
+
| Hy-MT2-7B-FP8 | Hy 7B translation model, FP8 quantization | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B-FP8)|
|
| 61 |
+
| Hy-MT2-7B-GGUF | Hy 7B translation model, llama.cpp | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B-GGUF)|
|
| 62 |
+
| Hy-MT2-30B-A3B | Hy 30B-A3B translation model | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-30B-A3B)|
|
| 63 |
+
| Hy-MT2-30B-A3B-FP8 | Hy 30B-A3B translation model, FP8 quantization | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-30B-A3B-FP8)|
|
| 64 |
|
| 65 |
|
| 66 |
|
README_CN.md
CHANGED
|
@@ -17,7 +17,7 @@
|
|
| 17 |
</div>
|
| 18 |
|
| 19 |
<p align="center">
|
| 20 |
-
🖥️ <a href="https://aistudio.tencent.com/"><b>官方网站</b></a> |
|
| 21 |
💬 <a href="https://github.com/Tencent-Hunyuan/Hy-MT2"><b>GitHub</b></a> |
|
| 22 |
🪡 <a href="https://github.com/Tencent/AngelSlim/tree/main"><b>AngelSlim</b></a></p>
|
| 23 |
|
|
@@ -25,15 +25,7 @@
|
|
| 25 |
## 模型介绍
|
| 26 |
|
| 27 |
|
| 28 |
-
|
| 29 |
-
|
| 30 |
-
评测结果表明,Hy-MT2 在多场景下表现出众:
|
| 31 |
-
|
| 32 |
-
* **通用翻译(FLORES-200)**:三款模型平均性能分别达到 **Gemini 3.1 Pro (Think)** 的 89.9%、97.9% 和 98.6%。其中 7B 和 A3B 性能超越 **DeepSeek-V4-Pro**,1.8B 综合表现超越微软翻译等商业 API。
|
| 33 |
-
* **真实场景与专业领域(WildMTBench/DomainMTBench)**:三款模型 GEMBA 评分达 Gemini 3.1 Pro (Think) 的 96%~99% 以上,且均优于更大规模的开源模型。
|
| 34 |
-
* **翻译指令遵循(IFMTBench)**:大幅超越同规模开源模型,A3B 性能逼近 Gemini 3.1 Pro (Think)。
|
| 35 |
-
|
| 36 |
-
总之,Hy-MT2 是一个面向真实复杂场景、高效且强大的翻译模型系列。
|
| 37 |
|
| 38 |
同时,本次我们也开源了一个针对翻译指令遵循能力的评测集[IFMTBench](./IFMTBench/README_zh.md)。
|
| 39 |
|
|
@@ -51,7 +43,7 @@
|
|
| 51 |
<img src="imgs/main_result.png" width = "100%" />
|
| 52 |
</div>
|
| 53 |
|
| 54 |
-
更多的实验效果和分析可以参考我们的[
|
| 55 |
|
| 56 |
|
| 57 |
|
|
@@ -61,6 +53,8 @@
|
|
| 61 |
| Hy-MT2-1.8B | 混元1.8B翻译模型 |🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B)|
|
| 62 |
| Hy-MT2-1.8B-FP8 | 混元1.8B翻译模型,fp8量化 | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-FP8)|
|
| 63 |
| Hy-MT2-1.8B-GGUF | 混元1.8B翻译模型, llama.cpp | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-GGUF)|
|
|
|
|
|
|
|
| 64 |
| Hy-MT2-7B | 混元7B翻译模型 | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B)|
|
| 65 |
| Hy-MT2-7B-FP8 | 混元7B翻译模型,fp8量化 | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B-FP8)|
|
| 66 |
| Hy-MT2-7B-GGUF | 混元7B翻译模型, llama.cpp | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B-GGUF)|
|
|
|
|
| 17 |
</div>
|
| 18 |
|
| 19 |
<p align="center">
|
| 20 |
+
🖥️ <a href="https://aistudio.tencent.com/llm/zh?tabIndex=0"><b>官方网站</b></a> |
|
| 21 |
💬 <a href="https://github.com/Tencent-Hunyuan/Hy-MT2"><b>GitHub</b></a> |
|
| 22 |
🪡 <a href="https://github.com/Tencent/AngelSlim/tree/main"><b>AngelSlim</b></a></p>
|
| 23 |
|
|
|
|
| 25 |
## 模型介绍
|
| 26 |
|
| 27 |
|
| 28 |
+
Hy-MT2 是一款面向真实复杂场景的“快思考”多语言翻译模型家族,涵盖 1.8B、7B 和 30B-A3B(MoE)三种体量,支持 33 种语言互译并具备强大的多语言指令遵循能力。在端侧部署上,得益于 AngelSlim 1.25-bit 极端量化,其 1.8B 模型仅需 440MB 存储空间,推理速度显著提升 1.5 倍。多维度评测表明,Hy-MT2 在通用、真实业务、专业领域及指令遵循等翻译任务中表现卓越:7B 和 30B-A3B 模型性能不仅超越了 DeepSeek-V4-Pro、Kimi K2.6 等开源模型在快思考模式下的表现,轻量级 1.8B 模型亦在整体上超越了微软和豆包等主流商业 API。
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 29 |
|
| 30 |
同时,本次我们也开源了一个针对翻译指令遵循能力的评测集[IFMTBench](./IFMTBench/README_zh.md)。
|
| 31 |
|
|
|
|
| 43 |
<img src="imgs/main_result.png" width = "100%" />
|
| 44 |
</div>
|
| 45 |
|
| 46 |
+
更多的实验效果和分析可以参考我们的[报告](./HY_MT2_0_Report.pdf)。
|
| 47 |
|
| 48 |
|
| 49 |
|
|
|
|
| 53 |
| Hy-MT2-1.8B | 混元1.8B翻译模型 |🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B)|
|
| 54 |
| Hy-MT2-1.8B-FP8 | 混元1.8B翻译模型,fp8量化 | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-FP8)|
|
| 55 |
| Hy-MT2-1.8B-GGUF | 混元1.8B翻译模型, llama.cpp | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-GGUF)|
|
| 56 |
+
| Hy-MT2-1.8B-2bit-GGUF | 混元1.8B翻译模型, llama.cpp, 2bit | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-2bit-GGUF)|
|
| 57 |
+
| Hy-MT2-1.8B-1.25bit-GGUF | 混元1.8B翻译模型, llama.cpp, 1.25bit | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-1.8B-1.25bit-GGUF)|
|
| 58 |
| Hy-MT2-7B | 混元7B翻译模型 | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B)|
|
| 59 |
| Hy-MT2-7B-FP8 | 混元7B翻译模型,fp8量化 | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B-FP8)|
|
| 60 |
| Hy-MT2-7B-GGUF | 混元7B翻译模型, llama.cpp | 🤗 [Model](https://huggingface.co/tencent/Hy-MT2-7B-GGUF)|
|
imgs/main_result.png
CHANGED
|
Git LFS Details
|
|
Git LFS Details
|