tencent
/

Hy-MT2-1.8B

hunyuan_v1_dense

text-generation

Model card Files Files and versions

stevenkuang commited on 2 days ago

Commit

bad21e9

·

verified ·

1 Parent(s): 283296c

Update README.md

Files changed (1) hide show

README.md +18 -2

README.md CHANGED Viewed

@@ -8,13 +8,13 @@ pip install transformers==4.56.0
 The following code snippet shows how to use the transformers library to load and apply the model.
-we use tencent/HY-MT1.5-1.8B for example
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 import os
-model_name_or_path = "tencent/HY-MT1.5-1.8B"
 tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
 model = AutoModelForCausalLM.from_pretrained(model_name_or_path, device_map="auto")  # You may want to use bfloat16 and/or move to GPU here
@@ -41,4 +41,20 @@ We recommend using the following set of parameters for inference. Note that our
   "repetition_penalty": 1.05,
   "temperature": 0.7
 }
 ```

 The following code snippet shows how to use the transformers library to load and apply the model.
+we use tencent/Hy-MT2-1.8B for example
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 import os
+model_name_or_path = "tencent/Hy-MT2-1.8B"
 tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
 model = AutoModelForCausalLM.from_pretrained(model_name_or_path, device_map="auto")  # You may want to use bfloat16 and/or move to GPU here
   "repetition_penalty": 1.05,
   "temperature": 0.7
 }
+```
+### Use with vllm
+Start the vLLM server:
+```bash
+vllm serve tencent/Hy-MT2-1.8B --tensor-parallel-size 1
+```
+### Use with sglang
+Launch SGLang server:
+```bash
+python3 -m sglang.launch_server --model tencent/Hy-MT2-1.8B --tp 1
 ```