caizhi1 committed (verified)
Commit 5e2e8d3 · Parent(s): 85b9552

Update README.md

Files changed (1):
  README.md (+8 -7)
README.md CHANGED

@@ -1,8 +1,8 @@
----
-license: mit
-language:
-- en
----
+---
+license: mit
+language:
+- en
+---
 ## Ling-2.6-flash: Faster Responses, Stronger Execution, Higher Token Efficiency
 ### Introduction
 Today, we announce the official open-source release of **Ling-2.6-flash**, an **instruct model** with **104B total parameters** and **7.4B active parameters**.

@@ -69,7 +69,8 @@ uv venv ~/my_ling_env
 
 source ~/my_ling_env/bin/activate
 
-uv pip install sglang
+# uv pip "sglang-kernel>=0.4.1"
+uv pip install "sglang>=0.5.10.post1"
 ```
 
 ##### Run Inference

@@ -169,4 +170,4 @@ Ling-2.6-flash has already made meaningful progress in our pursuit of an extreme
 
 At the same time, we are fully aware that pushing intelligence efficiency to the limit comes with tradeoffs. In some highly complex scenarios, the model can still exhibit **tool hallucinations** due to limited reasoning depth. In addition, there is still room for improvement in areas such as **natural bilingual switching between Chinese and English** and **compliance with highly complex instructions**.
 
-Looking ahead, we will continue exploring the frontier of intelligence efficiency. While preserving the model’s high-efficiency inference characteristics, we aim to further improve the balance between **output quality** and **token efficiency**, and to continuously strengthen the model’s **stability, usability, and interaction experience across a wider range of real-world scenarios**.
+Looking ahead, we will continue exploring the frontier of intelligence efficiency. While preserving the model’s high-efficiency inference characteristics, we aim to further improve the balance between **output quality** and **token efficiency**, and to continuously strengthen the model’s **stability, usability, and interaction experience across a wider range of real-world scenarios**.
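Taken together, the environment-setup steps touched by this commit amount to the following shell session. This is a sketch assembled from the diff, not the full README: the `~/my_ling_env` path comes from the surrounding hunk context, and the `sglang-kernel` line is reproduced as the commit leaves it (commented out; it appears to be missing the word `install`, presumably intended as `uv pip install "sglang-kernel>=0.4.1"`).

```shell
# Create an isolated virtual environment with uv (path from the README)
uv venv ~/my_ling_env

# Activate it for the current shell
source ~/my_ling_env/bin/activate

# Optional kernel package, left commented out in the commit:
# uv pip install "sglang-kernel>=0.4.1"

# The commit's key change: pin sglang to a minimum version
# instead of installing an unconstrained "sglang"
uv pip install "sglang>=0.5.10.post1"
```

The version pin is the substantive change here: the old `uv pip install sglang` could resolve to any release, while the new constraint guarantees a version recent enough to serve this model.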