Update README.md
README.md CHANGED

@@ -1,3 +1,12 @@
+---
+license: apache-2.0
+language:
+- zh
+- en
+base_model:
+- Qwen/Qwen3-4B-Base
+pipeline_tag: text-generation
+---
 <div align="center">
 <h1>Kwai Summary Attention (KSA)</h1>
 <p align="center">

@@ -283,4 +292,4 @@ KSA is built upon and inspired by the open-source ecosystem. We would like to th
 - **HuggingFace Transformers** — for the model / tokenizer / generation abstractions that make `trust_remote_code` deployment painless.
 - **PyTorch distributed training** — for FSDP, DCP, and the communication primitives that make large-scale pretraining tractable.
 
-We sincerely thank these projects for their outstanding work.
+We sincerely thank these projects for their outstanding work.