Upload merged Qwen3-4B-Instruct-2507 model (auto-generated README)

Files changed (3)

README.md CHANGED

@@ -36,10 +36,10 @@ tool use, and recovery from errors.
 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: LoRA (full precision base)
-- Max sequence length: 2560
-- Epochs: 3
-- Learning rate: 2e-06
-- LoRA: r=128, alpha=256
+- Max sequence length: 2048
+- Epochs: 2
+- Learning rate: 1e-06
+- LoRA: r=64, alpha=128
 ## Usage
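
The hyperparameter change in this hunk can be compared side by side. The following is a minimal sketch, not the training script used for this model: `TrainConfig` and its field names are hypothetical, and the scaling factor assumes the conventional LoRA definition `alpha / r`, which this repository does not confirm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    """Hypothetical container for the hyperparameters listed in the README."""

    max_seq_len: int
    epochs: int
    learning_rate: float
    lora_r: int
    lora_alpha: int

    @property
    def lora_scaling(self) -> float:
        # Assumption: conventional LoRA scaling of the low-rank update (alpha / r).
        return self.lora_alpha / self.lora_r


# Values from the removed ("-") and added ("+") README lines.
old = TrainConfig(max_seq_len=2560, epochs=3, learning_rate=2e-06, lora_r=128, lora_alpha=256)
new = TrainConfig(max_seq_len=2048, epochs=2, learning_rate=1e-06, lora_r=64, lora_alpha=128)

for name, cfg in (("old", old), ("new", new)):
    print(f"{name}: r={cfg.lora_r}, alpha={cfg.lora_alpha}, scaling={cfg.lora_scaling}")
```

Under that assumption, both configurations keep `alpha / r` at 2.0, so the rank is halved while the adapter scaling is unchanged.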

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cc08ee561fbb079f2700873d0698cef760516a670c58fbe23b913056a1d0bbad
+oid sha256:6721613c3dc9f8f2fee11abba4768b4e6b72d3665ba090692263990846e28c86
 size 4967215360

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:79a8a96f0de3bf49b01df10afc3e2ff9d0a7a7c5b8924265b8396b252c21c463
+oid sha256:ef1ba7c93242651862b2bdf0de484212b427d5c0568e989b26a1d6c9a4f0254e
 size 3077766632
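
Both shards keep their byte size while the `oid` changes, so the weights were rewritten in place. A downloaded shard can be checked against its Git LFS pointer, because the `oid` is the SHA-256 of the file contents and `size` is its length in bytes. The following is a minimal sketch; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any library.

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: str, pointer_text: str, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and SHA-256 named in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    # Cheap check first: a size mismatch avoids hashing several gigabytes.
    if os.path.getsize(path) != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Stream in chunks so multi-gigabyte shards are never loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected
```

To use it, pass the local shard path and the pointer text from the new side of the diff, for example `matches_pointer("model-00002-of-00002.safetensors", pointer_text)`.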