Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Chattso-GPT
/
sft-exp006-mix-qwen3-4b
like
0
Text Generation
PEFT
Safetensors
u-10bei/structured_data_with_cot_dataset_512_v5
daichira/structured-5k-mix-sft
daichira/structured-hard-sft-4k
English
qlora
lora
structured-output
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
sft-exp006-mix-qwen3-4b (Exp-006 Mix-SFT)
Configuration
sft-exp006-mix-qwen3-4b (Exp-006 Mix-SFT)
LoRA adapter for structured output tasks.
Configuration
Context Length: 1024
Epochs: 1
LR: 1e-4
Loss: Assistant-Only (CoT Masking enabled)
Downloads last month
230
Inference Providers
NEW
Text Generation
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for
Chattso-GPT/sft-exp006-mix-qwen3-4b
Base model
Qwen/Qwen3-4B-Instruct-2507
Adapter
(
5273
)
this model
Datasets used to train
Chattso-GPT/sft-exp006-mix-qwen3-4b
daichira/structured-hard-sft-4k
Viewer
•
Updated
Jan 24
•
4k
•
53
•
1
daichira/structured-5k-mix-sft
Viewer
•
Updated
Jan 24
•
5k
•
24
u-10bei/structured_data_with_cot_dataset_512_v5
Viewer
•
Updated
Jan 23
•
5.68k
•
20
•
3