Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Nanbeige
/
Nanbeige4.1-3B
like
1.09k
Follow
Nanbeige LLM Lab
861
Text Generation
Transformers
Safetensors
English
Chinese
llama
llm
nanbeige
conversational
Eval Results
text-generation-inference
arxiv:
2602.13367
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
43
Deploy
Use this model
Chain-of-Thought or Chain-of-Mimicry? The Over-SFT problem in Nanbeige 4.1-3B aka "I_Should_X"
#26
by
srs6901
- opened
Feb 19
Discussion
srs6901
Feb 19
This comment has been hidden (marked as Resolved)
srs6901
Feb 20
This comment has been hidden (marked as Resolved)
leran1995
changed discussion status to
closed
Feb 22
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment