amang1802/Llama3.2-1B-summary-length-exp5
Tags: Text Generation, Transformers, Safetensors, llama, conversational, text-generation-inference
Model Card for Model ID
Summary Length PPO experiment #5
No KL divergence term in the loss
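As a rough illustration of a PPO policy loss with no KL divergence term, the sketch below computes the standard clipped surrogate objective in plain Python. The function name `ppo_clip_loss`, the per-sample inputs, and the clip range of 0.2 are assumptions for illustration; the card does not say which PPO implementation or clip value this experiment used.

```python
import math


def ppo_clip_loss(logp_new, logp_old, advantages, clip_eps=0.2):
    """Clipped PPO policy loss with no KL penalty added.

    logp_new / logp_old: log-probabilities of the sampled actions under the
    current policy and the policy that generated the samples.
    advantages: advantage estimates for the same samples.
    clip_eps: clip range (0.2 is an assumed value, not taken from the card).
    """
    terms = []
    for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages):
        # Probability ratio between the current and the sampling policy.
        ratio = math.exp(lp_new - lp_old)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic (minimum) surrogate, as in the PPO-clip objective.
        terms.append(min(ratio * adv, clipped * adv))
    # Negate because optimizers minimize; note there is no KL term here.
    return -sum(terms) / len(terms)
```

Without a KL penalty, the clip range is the only thing in this loss that limits how far a single update can move the policy from the one that produced the samples.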
Model Details
Dataset size: 1024
Epochs: 1
Batch size: 4 * 8 = 32 effective (with gradient accumulation)
Optimizer: PyTorch AdamW with default arguments, except LR = 0.00001 (1e-5)
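The settings above imply a small number of optimizer updates, which the following arithmetic sketch makes explicit. It assumes that 4 is the micro-batch size and 8 is the number of gradient accumulation steps; the card does not say which factor is which, and the constant names are illustrative. The effective batch size and the update count are the same under either reading.

```python
DATASET_SIZE = 1024
EPOCHS = 1
MICRO_BATCH_SIZE = 4      # assumption: samples per forward/backward pass
ACCUMULATION_STEPS = 8    # assumption: passes accumulated per optimizer step
LEARNING_RATE = 1e-5      # the only AdamW argument changed from its default

# Samples contributing to each optimizer update.
effective_batch = MICRO_BATCH_SIZE * ACCUMULATION_STEPS

# Forward/backward passes and optimizer updates over the whole run.
micro_batches = EPOCHS * DATASET_SIZE // MICRO_BATCH_SIZE
optimizer_steps = EPOCHS * DATASET_SIZE // effective_batch

print(effective_batch, micro_batches, optimizer_steps)
```

With one epoch over 1024 samples, the run performs 32 optimizer updates at an effective batch size of 32.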
Model size: 1B params
Tensor type: BF16