Pratyush-01
/

physix-3b-rl-ckpt

Generated from Trainer

Model card Files Files and versions

physix-3b-rl-ckpt / sft

Ctrl+K

Ctrl+K

1 contributor

History: 5 commits

Pratyush-01's picture

SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=4 lora_r=32

8424cdf verified 12 days ago

added_tokens.json

632 Bytes
SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago
chat_template.jinja

2.51 kB
SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago
config.json

1.72 kB
SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago
merges.txt

1.67 MB
SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago
model-00001-of-00002.safetensors

3.97 GB
xet

SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=4 lora_r=32 12 days ago
model-00002-of-00002.safetensors

2.2 GB
xet

SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=4 lora_r=32 12 days ago
model.safetensors.index.json

35.6 kB
SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago
special_tokens_map.json

499 Bytes
SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago
tokenizer.json

11.4 MB
xet

SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago
tokenizer_config.json

7.54 kB
SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago
vocab.json

2.78 MB
SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32 12 days ago