Instructions to use cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO")
model = AutoModelForCausalLM.from_pretrained("cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO

SGLang

How to use cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO with Docker Model Runner:
```
docker model run hf.co/cfei621/DeepSeek-R1-Distill-Llama-8B-GRPO
```

DeepSeek-R1-Distill-Llama-8B-GRPO / trainer_state.json

Commit History

Model save

835faae
verified

cfei621 commited on Apr 11, 2025

Model save

2a95588
verified

cfei621 commited on Apr 7, 2025

Model save

fc231c1
verified

cfei621 commited on Apr 5, 2025

Model save

2455459
verified

cfei621 commited on Apr 3, 2025

Model save

ce0e22c
verified

cfei621 commited on Mar 26, 2025

Model save

66f6bf5
verified

cfei621 commited on Mar 26, 2025

Model save

3b3391d
verified

cfei621 commited on Mar 24, 2025

Model save

57e084e
verified

cfei621 commited on Mar 22, 2025

Commit History

Model save 835faae verified

Model save 2a95588 verified

Model save fc231c1 verified

Model save 2455459 verified

Model save ce0e22c verified

Model save 66f6bf5 verified

Model save 3b3391d verified

Model save 57e084e verified