Instructions to use leapeto/Qwen3-4B-AbstractCoT-warmup with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use leapeto/Qwen3-4B-AbstractCoT-warmup with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="leapeto/Qwen3-4B-AbstractCoT-warmup")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("leapeto/Qwen3-4B-AbstractCoT-warmup", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use leapeto/Qwen3-4B-AbstractCoT-warmup with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "leapeto/Qwen3-4B-AbstractCoT-warmup" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "leapeto/Qwen3-4B-AbstractCoT-warmup", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/leapeto/Qwen3-4B-AbstractCoT-warmup
- SGLang
How to use leapeto/Qwen3-4B-AbstractCoT-warmup with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "leapeto/Qwen3-4B-AbstractCoT-warmup" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "leapeto/Qwen3-4B-AbstractCoT-warmup", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "leapeto/Qwen3-4B-AbstractCoT-warmup" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "leapeto/Qwen3-4B-AbstractCoT-warmup", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use leapeto/Qwen3-4B-AbstractCoT-warmup with Docker Model Runner:
docker model run hf.co/leapeto/Qwen3-4B-AbstractCoT-warmup
| { | |
| "losses": [ | |
| 0.5806382041424513, | |
| 0.5243994968011976, | |
| 0.49262564033269884, | |
| 0.4333665704354644, | |
| 0.373029216285795, | |
| 0.3593967686872929, | |
| 0.39847223716787994, | |
| 0.37207798319868746, | |
| 0.3787895118817687, | |
| 0.3720124014187604, | |
| 0.3636292540933937, | |
| 0.3601828854967607, | |
| 0.3554463139735162, | |
| 0.3865779357030988, | |
| 0.32862058384343984, | |
| 0.36783338263630866, | |
| 0.3428428391227499, | |
| 0.34551644229795786, | |
| 0.3680351444054395, | |
| 0.3469195322133601, | |
| 0.3622684331610799, | |
| 0.37623543343506755, | |
| 0.36850376506336036, | |
| 0.345283712586388, | |
| 0.3425974382087588, | |
| 0.4011214487836696, | |
| 0.3654101203195751, | |
| 0.3157559605082497, | |
| 0.36133123533800243, | |
| 0.35812310164328665, | |
| 0.34157210728153586 | |
| ], | |
| "lrs": [ | |
| 6.666666666666667e-05, | |
| 9.993008576227247e-05, | |
| 9.937194443381972e-05, | |
| 9.826190093588563e-05, | |
| 9.661236384224129e-05, | |
| 9.444177243274618e-05, | |
| 9.177439057064683e-05, | |
| 8.864003547001915e-05, | |
| 8.507374438531607e-05, | |
| 8.111538294891684e-05, | |
| 7.680919953486048e-05, | |
| 7.220333063028872e-05, | |
| 6.734926274378312e-05, | |
| 6.230125686563068e-05, | |
| 5.7115741913664264e-05, | |
| 5.185068394501791e-05, | |
| 4.6564938185035956e-05, | |
| 4.131759111665349e-05, | |
| 3.616729998467365e-05, | |
| 3.1171637098265064e-05, | |
| 2.638644626136587e-05, | |
| 2.1865218525109495e-05, | |
| 1.7658494240397126e-05, | |
| 1.3813298094746491e-05, | |
| 1.037261344883343e-05, | |
| 7.374901848832683e-06, | |
| 4.853673085668947e-06, | |
| 2.8371106072518195e-06, | |
| 1.3477564710088098e-06, | |
| 4.02259358460233e-07, | |
| 1.1188468644907079e-08 | |
| ], | |
| "wallclock_s": 942, | |
| "n_examples": 5000, | |
| "epochs": 1, | |
| "mode": "distill", | |
| "lora_rank": 32, | |
| "total_opt_steps": 156, | |
| "num_processes": 2 | |
| } |