zhsh17's picture
add arXiv link
5325475
---
title: EdgeRazor Playground
emoji: 🚀
colorFrom: blue
colorTo: gray
sdk: gradio
sdk_version: 6.5.1
python_version: 3.12.2
app_file: app.py
pinned: true
license: apache-2.0
---
<div align="center">
<br/>
<img src="https://raw.githubusercontent.com/zhangsq-nju/EdgeRazor/main/asset/Logo-full.png" alt="EdgeRazor Logo" width="60%">
<h3>
Lightweight Framework for Edge AI
</h3>
<p>
<a href="https://arxiv.org/abs/2605.04062" target="_blank">
<img src="https://img.shields.io/badge/arXiv-EdgeRazor-b31b1b?style=flat&logo=arxiv" alt="arXiv EdgeRazor">
</a>
<a href="https://github.com/zhangsq-nju/EdgeRazor" target="blank">
<img src="https://img.shields.io/badge/GitHub-EdgeRazor-blue?style=flat&logo=github" alt="GitHub EdgeRazor">
</a>
</p>
<h5>
✨ If you like our project, please consider giving us ❤️.
</h5>
</div>
## EdgeRazor Playground
A CPU-friendly chatbot powered by **[Qwen3-EdgeRazor-nbit](https://huggingface.co/collections/zhangsq-nju/edgerazor-nbit)**, running locally via [llama.cpp](https://github.com/ggerganov/llama.cpp). Displays real-time efficiency metrics (output tokens, time, decoding throughput) per turn.
## Dependencies
- [llama-cpp-python](https://abetlen.github.io/llama-cpp-python/whl/cpu/llama-cpp-python)
- Qwen3-EdgeRazor-nbit gguf files:
- [Qwen3-0.6B-EdgeRazor-GGUF](https://huggingface.co/zhangsq-nju/Qwen3-0.6B-EdgeRazor-GGUF)
- [Qwen3-1.7B-EdgeRazor-GGUF](https://huggingface.co/zhangsq-nju/Qwen3-1.7B-EdgeRazor-GGUF)