webai-community
/

ai-models

ONNX

GGUF

Model card Files Files and versions

xet

Community

ErazerControl commited on 12 days ago

Commit

e0b4570

verified ·

1 Parent(s): 84a8fe7

update phi-4-multimodal and whisper

Browse files

Files changed (1) hide show

README.md +76 -74

README.md CHANGED Viewed

@@ -1,75 +1,77 @@
-# Download
-Download a specific WebGPU model:
-```bash
-huggingface-cli download webai-community/ai-models --include "ai-models/<MODEL_NAME>/onnx-webgpu/*" --local-dir .
-```
-Download all WebGPU models:
-```bash
-huggingface-cli download webai-community/ai-models --include "*/onnx-webgpu/*" --local-dir .
-```
-# Model List
-| model name | params size | gguf model | ort webgpu model | model info |
-| --- | --- | --- | --- | --- |
-| Phi-4-mini-instruct | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-mini-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-mini-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-4-mini-instruct/README.md) |
-| Phi-4-mini-reasoning | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-mini-reasoning/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-mini-reasoning/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-4-mini-reasoning/README.md) |
-| Phi-3.5-mini-instruct | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3.5-mini-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3.5-mini-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-3.5-mini-instruct/README.md) |
-| Phi-3-mini-4k-instruct | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3-mini-4k-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3-mini-4k-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-3-mini-4k-instruct/README.md) |
-| Phi-3-mini-128k-instruct | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3-mini-128k-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3-mini-128k-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-3-mini-128k-instruct/README.md) |
-| Qwen3-0.6B | 0.6B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-0.6B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-0.6B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen3-0.6B/README.md) |
-| Qwen3-1.7B | 1.7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-1.7B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-1.7B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen3-1.7B/README.md) |
-| Qwen3-4B | 4B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-4B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-4B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen3-4B/README.md) |
-| Qwen3-8B | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-8B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-8B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen3-8B/README.md) |
-| Qwen2.5-0.5B-Instruct | 0.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-0.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-0.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-0.5B-Instruct/README.md) |
-| Qwen2.5-1.5B-Instruct | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-1.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-1.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-1.5B-Instruct/README.md) |
-| Qwen2.5-3B-Instruct | 3B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-3B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-3B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-3B-Instruct/README.md) |
-| Qwen2.5-7B-Instruct | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-7B-Instruct/README.md) |
-| Qwen2-0.5B-Instruct | 0.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-0.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-0.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2-0.5B-Instruct/README.md) |
-| Qwen2-1.5B-Instruct | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-1.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-1.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2-1.5B-Instruct/README.md) |
-| Qwen2-7B-Instruct | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2-7B-Instruct/README.md) |
-| DeepSeek-R1-Distill-Qwen-1.5B | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Qwen-1.5B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Qwen-1.5B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/DeepSeek-R1-Distill-Qwen-1.5B/README.md) |
-| DeepSeek-R1-Distill-Qwen-7B | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Qwen-7B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Qwen-7B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/DeepSeek-R1-Distill-Qwen-7B/README.md) |
-| DeepSeek-R1-Distill-Llama-8B | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Llama-8B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Llama-8B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/DeepSeek-R1-Distill-Llama-8B/README.md) |
-| DeepSeek-R1-0528-Qwen3-8B | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-0528-Qwen3-8B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-0528-Qwen3-8B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/DeepSeek-R1-0528-Qwen3-8B/README.md) |
-| gemma-3-1b-it | 1B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-3-1b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-3-1b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-3-1b-it/README.md) |
-| gemma-2-2b-it | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2-2b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2-2b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-2-2b-it/README.md) |
-| gemma-2-9b-it | 9B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2-9b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2-9b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-2-9b-it/README.md) |
-| gemma-2b-it | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-2b-it/README.md) |
-| gemma-7b-it | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-7b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-7b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-7b-it/README.md) |
-| internlm2_5-7b-chat | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/internlm2_5-7b-chat/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/internlm2_5-7b-chat/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/internlm2_5-7b-chat/README.md) |
-| internlm2-chat-1_8b | 1.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/internlm2-chat-1_8b/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/internlm2-chat-1_8b/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/internlm2-chat-1_8b/README.md) |
-| internlm2-chat-7b | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/internlm2-chat-7b/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/internlm2-chat-7b/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/internlm2-chat-7b/README.md) |
-| Nemotron-Mini-4B-Instruct | 4B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Nemotron-Mini-4B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Nemotron-Mini-4B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Nemotron-Mini-4B-Instruct/README.md) |
-| Nemotron-Cascade-8B-Thinking | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Nemotron-Cascade-8B-Thinking/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Nemotron-Cascade-8B-Thinking/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Nemotron-Cascade-8B-Thinking/README.md) |
-| SmolLM2-1.7B-Instruct | 1.7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-1.7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-1.7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM2-1.7B-Instruct/README.md) |
-| SmolLM2-360M-Instruct | 360M | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-360M-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-360M-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM2-360M-Instruct/README.md) |
-| SmolLM2-135M-Instruct | 135M | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-135M-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-135M-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM2-135M-Instruct/README.md) |
-| SmolLM-1.7B-Instruct | 1.7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-1.7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-1.7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM-1.7B-Instruct/README.md) |
-| SmolLM-360M-Instruct | 360M | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-360M-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-360M-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM-360M-Instruct/README.md) |
-| SmolLM-135M-Instruct | 135M | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-135M-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-135M-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM-135M-Instruct/README.md) |
-| Yi-Coder-1.5B-Chat | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Yi-Coder-1.5B-Chat/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Yi-Coder-1.5B-Chat/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Yi-Coder-1.5B-Chat/README.md) |
-| Qwen2.5-Coder-0.5B-Instruct | 0.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-0.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-0.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-Coder-0.5B-Instruct/README.md) |
-| Qwen2.5-Coder-1.5B-Instruct | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-1.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-1.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-Coder-1.5B-Instruct/README.md) |
-| Qwen2.5-Coder-7B-Instruct | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-Coder-7B-Instruct/README.md) |
-| TinyLlama-1.1B-Chat-v1.0 | 1.1B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/TinyLlama-1.1B-Chat-v1.0/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/TinyLlama-1.1B-Chat-v1.0/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/TinyLlama-1.1B-Chat-v1.0/README.md) |
-| CodeLlama-7b-Instruct-hf | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/CodeLlama-7b-Instruct-hf/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/CodeLlama-7b-Instruct-hf/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/CodeLlama-7b-Instruct-hf/README.md) |
-| SOLAR-10.7B-Instruct-v1.0 | 10.7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SOLAR-10.7B-Instruct-v1.0/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SOLAR-10.7B-Instruct-v1.0/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SOLAR-10.7B-Instruct-v1.0/README.md) |
-| gpt-oss-20b | 20B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gpt-oss-20b/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gpt-oss-20b/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gpt-oss-20b/README.md) |
-| granite-3.1-2b-instruct | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.1-2b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.1-2b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.1-2b-instruct/README.md) |
-| granite-3.1-8b-instruct | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.1-8b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.1-8b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.1-8b-instruct/README.md) |
-| granite-3.2-2b-instruct | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.2-2b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.2-2b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.2-2b-instruct/README.md) |
-| granite-3.2-8b-instruct | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.2-8b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.2-8b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.2-8b-instruct/README.md) |
-| granite-3.3-2b-instruct | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.3-2b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.3-2b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.3-2b-instruct/README.md) |
-| granite-3.3-8b-instruct | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.3-8b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.3-8b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.3-8b-instruct/README.md) |
-| Ministral-8B-Instruct-2410 | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Ministral-8B-Instruct-2410/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Ministral-8B-Instruct-2410/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Ministral-8B-Instruct-2410/README.md) |
-| Mistral-7B-Instruct-v0.2 | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-7B-Instruct-v0.2/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-7B-Instruct-v0.2/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Mistral-7B-Instruct-v0.2/README.md) |
-| Mistral-7B-Instruct-v0.3 | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-7B-Instruct-v0.3/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-7B-Instruct-v0.3/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Mistral-7B-Instruct-v0.3/README.md) |
-| Mistral-Nemo-Instruct-2407 | 12B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-Nemo-Instruct-2407/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-Nemo-Instruct-2407/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Mistral-Nemo-Instruct-2407/README.md) |
-| Yi-1.5-6B-Chat | 6B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Yi-1.5-6B-Chat/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Yi-1.5-6B-Chat/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Yi-1.5-6B-Chat/README.md) |
 | Yi-1.5-9B-Chat | 9B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Yi-1.5-9B-Chat/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Yi-1.5-9B-Chat/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Yi-1.5-9B-Chat/README.md) |

+# Download
+Download a specific WebGPU model:
+```bash
+huggingface-cli download webai-community/ai-models --include "ai-models/<MODEL_NAME>/onnx-webgpu/*" --local-dir .
+```
+Download all WebGPU models:
+```bash
+huggingface-cli download webai-community/ai-models --include "*/onnx-webgpu/*" --local-dir .
+```
+# Model List
+| model name | params size | gguf model | ort webgpu model | model info |
+| --- | --- | --- | --- | --- |
+| Phi-4-mini-instruct | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-mini-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-mini-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-4-mini-instruct/README.md) |
+| Phi-4-mini-reasoning | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-mini-reasoning/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-mini-reasoning/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-4-mini-reasoning/README.md) |
+| Phi-4-multimodal-instruct| 6B |  | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-4-multimodal-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-4-multimodal-instruct/README.md) |
+| Phi-3.5-mini-instruct | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3.5-mini-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3.5-mini-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-3.5-mini-instruct/README.md) |
+| Phi-3-mini-4k-instruct | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3-mini-4k-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3-mini-4k-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-3-mini-4k-instruct/README.md) |
+| Phi-3-mini-128k-instruct | 3.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3-mini-128k-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Phi-3-mini-128k-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Phi-3-mini-128k-instruct/README.md) |
+| Qwen3-0.6B | 0.6B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-0.6B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-0.6B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen3-0.6B/README.md) |
+| Qwen3-1.7B | 1.7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-1.7B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-1.7B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen3-1.7B/README.md) |
+| Qwen3-4B | 4B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-4B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-4B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen3-4B/README.md) |
+| Qwen3-8B | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-8B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen3-8B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen3-8B/README.md) |
+| Qwen2.5-0.5B-Instruct | 0.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-0.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-0.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-0.5B-Instruct/README.md) |
+| Qwen2.5-1.5B-Instruct | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-1.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-1.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-1.5B-Instruct/README.md) |
+| Qwen2.5-3B-Instruct | 3B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-3B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-3B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-3B-Instruct/README.md) |
+| Qwen2.5-7B-Instruct | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-7B-Instruct/README.md) |
+| Qwen2-0.5B-Instruct | 0.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-0.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-0.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2-0.5B-Instruct/README.md) |
+| Qwen2-1.5B-Instruct | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-1.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-1.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2-1.5B-Instruct/README.md) |
+| Qwen2-7B-Instruct | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2-7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2-7B-Instruct/README.md) |
+| DeepSeek-R1-Distill-Qwen-1.5B | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Qwen-1.5B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Qwen-1.5B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/DeepSeek-R1-Distill-Qwen-1.5B/README.md) |
+| DeepSeek-R1-Distill-Qwen-7B | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Qwen-7B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Qwen-7B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/DeepSeek-R1-Distill-Qwen-7B/README.md) |
+| DeepSeek-R1-Distill-Llama-8B | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Llama-8B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-Distill-Llama-8B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/DeepSeek-R1-Distill-Llama-8B/README.md) |
+| DeepSeek-R1-0528-Qwen3-8B | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-0528-Qwen3-8B/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/DeepSeek-R1-0528-Qwen3-8B/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/DeepSeek-R1-0528-Qwen3-8B/README.md) |
+| gemma-3-1b-it | 1B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-3-1b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-3-1b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-3-1b-it/README.md) |
+| gemma-2-2b-it | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2-2b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2-2b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-2-2b-it/README.md) |
+| gemma-2-9b-it | 9B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2-9b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2-9b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-2-9b-it/README.md) |
+| gemma-2b-it | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-2b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-2b-it/README.md) |
+| gemma-7b-it | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gemma-7b-it/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gemma-7b-it/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gemma-7b-it/README.md) |
+| internlm2_5-7b-chat | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/internlm2_5-7b-chat/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/internlm2_5-7b-chat/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/internlm2_5-7b-chat/README.md) |
+| internlm2-chat-1_8b | 1.8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/internlm2-chat-1_8b/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/internlm2-chat-1_8b/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/internlm2-chat-1_8b/README.md) |
+| internlm2-chat-7b | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/internlm2-chat-7b/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/internlm2-chat-7b/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/internlm2-chat-7b/README.md) |
+| Nemotron-Mini-4B-Instruct | 4B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Nemotron-Mini-4B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Nemotron-Mini-4B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Nemotron-Mini-4B-Instruct/README.md) |
+| Nemotron-Cascade-8B-Thinking | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Nemotron-Cascade-8B-Thinking/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Nemotron-Cascade-8B-Thinking/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Nemotron-Cascade-8B-Thinking/README.md) |
+| SmolLM2-1.7B-Instruct | 1.7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-1.7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-1.7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM2-1.7B-Instruct/README.md) |
+| SmolLM2-360M-Instruct | 360M | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-360M-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-360M-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM2-360M-Instruct/README.md) |
+| SmolLM2-135M-Instruct | 135M | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-135M-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM2-135M-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM2-135M-Instruct/README.md) |
+| SmolLM-1.7B-Instruct | 1.7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-1.7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-1.7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM-1.7B-Instruct/README.md) |
+| SmolLM-360M-Instruct | 360M | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-360M-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-360M-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM-360M-Instruct/README.md) |
+| SmolLM-135M-Instruct | 135M | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-135M-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SmolLM-135M-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SmolLM-135M-Instruct/README.md) |
+| Yi-Coder-1.5B-Chat | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Yi-Coder-1.5B-Chat/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Yi-Coder-1.5B-Chat/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Yi-Coder-1.5B-Chat/README.md) |
+| Qwen2.5-Coder-0.5B-Instruct | 0.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-0.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-0.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-Coder-0.5B-Instruct/README.md) |
+| Qwen2.5-Coder-1.5B-Instruct | 1.5B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-1.5B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-1.5B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-Coder-1.5B-Instruct/README.md) |
+| Qwen2.5-Coder-7B-Instruct | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-7B-Instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Qwen2.5-Coder-7B-Instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Qwen2.5-Coder-7B-Instruct/README.md) |
+| TinyLlama-1.1B-Chat-v1.0 | 1.1B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/TinyLlama-1.1B-Chat-v1.0/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/TinyLlama-1.1B-Chat-v1.0/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/TinyLlama-1.1B-Chat-v1.0/README.md) |
+| CodeLlama-7b-Instruct-hf | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/CodeLlama-7b-Instruct-hf/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/CodeLlama-7b-Instruct-hf/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/CodeLlama-7b-Instruct-hf/README.md) |
+| SOLAR-10.7B-Instruct-v1.0 | 10.7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/SOLAR-10.7B-Instruct-v1.0/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/SOLAR-10.7B-Instruct-v1.0/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/SOLAR-10.7B-Instruct-v1.0/README.md) |
+| whisper-tiny | 0.39B |  | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/whisper-tiny/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/whisper-tiny/README.md) |
+| gpt-oss-20b | 20B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/gpt-oss-20b/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/gpt-oss-20b/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/gpt-oss-20b/README.md) |
+| granite-3.1-2b-instruct | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.1-2b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.1-2b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.1-2b-instruct/README.md) |
+| granite-3.1-8b-instruct | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.1-8b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.1-8b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.1-8b-instruct/README.md) |
+| granite-3.2-2b-instruct | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.2-2b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.2-2b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.2-2b-instruct/README.md) |
+| granite-3.2-8b-instruct | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.2-8b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.2-8b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.2-8b-instruct/README.md) |
+| granite-3.3-2b-instruct | 2B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.3-2b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.3-2b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.3-2b-instruct/README.md) |
+| granite-3.3-8b-instruct | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.3-8b-instruct/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/granite-3.3-8b-instruct/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/granite-3.3-8b-instruct/README.md) |
+| Ministral-8B-Instruct-2410 | 8B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Ministral-8B-Instruct-2410/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Ministral-8B-Instruct-2410/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Ministral-8B-Instruct-2410/README.md) |
+| Mistral-7B-Instruct-v0.2 | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-7B-Instruct-v0.2/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-7B-Instruct-v0.2/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Mistral-7B-Instruct-v0.2/README.md) |
+| Mistral-7B-Instruct-v0.3 | 7B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-7B-Instruct-v0.3/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-7B-Instruct-v0.3/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Mistral-7B-Instruct-v0.3/README.md) |
+| Mistral-Nemo-Instruct-2407 | 12B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-Nemo-Instruct-2407/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Mistral-Nemo-Instruct-2407/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Mistral-Nemo-Instruct-2407/README.md) |
+| Yi-1.5-6B-Chat | 6B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Yi-1.5-6B-Chat/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Yi-1.5-6B-Chat/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Yi-1.5-6B-Chat/README.md) |
 | Yi-1.5-9B-Chat | 9B | [gguf](https://huggingface.co/webai-community/ai-models/tree/main/Yi-1.5-9B-Chat/gguf) | [onnx-webgpu](https://huggingface.co/webai-community/ai-models/tree/main/Yi-1.5-9B-Chat/onnx-webgpu) | [README](https://huggingface.co/webai-community/ai-models/blob/main/Yi-1.5-9B-Chat/README.md) |