Inference Providers
Active filters: web-llm
mlc-ai/CodeLlama-70b-Python-hf-q3f16_1-MLC
Updated • 3
• 1
mlc-ai/CodeLlama-70b-Python-hf-q4f16_1-MLC
Updated • 5
• 1
mlc-ai/stablelm-2-zephyr-1_6b-q0f16_1-MLC
Updated
mlc-ai/stablelm-zephyr-3b-q0f16-MLC
mlc-ai/stablelm-zephyr-3b-q0f32-MLC
mlc-ai/stablelm-zephyr-3b-q4f16_1-MLC
mlc-ai/stablelm-zephyr-3b-q4f32_1-MLC
Updated • 4
• 1
mlc-ai/gemma-2b-it-q4f16_1-MLC
Updated • 3.71k
• 6
mlc-ai/gemma-2b-it-q4f32_1-MLC
Updated • 545
• 2
mlc-ai/gemma-2b-it-q0f16-MLC
mlc-ai/gemma-2b-it-q0f32-MLC
mlc-ai/gemma-7b-it-q0f16-MLC
mlc-ai/gemma-7b-it-q4f16_2-MLC
Updated • 6
• 1
mlc-ai/gorilla-openfunctions-v1-q4f16_1-MLC
mlc-ai/gorilla-openfunctions-v2-q4f16_1-MLC
Updated • 4
• 2
mlc-ai/gorilla-openfunctions-v2-q4f32_1-MLC
mlc-ai/gemma-2b-it-q4f16_0-MLC
mlc-ai/Qwen1.5-MoE-A2.7B-Chat-q4f16_1-MLC
Updated • 6
• 3
mlc-ai/Llama-3-8B-Instruct-q0f32-MLC
mlc-ai/Llama-3-70B-Instruct-q3f16_1-MLC
Updated • 50
• 1
mlc-ai/Llama-3-8B-Instruct-q3f16_2-MLC
Updated • 6
• 2
mlc-ai/Hermes-2-Pro-Mistral-7B-q4f16_1-MLC
Updated • 191
• 3
mlc-ai/Phi-3-mini-4k-instruct-q0f32-MLC
Updated • 3
• 1
mlc-ai/Phi-3-mini-4k-instruct-q0f16-MLC
mlc-ai/Phi-3-mini-4k-instruct-q4f16_1-MLC
Updated • 22.9k
mlc-ai/Phi-3-mini-4k-instruct-q4f16_2-MLC
mlc-ai/Phi-3-mini-128k-instruct-q0f32-MLC
mlc-ai/Phi-3-mini-128k-instruct-q4f16_2-MLC
mlc-ai/Llama-2-7b-chat-hf-q0f16-MLC
mlc-ai/stablelm-2-zephyr-1_6b-q0f16-MLC